A Novel Segmentation Technique for Urdu Type-Written Text

2018 Recent Advances on Engineering, Technology and Computational Sciences (RAETCS) (2018)

Abstract
Text segmentation is the process of subdividing a text image into its constituent parts, such as text lines, words, and isolated characters. It is the first module in the design of optical character recognition (OCR) systems. Automatic text segmentation is becoming an increasingly important problem. Major difficulties arise from the lack of a standard dataset, the wide diversity of objectives, and the lack of meaningful quantitative evaluation. This paper proposes a new technique that segments Urdu type-written text into text lines on the basis of the edge information of connected components. The technique is evaluated on a benchmark dataset using precision and recall metrics, achieving 87.36% and 84.75% respectively. Dataset collection, compilation, and organization are also part of this research.
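The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea of line segmentation from connected components, not the authors' method. It assumes a binarized page stored as a list of rows (1 = ink), treats the top and bottom edges of each component's bounding box as its "edge information", and merges components whose vertical extents overlap. The function names `connected_components`, `group_into_lines`, and `precision_recall` are hypothetical and chosen for illustration.

```python
from collections import deque


def connected_components(img):
    """Return bounding boxes (top, left, bottom, right) of 8-connected ink regions."""
    rows, cols = len(img), len(img[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if img[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited ink pixel.
            top, left, bottom, right = r, c, r, c
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and img[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def group_into_lines(boxes):
    """Merge boxes with overlapping vertical extents into (top, bottom) text lines."""
    lines = []
    for top, _, bottom, _ in sorted(boxes):
        if lines and top <= lines[-1][1]:
            # This component starts before the current line ends: same line.
            lines[-1][1] = max(lines[-1][1], bottom)
        else:
            lines.append([top, bottom])
    return [tuple(line) for line in lines]


def precision_recall(true_pos, false_pos, false_neg):
    """Standard precision and recall from detection counts."""
    precision = true_pos / (true_pos + false_pos)
    recall = true_pos / (true_pos + false_neg)
    return precision, recall


# Toy page (assumed data): two rows of ink separated by one blank row.
page = [
    [1, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 1],
    [0, 0, 0, 0, 0, 1],
]
text_lines = group_into_lines(connected_components(page))
```

On this toy page the sketch yields two lines, spanning rows 0 to 1 and rows 3 to 4. A plain overlap rule like this would likely struggle with real Urdu text, where diacritics sit above or below the main body and can fall between lines, which is presumably part of what a dedicated technique has to address.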
Keywords
connected component, cursiveness, context sensitivity, diacritic, edge detection, kerning, text segmentation