摘要
针对维吾尔文手写体文本中行分割问题,基于连通域大小将图像中文字分为三类,提出了自适应涂抹细化算法,对主体文本行进行定位;并对第三类连通域中相邻两文本行间粘连的字符进行切割;此外,利用重心范围内的邻域搜索算法,解决了剩余笔画的文本行归附问题。实验结果表明,该方法与常见的水平投影法,分段投影法,及涂抹方法相比具有更好的分割效果。
To deal with the issues of text lines segmentation in Uyghur handwritten documents, based on the size of con-nected components, this paper divides the text image into three categories. In order to get the location of main text-line, it proposes adaptive painting and thinning algorithm. Furthermore, it separates connected characters and assigns them to text lines. In addition, by using the neighborhood search algorithm based on center of gravity, it solves the belonging problems for remaining small strokes. Experimental results show that separately compared with those horizontal projection based, piecewise projection based, smearing based segmentation methods, this method has better text line segmentation results indeed.
出处
《计算机工程与应用》
CSCD
2014年第18期142-146,共5页
Computer Engineering and Applications
基金
国家自然科学基金(No.61065001)
教育部新世纪优秀人才支持计划资助项目(No.NCET-10-0969)
新疆维吾尔自治区科技厅少数民族特殊培养计划项目(No.201023116)
关键词
维吾尔文
手写体文本
文本行分割
重心
邻域
Uyghur
handwritten documents
text line segmentation
center of gravity
neighborhood