期刊文献+

分段Radon变换的弯曲文本基线提取

Extraction of Text Baselines from Curved Document Images Using Radon Transform
下载PDF
导出
摘要 针对提取的文本基线不能很好贴合文本边缘,影响弯曲文档图像几何纠正效果,文中提出了一种基于Radon变换和连通域方向的弯曲文本基线提取方法.首先通过分析连通域附近不同距离内像素分布变化情况,将图像分割成横排区域和竖排区域;其次把区域分成窄形条状子图,利用局部合并连通域的方向和Radon变换得到行连通域并提取基直线,各子图基直线合并拟合区域弯曲基线.实验比较显示,文中方法能适应不同弯曲、稀疏程度的文档图像,所提曲线较好地吻合文本边缘,可以应用于弯曲文档图像几何纠正、光学字符识别中. Extracted virtual baselines cannot fit text edges very well,influence geometric rectification of curved document image,an extraction method of warped text baselines based on Radon transform and orientations of Connected Components(CCs) was proposed.By analyzing pixel quantities variation around CCs,it firstly segmented binary images into vertical and horizontal alignment regions,then divided each of regions into a sequence of overlapping vertical strips,distinguished text lines using Radon transform in the range of locally merged CCs' orientations and extracted linear baselines on strips,connected baselines between strips by cubic polynomials.By comparison,the method suited differently bent and sparse document images,extracted curves fitted text baselines better. It can be applied to geometric rectification of curved document image,Optical Character Recognition(OCR) system.
作者 罗晓萍 朱金好 LUO Xiao-ping;ZHU Jin-hao(School of Computer and Information,Anhui Normal University,Wuhu 241003,China;Department of Medical Information,Wannan Medical College,Wuhu 241002,China)
出处 《小型微型计算机系统》 CSCD 北大核心 2018年第12期2699-2704,共6页 Journal of Chinese Computer Systems
关键词 文本基线 连通域重心 连通域倾斜方向 RADON变换 曲线拟合 text baseline centroid of connected component orientation of connected component radon transform curve fitting
  • 相关文献

参考文献3

二级参考文献14

  • 1张伟业,赵群飞.读书机器人的版面分析及文字图像预处理算法[J].微型电脑应用,2011(1):58-61. 被引量:8
  • 2Liu Hong,Ye Lu.A method restore Chinese warped document imagesbased on binding characters and building curved lines [C]International Conference on System s, Man and Cybernetics:ICSM C2009:2009:989-993.
  • 3Li Zhang, Yip Andy M,Brown M ichael S,et al.A unified framework fordocum ent restoration using inpainting and shape-from-shading[J].PatternRecognition ,2009,42(11):2961-2978.
  • 4Liu Hong,Ding Runwei. International Conference on Systems Man and Cybernetics [C] ICSMC 2009:Restoring Chinese warped docum entimages based on text boundary lines,2009.
  • 5Zhang Shengnan, Yuan Shanlei,Niu Lianqiang.Automatic Recognition Method for Checkbox in Data Form Image [C]Sixth International Conference on Measuring Technology and Mechatronics Automation,2014:159-162 .
  • 6Hamed Behin ,Afsh in Ebrahimi,Sepideh Ebrahimi.Incorporated Preprocessingand Physical Layout Analysis of a Binary Document Image Using a Two Stage Classification [C]International Conference on Computer and Communication Engineering:ICCCE2010:2010.
  • 7付芦静,钱军浩,钟云飞.基于汉字联通分量的印刷图像版面分割方法[J/OL].计算机工程与应用,2013,49(3):4[2013-07-31].1^-tp://www.cnki.net/kems/detail/11.2127.TP.20130731.1817.001.html.
  • 8Amir Reza Ghods,Saeed Mozaffari, Farhad Ahmadpanahi.Document ImageDewarping using Kinect Depth Sensor [C] 21st Iranian Conference, Electrical Engineering:ICEE2013:2014:1-6 .
  • 9Tong Lijing,Zhang Guoliang, Peng Quanyao,et al.Warped document imagemosaicing method based on inflection point detection and registration, International Conference on Multimedia Information Networking and Security MINES2012:November 2-4 ,2012[C] Nanjing, 2012:306-310.
  • 10宋丽丽,吴亚东,孙波.改进的文档图像扭曲校正方法[J].计算机工程,2011,37(1):204-206. 被引量:10

共引文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部