期刊文献+

基于连通域的扭曲中文文本图像快速校正方法 被引量:3

Fast correcting method based on connected components for warped Chinese document images
下载PDF
导出
摘要 针对扭曲中文文本图像文字识别率不理想这一问题,提出一种基于连通域的文本图像快速扭曲校正方法。根据汉字结构特征合并连通域,实现切分文字;利用就近聚合文字的方法定位文本行,按行垂直校正每个文字位置,获得被校正的图像。实验结果表明,该方法校正速度快,对严重扭曲的中文文本图像能取得较好的校正效果,校正后图像的OCR识别率明显提高。 Character recognition rate of OCR (optical character recognition)processing is not satisfactory for warped Chinese document image.To resolve this problem,a fast distortion correcting method based on connected components was proposed. First,the connected components were combined together according to the Chinese character structure characteristics.Next,the Chinese characters were segmented one by one according to the combined connected components.After that,the text lines were identified based on the nearest aggregation method.Then,the vertical positions of the segmented characters were corrected ac-cording to every text line.As a result,a well corrected document image was obtained.Experimental results demonstrate that this correcting method is fast and can segment the Chinese character accurately.The OCR rate of the corrected images can be sig-nificantly improved.Even for the obviously distorted Chinese document images,this method can achieve better results.
出处 《计算机工程与设计》 北大核心 2015年第5期1251-1255,共5页 Computer Engineering and Design
基金 国家自然科学基金项目(61371142) 国家科技支撑计划基金项目(2012BAH04F03) 北京市自然科学基金项目(4132026) 北京市科技创新平台基金项目(PXM2013_014212_000011)
关键词 中文文本图像 扭曲图像 连通域 文字切分 就近聚合 Chinese document image warped image connected components character segmentation nearest aggregation
  • 相关文献

参考文献11

  • 1HE Yuan, PAN Pan, XIE Shufu, et al. A book dewarping system by boundary-based 3D surface reconstruction [C] // 12th International Conference on Document Analysis and Recog nition, 2013: 403-407.
  • 2LI Zhang, Andy M Yip, Michael S Brown, et al. A unified framework for document restoration using inpainting and shape- from-shading [J].Pattern Recognit J, 2009, 42 (11): 2961-2978.
  • 3MENG Gaofeng, PAN Chunhong, XIANG Shiming, et al. Metric rectification of curved document images [J]. Pattern Analysis and Machine Intelligence, 2012, 34 (4): 707-722.
  • 4LIU Hong, YE Lu. A method to restore Chinese warped docu- ment images based on binding characters and building curved lines [C] //International Conference on Systems, Man and Cybernetics, 2009: 984-990.
  • 5Gatos B, Pratikakis I, Ntirogiannis K. Segmentation based re- covery of arbitrarily warped document images [C] //9th Inter- national Conference on Document Analysis and Recognition, 2007: 989-993.
  • 6宋丽丽,吴亚东,孙波.改进的文档图像扭曲校正方法[J].计算机工程,2011,37(1):204-206. 被引量:10
  • 7张伟业,赵群飞.读书机器人的版面分析及文字图像预处理算法[J].微型电脑应用,2011(1):58-61. 被引量:8
  • 8LIU Hong, DING Runwei. Restoring Chinese warped document images based on text boundary lines [C] //International Conference on Systems, Man and Cybernetics, 2009: 571-576.
  • 9TONG Liiing, ZHAN Guoliang, PENG Quanyao, et al. Warped document image mosaicing method based on inflection point detection and registration [C] //International Conference on Multimedia In- formation Networking and Security, 2012: 306-310.
  • 10罗志灶,周赢武,郑忠楷.二值图像连通域标记优化算法[J].安庆师范学院学报(自然科学版),2010,16(4):34-39. 被引量:19

二级参考文献30

  • 1张修军,郭霞,金心宇.带标记矫正的二值图象连通域像素标记算法[J].中国图象图形学报(A辑),2003,8(2):198-202. 被引量:43
  • 2唐矫燕,赵群飞,杨汝清,吴心然.读书机器人机构设计[J].上海交通大学学报,2005,39(12):2025-2028. 被引量:12
  • 3朱云芳,叶秀清,顾伟康.视频序列的全景图拼接技术[J].中国图象图形学报,2006,11(8):1150-1155. 被引量:19
  • 4徐正光,鲍东来,张利欣.基于递归的二值图像连通域像素标记算法[J].计算机工程,2006,32(24):186-188. 被引量:71
  • 5张森,赵群飞,冶建科.一种数字图像几何畸变的自动校正方法[J].机电一体化,2007,13(3):60-64. 被引量:8
  • 6Brown M S, Seales W B. Image Restoration of Arbitrarily Warped Documents[J]. IEEE Transactions on Pattern Analysis and Machine/ntelligence, 2004, 26(10): 1295-1306.
  • 7Fu Bin, Wu Minghui, Li Rongfeng, et al. A Model-based Book Dewarping Method Using Text Line Detection[C]//Proc. of the 2nd International Workshop on Camera-based Document Analysis and Recognition. Curitiba, Brazil: [s. n.], 2007.
  • 8Zhang Zheng, Tan Chew Lira. Restoration of Images Scanned from Thick Bound Documents[C]//Proc. of 2001 International Conference on Image Processing. Thessaloniki, Greece: [s. n.], 2001.
  • 9Gatos B, Pratikakis I, Ntirogiannis K. Segmentation-based Recovery of Arbitrarily Warped Document Images[C]//Proc. of the 9th International Conference on Document Analysis and Recognition. Curifiba, Brazil:[s. n.], 2007.
  • 10Gatos B, Pratikakis I, Perantonis S J. Adaptive Degraded Document Image Binarization[J]. Pattern Recognition, 2006, 39(3): 317-327.

共引文献30

同被引文献11

引证文献3

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部