期刊文献+

表格型票据中框线检测与去除算法 被引量:5

Extraction and Removal of Frame Line in Form Bill
下载PDF
导出
摘要 字符笔画与表格线的粘连或交叠是表格型票据中普遍存在的现象,严重影响了后期票据自动识别处理的性能.现有方法大多基于二值图像,未能充分利用灰度图中的框线特征.基于票据图像中的框线特征,提出一种表格型票据预处理中的框线检测与去除算法,首先充分利用票据灰度图像的特点准确地检测出框线,再采用一种连通链结构描述叠加后的框线区域,然后对交叠进行判断和标记,根据标记保留字符笔划去除框线干扰.经过实际银行支票图像测试证明了算法的有效性和鲁棒性. In practical form bill images, characters usually overlap with the form frames, which will greatly affect the performance of the document image auto-processing system. Most of the form frame line removal algorithms are based on binary images, which can not make good use of line characteristics in gray images. According to the attribute of financial documents ' structure, an improved line detection and removal algorithm applied in financial form image pre-processing is proposed in this paper. In order to reduce the complexity and improve the effect of line removal, the process of line detection and removal are carried out respectively. First, frame lines are exactly detected according to the line characteristics in gray images. Then chain code method is used to describe the frame line region. Cross-points of characters and lines are detected subsequently with deterministic finite automaton in order to analyse the overlapping types. Finally, frame lines are removed with the marks in cross-points detection. Therefore, the limitation of stroke aberrance caused by thresholding is overcome and higher accuracy of line removal can be achieved. The results of experiment demonstrate that compared with different existing methods based on handwritten digit character recognition, the proposed algorithm is efficient and robust.
出处 《计算机研究与发展》 EI CSCD 北大核心 2008年第5期909-914,共6页 Journal of Computer Research and Development
基金 国家自然科学基金重点项目(60632050) 国家“八六三”高技术研究发展计划基金项目(2006AA01Z119)~~
关键词 文档分析 表格识别 直线检测 连通链结构 框线去除 document analysis form recognition line detection chain code frame line removal
  • 相关文献

参考文献15

  • 1Mark C K Yang,et al.Hough transform modified by line connectively and line thickness[J].IEEE Trans on PAMI,1997,19(8):905-910
  • 2Nitin Aggarwal,William Clem Karl.Line detection in images through regularized hough transform[J].IEEE Trans on Image Processing,2006,15(3):582-591
  • 3P Nacken.A metric for line segments[J].IEEE Trans on PAMI,1993,15(12):1312-1318
  • 4Bin Yu,Ani K Jain.A generic system for form dropout[J].IEEE Trans on PAMI,1996,18(11):1127-1134
  • 5郑冶枫,刘长松,丁晓青,潘世言.基于有向单连通链的表格框线检测算法[J].软件学报,2002,13(4):790-796. 被引量:23
  • 6Yefeng Zheng,Huiping Li,David Doermann.A parallel-line detection algorithm based on HMM decoding[J].IEEE Trans on PAMI,2005,27(5):777-792
  • 7Chun-Ta Ho,Ling-Hwei Chen,A high-speed algorithm for line detection[J].Pattern Recognition Letters,1996,17(5):467-473
  • 8刘长松,潘世言,郑冶枫,丁晓青.一种表格框线检测和字线分离算法[J].电子与信息学报,2002,24(9):1190-1196. 被引量:11
  • 9Xiangyun Ye,Mohamed Cheriet,Ching Y Suen.A generic method of cleaning and enhancing handwritten data from business forms[J].Journal of Doctor Analects Recognition,2001,4(2):84-96
  • 10Jain-Shiue Chen,Din-Chang Tseng.Overlapped-character separation and reconstruction for table-form documents[C].Int'l Conf on Image Processing,Lausanne,Switzerland,1996

二级参考文献26

  • 1任鲲鹏.表格中字符块的提取.第七届全国汉字识别会议论文集[M].昆明,1999.147-153.
  • 2刘今晖.印刷表格自动输入数据库的研究与实现.硕士学位论文[M].清华大学,1992..
  • 3Impedovo S,Automatic Bankcheck Processing,1997年
  • 4Liu Ke,Automatic Bankcheck Processing,1997年,213页
  • 5Chung Youngtae,Proc ICPR’96,1996年,155页
  • 6Mori Shunji,IEEE Proc,1992年,80卷,7期,1029页
  • 7任鲲鹏,第七届全国汉字识别会议论文集,1999年,147页
  • 8Yu B,IEEE Trans on Pattern Analysis & Machine Intelligence,1996年,18卷,1期,1127页
  • 9Xiangyun Ye, Mohamed Cheriet, Ching Y. Suen, et al.Extraction of bankcheck items by mathematical morphology. J.Doc. Anal. Recognit., 1999, 2(2): 53~66.
  • 10Xiangyun Ye, Mohamed Cheriet, Ching Y. Suen. A generic method of cleaning and enhancing handwritten data from business forms. J. Doc. Anal. Recognit., 2001, 4(2): 84~96.

共引文献42

同被引文献34

引证文献5

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部