期刊文献+

表格型文档自动识别系统及其应用 被引量:2

Automatic Recognition System of Tabular Document and Its Application
下载PDF
导出
摘要 随着文档影像系统的广泛应用,文档图像自动处理已成为当前的一个研究热点。对表格型文档自动识别系统中的若干关键技术进行了研究。首先,在版面分析中,提出了基于框线检测的文档分类方法;其次,根据表格型文档图像的特点,介绍了相应的识别域提取、框线去除以及手写字符串分割方法;最后,在手写数字识别部分,设计了一种基于形状上下文特征和梯度特征的组合识别方法。最后将该系统应用于银行票据小写金额识别,通过真实表格型票据进行仿真实验,证明了系统的有效性,系统识别率达到了实用的水平。 With the widely use of document image system, the automatic processing of document images has become a hot topic nowadays. Several pivotal techniques of the form document auto-processing system were emphatically discussed. Firstly, a document image classification method was adopted based on frame line detection in layout analysis. Secondly, corresponding algorithms were proposed on the basis of the characteristic of form document image, such as the pick-up of identification regions, frame line detection and removal and segmentation of handwritten character string. Finally, a combined recognition method based on shape context feature and gradient feature was designed during the part of handwritten digit recognition. The results of emulational experiment on real financial bill images illustrate the validity and practicability of the system.
出处 《系统仿真学报》 CAS CSCD 北大核心 2009年第10期2916-2920,共5页 Journal of System Simulation
基金 国家自然科学基金(60632050 60503026) 863计划(2006AA01Z119)
关键词 表格型文档 框线检测 框线去除 文档图像分析 手写数字识别 tabular document frame line detection frame line removal document image analysis handwritten digit recognition
  • 相关文献

参考文献13

  • 1George Nagy. Twenty Years of Document Image Analysis in PAMI [J]. IEEE Trans. on PAMI (S0162-8828), 2000, 22(1): 38-62.
  • 2N Otsu. A Threshold Selection Method from Gray-level Histogram [C]//IEEE on SMC-9, 1979. USA: IEEE, 1979, 3: 62-66.
  • 3饶晓波,邹北骥.一种基于块邻接图的手写体文本格线删除及笔画重构算法[J].中国图象图形学报,2006,11(4):549-554. 被引量:1
  • 4郑冶枫,刘长松,丁晓青.线宽阈值法去除表格框线[J].模式识别与人工智能,2001,14(2):206-210. 被引量:6
  • 5张重阳,陈强,娄震,杨静宇.基于灰度图像的表格框线去除算法[J].计算机研究与发展,2005,42(4):635-639. 被引量:9
  • 6Chen Yikai. Segmentation of single- or multiple-touching handwritten numeral string using background and foreground analysis [J]. IEEE Transactions on PAMI (S0162-8828), 2000, 22(11): 1304-1317.
  • 7Pal U, Belaid A, Cchoisy Ch. Touching numeral segmentation using water reservoir concept [J]. Pattern Recognition Letters (SO 167-8655), 2003, 24(1): 261-272.
  • 8Yu D, Yah H. An Efficient Algorithm for Smoothing, Linearization and Detection of Structure Feature Points of Binary Image Contours [J]. Pattern Recognition (S0031-3203), 1997, 30(1): 57-69.
  • 9Teow L N, Loe K F. Robust vision-based features and classification schemes for off-line handwritten digit recognition [J]. Pattern Recognition (S0031-3203), 2002, 35(3): 2355-2364.
  • 10苗夺谦,张红云,李道国,王真.基于主曲线的脱机手写数字识别[J].电子学报,2005,33(9):1639-1643. 被引量:14

二级参考文献41

  • 1王珏,苗夺谦,周育健.关于Rough Set理论与应用的综述[J].模式识别与人工智能,1996,9(4):337-344. 被引量:264
  • 2任鲲鹏.表格中字符块的提取.第七届全国汉字识别会议论文集[M].昆明,1999.147-153.
  • 3任鲲鹏,第七届全国汉字识别会议论文集,1999年,147页
  • 4Yu B,IEEE Trans on Pattern Analysis & Machine Intelligence,1996年,18卷,1期,1127页
  • 5Xiangyun Ye, Mohamed Cheriet, Ching Y. Suen, et al.Extraction of bankcheck items by mathematical morphology. J.Doc. Anal. Recognit., 1999, 2(2): 53~66.
  • 6Xiangyun Ye, Mohamed Cheriet, Ching Y. Suen. A generic method of cleaning and enhancing handwritten data from business forms. J. Doc. Anal. Recognit., 2001, 4(2): 84~96.
  • 7Bin Yu, Jain, A. K. A generic system for form dropout. IEEE Trans. Pattern Analysis and Machine Intelligence. 1996, 18(11): 1127~1134.
  • 8Jain-Shiue Chen, Din-Chang Tseng. Overlapped-character separation and reconstruction for table-form documents. Int'1 Conf. Image Processing, Lausanne, Switzerland, 1996.
  • 9J.M. Gloger. Use of the hough transform to separate merged text/graphics in forms. Int'l Conf. 11th IAPR, Hague,Netherlands, 1992.
  • 10S. Naoi, Y. Hotta, M. Yabuki, et al. Global interpolation in the segmentation of handwritten characters overlapping a border. The 1st Int'l Conf. Image Processing, Austin, TX, USA, 1994.

共引文献24

同被引文献21

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部