期刊文献+

基于混合二值化的表格手写数字串的完整提取

Integrated Extraction of Handwritten Numeral Strings in Form Document Based on Hybrid Binarization
原文传递
导出
摘要 对表格手写数字串的提取问题进行研究,提出一种基于混合二值化的单元格字符准确定位和完整提取方法,其核心是感兴趣单元格的定位与提取和断裂笔划的修复.该方法可克服书写时带来的各种常规影响,把表格中的手写数字完整提取出来.实验结果表明本文方法的有效性. The handwritten numeral string extraction in form document is studied. A method is proposed to effectively discern and capture the characters from overlapping borders based on hybrid binarization. Two key problems are investigated in detail including the location and the extraction on the cell of interest (COI) with broken strokes mended. The extracted handwritten characters remain integrated even for characters in different writing styles. Experimental results demonstrate that the proposed method is efficient.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2008年第3期369-375,共7页 Pattern Recognition and Artificial Intelligence
基金 国家自然科学基金资助项目(No.60475042,10631080)
关键词 表格手写数字串 二值化 字符提取 Handwritten Numeral Strings in Form Document, Binarization, Character Extraction
  • 相关文献

参考文献26

  • 1Mori S, Suen C Y, Yamamoto K. Historical Review of OCR Research and Development. Proc of the IEEE, 1992, 80(7) : 1029 - 1058
  • 2Rodriguez C, Mugucrza J, Navarro M, et al. A Two-Stage Classifier for Broken and Blurred Digits in Forms// Proc of the 14th International Conference on Pattern Recognition. Brisbane, Australia, 1998, Ⅱ: 1101-1105
  • 3Naoi S, Yabuki M. Global Interpolation Method Ⅱ for Handwritten Numbers Overlapping a Border by Automatic Knowledge Acquisition of Overlapped Conditions//Proc of the 4th International Conference on Document Analysis and Recognition. Ulm, Germany, 1997, Ⅱ : 540 - 543
  • 4郑冶枫,刘长松,丁晓青.线宽阈值法去除表格框线[J].模式识别与人工智能,2001,14(2):206-210. 被引量:6
  • 5Yoo J Y, Kim M K, Han S Y, et al. Line Removal and Restoration of Handwritten Characters on the Form Documents//Proc of the 4th International Conference on Document Analysis and Recognition.Ulm, Germany, 1997,Ⅰ: 128-131
  • 6Chung Y, Lee K, Yaik J, et al. Extraction and Restoration of Digits Touching or Overlapping Lines // Proc of the 13th International Conference on Pattern Recognition. Vienna, Australia, 1996, Ⅲ: 155 - 159
  • 7Tseng Y H, Lee H J. Interfered-Character Recognition by Removing Interfering-Lines and Adjusting Feature Weights// Proc of the 14th International Conference on Pattern Recognition. Brisbane, Australia, 1998, Ⅱ: 1865 -1867
  • 8胡钟山,娄震,杨静宇.文档处理中消除线噪声的研究[J].计算机研究与发展,1999,36(8):992-995. 被引量:11
  • 9Hori O, Doermann D S. Robust Table-Form Structure Analysis Based on Box-Driven Reasoning // Proc of the 3rd International Conference on Document Analysis and Recognition. Montreal, Canada, 1995, Ⅰ: 218-221
  • 10张重阳,陈强,娄震,杨静宇.基于灰度图像的表格框线去除算法[J].计算机研究与发展,2005,42(4):635-639. 被引量:9

二级参考文献18

  • 1任鲲鹏.表格中字符块的提取.第七届全国汉字识别会议论文集[M].昆明,1999.147-153.
  • 2Impedovo S,Automatic Bankcheck Processing,1997年
  • 3Liu Ke,Automatic Bankcheck Processing,1997年,213页
  • 4Chung Youngtae,Proc ICPR’96,1996年,155页
  • 5Mori Shunji,IEEE Proc,1992年,80卷,7期,1029页
  • 6任鲲鹏,第七届全国汉字识别会议论文集,1999年,147页
  • 7Yu B,IEEE Trans on Pattern Analysis & Machine Intelligence,1996年,18卷,1期,1127页
  • 8Xiangyun Ye, Mohamed Cheriet, Ching Y. Suen, et al.Extraction of bankcheck items by mathematical morphology. J.Doc. Anal. Recognit., 1999, 2(2): 53~66.
  • 9Xiangyun Ye, Mohamed Cheriet, Ching Y. Suen. A generic method of cleaning and enhancing handwritten data from business forms. J. Doc. Anal. Recognit., 2001, 4(2): 84~96.
  • 10Bin Yu, Jain, A. K. A generic system for form dropout. IEEE Trans. Pattern Analysis and Machine Intelligence. 1996, 18(11): 1127~1134.

共引文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部