期刊文献+

基于灰度图像的表格框线去除算法 被引量:9

A Form Frame Line Removal Algorithm Based on Gray-Level Image
下载PDF
导出
摘要 笔画与表格框线的交叠的现象在表格型文档中普遍存在,严重影响了文档自动处理系统的性能.现有的去线算法大部分都是基于二值图像的,许多有用的局部信息已经丢失.提出了直接利用图像灰度信息的灰值线检测与去除算法.首先利用图像的边缘特征检测直线以及字线的相交位置;然后通过对直线上相交点对的分析确定字线的交叠方式,并将这些方式归纳为穿透和未穿透两类简单的形式;最后将直线划分为保护区和擦除区两部分,保护区内的像素在去线过程中被保留,而擦除区内的像素则利用灰度形态学算法来擦除.在我国现行支票上的实验表明算法是有效的. Preprocess procedure is an important procedure in a document image analysis (DIA) system. In practical document images, characters usually overlap with the preprinted form frames, creating tremendous problems for the recognition engines. Most of the form frame line removal algorithms are based on bi-level images, which have lost much useful information during the binary stage. Proposed in this paper is a line removal algorithm directly based on gray-level images. First, cross-points of characters and lines are detected by Soble gradient. Then the overlapping types of characters and lines are converted into touch type or crossover type by cross-points analysis. Finally, lines are removed with topological method. Experiment results on 1225 real life character string images demonstrate the efficiency of this algorithm. The recognition rate is improved from 75.9% to 91.4%.
出处 《计算机研究与发展》 EI CSCD 北大核心 2005年第4期635-639,共5页 Journal of Computer Research and Development
基金 高等学校博士点科研基金项目(20020288013)
关键词 文档处理 表格处理 直线检测 直线去除 document processing form processing line detection line removal
  • 相关文献

参考文献10

  • 1胡钟山,娄震,杨静宇.文档处理中消除线噪声的研究[J].计算机研究与发展,1999,36(8):992-995. 被引量:11
  • 2Xiangyun Ye, Mohamed Cheriet, Ching Y. Suen, et al.Extraction of bankcheck items by mathematical morphology. J.Doc. Anal. Recognit., 1999, 2(2): 53~66.
  • 3Xiangyun Ye, Mohamed Cheriet, Ching Y. Suen. A generic method of cleaning and enhancing handwritten data from business forms. J. Doc. Anal. Recognit., 2001, 4(2): 84~96.
  • 4Bin Yu, Jain, A. K. A generic system for form dropout. IEEE Trans. Pattern Analysis and Machine Intelligence. 1996, 18(11): 1127~1134.
  • 5Jain-Shiue Chen, Din-Chang Tseng. Overlapped-character separation and reconstruction for table-form documents. Int'1 Conf. Image Processing, Lausanne, Switzerland, 1996.
  • 6J.M. Gloger. Use of the hough transform to separate merged text/graphics in forms. Int'l Conf. 11th IAPR, Hague,Netherlands, 1992.
  • 7S. Naoi, Y. Hotta, M. Yabuki, et al. Global interpolation in the segmentation of handwritten characters overlapping a border. The 1st Int'l Conf. Image Processing, Austin, TX, USA, 1994.
  • 8Yi-Hong Tseng, Hsi-Jian Lee. Interfered-character recognition by removing interfering-lines and adjusting feature weights.Fourteenth Int' l Conf. Pattern Recognition, Brisbane, Qld,Austrialia, 1998.
  • 9Jin-Yong Yoo, Min-Ki Kim, Sang Yong Ban, et al. Line removal and restoration of handwritten characters on the form documents.The 4th Int'l Conf. Document Analysis and Recognition, Ulm,Germany, 1997.
  • 10郑冶枫,刘长松,丁晓青.线宽阈值法去除表格框线[J].模式识别与人工智能,2001,14(2):206-210. 被引量:6

二级参考文献8

  • 1任鲲鹏.表格中字符块的提取.第七届全国汉字识别会议论文集[M].昆明,1999.147-153.
  • 2Impedovo S,Automatic Bankcheck Processing,1997年
  • 3Liu Ke,Automatic Bankcheck Processing,1997年,213页
  • 4Chung Youngtae,Proc ICPR’96,1996年,155页
  • 5Mori Shunji,IEEE Proc,1992年,80卷,7期,1029页
  • 6任鲲鹏,第七届全国汉字识别会议论文集,1999年,147页
  • 7Yu B,IEEE Trans on Pattern Analysis & Machine Intelligence,1996年,18卷,1期,1127页
  • 8胡钟山,娄震,杨静宇,刘克,孙靖夷.基于多分类器组合的手写体数字识别[J].计算机学报,1999,22(4):369-374. 被引量:35

共引文献13

同被引文献95

引证文献9

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部