
Adaptive Distance-Weighted Character and Form Line Separating Algorithm (基于距离加权的自适应字线分离算法) Cited by: 2
Abstract: This paper proposes an adaptive character-line separation algorithm based on distance weighting. Using a set of heuristic rules, it computes a weight for each pixel on a form line and compares that weight with a threshold to decide whether the pixel belongs to a character. Both the weights and the threshold are determined automatically from the form being processed. The algorithm is independent of the form-line detection method and is easy to implement. Experiments show that it handles overlaps between characters and form lines well and improves the accuracy of form recognition.
Source: Computer Engineering (《计算机工程》), indexed in CAS, CSCD, and the Peking University Core Journals list; 2007, No. 4, pp. 206-208 (3 pages).
Keywords: document analysis and recognition; form recognition; separation of character and line; optical character recognition (OCR)
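
The abstract describes the method only in outline (weight each form-line pixel, compare with a threshold, derive both from the form itself) and gives no formulas. The following is therefore a minimal sketch of that general idea, not the authors' algorithm. The function name `separate_line`, the inverse-distance weights, the neighbourhood depth equal to the line thickness, and the threshold at half of the one-side maximum weight are all assumptions made for illustration.

```python
def separate_line(img, top, bottom):
    """Erase a horizontal form line spanning rows top..bottom (inclusive),
    keeping line pixels that appear to belong to a character stroke.

    img is a list of rows of 0/1 values, where 1 means ink.
    All weighting and threshold rules here are illustrative assumptions.
    """
    height, width = len(img), len(img[0])
    thickness = bottom - top + 1

    # Assumption: look as far above/below the line as the line is thick,
    # so the neighbourhood adapts to the form being processed.
    reach = thickness

    # Assumption: ink at distance d contributes 1/d to the weight.
    max_side = sum(1.0 / d for d in range(1, reach + 1))

    # Assumption: threshold is half of the largest weight one side can give.
    threshold = 0.5 * max_side

    out = [row[:] for row in img]  # work on a copy, leave the input intact
    for x in range(width):
        above = sum(1.0 / d for d in range(1, reach + 1)
                    if top - d >= 0 and img[top - d][x])
        below = sum(1.0 / d for d in range(1, reach + 1)
                    if bottom + d < height and img[bottom + d][x])
        weight = above + below
        if weight < threshold:
            # Little nearby ink: treat this column as pure form line.
            for y in range(top, bottom + 1):
                out[y][x] = 0
    return out


# A vertical stroke crossing a one-pixel-thick form line on row 2.
form = [
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
]
cleaned = separate_line(form, 2, 2)
print(cleaned[2])  # -> [0, 0, 1, 0, 0]
```

In this sketch the form line's position is passed in rather than detected, which mirrors the abstract's claim that the separation step does not depend on how the line was found.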