期刊文献+

基于最优坐标系的表格版面分析

Table Layout Analysis Based on Optimal Coordinates System
下载PDF
导出
摘要 表格文档在日常生活中运用十分广泛,对这类文档进行计算机自动处理能提高文档处理速度和准确度,具有重要的现实意义。表格文档版面结构提取是文档信息处理自动化的核心。由于表格文档图像包含印刷体和手写体字符、图像、污损、噪声和一定的倾斜,在其影响下,正确的提取文档的版面结构是比较困难的。在总结国内外表格文档版面结构提取方法的基础上,提出了一种基于最优坐标系的版面结构提取方法,该方法与其它方法相比具有很强的抗干扰能力和文档版面定义灵活方便的特点。 Table documents are frequently used in daily life. Automatically handling this kind of documents by computer can not only save time but also offer high accuracy. The layout structure extraction of the table documents is the kernel of the automatic processing of table information. Because the images of table documents always consist of printed and handwritten characters, images, defiles, noises and tilts, it is difficult to extract the layout structure correctly. After summarizing the existing methods of layout structure extraction, this paper presents a new method based on optimal coordinates system to get the layout structure. The new method outperforms the traditional ones in noise - resisting and layout definition.
出处 《计算机仿真》 CSCD 2007年第4期211-215,共5页 Computer Simulation
关键词 版面分析 表格文档处理 最优坐标系 表格识别 Layout analysis Table document processing Optimal coordinates system Table recognition
  • 相关文献

参考文献6

  • 1K C Fan,J M Lu,J Y Wang.A Feature Pint Approach to the Segmentation of Form Documents.Proceedings of the 3rd International Conference on Document Analysis and Recognition.[J] Washington D.C.:IEEE Computer Press.1995.623-626.
  • 2D Wang,S N Srihari.Analysis of form images.Proceeding of the 1st International Conference on Document Analysis and Recognition[J].Washington D.C.:IEEE Computer Press,1991.181-191.
  • 3Y Belaid,A Belaid,E Turolla.Item Searching in forms:Application to French tax form.Proceeding of the 3rd International Conference on Document Analysis and Recognition[J].Washington D.C.:IEEE Computer Press.1995.744-747.
  • 4B Yu,A K Jain.A Generic System for Form Dropout[J].IEEE Transaction on Pattern Analysis and Machine Intelligence.1996,18(11):1127-1134.
  • 5李星原,高文.一种鲁棒性的结构未知表格分析方法[J].软件学报,1999,10(11):1216-1224. 被引量:4
  • 6吴洋,田学东.中文版面分析中表格的识别[J].河北工业科技,2002,19(2):40-41. 被引量:1

二级参考文献9

  • 1李星原.表格自动阅读研究(博士学位论文)[M].哈尔滨工业大学,1997..
  • 2张xin中.汉字识别技术[M].北京:清华大学出版社,1992.52-56.
  • 3章海涛 李志峰.一种基于直线提取和补全的通用表格分析方法.第七届全国汉字识别学术会议论文集[M].昆明,1999.103-108.
  • 4李星原,博士学位论文,1997年
  • 5Yu B,IEEE Trans Pattern Anal Machine Intell,1996年,18卷,11期,1127页
  • 6Fan K C,Proc 3rd International Conference on Document Analysis and Recognition,1995年,623页
  • 7Liu J,Proc 3rd International Conference on Document Analysis and Recognition,1995年,579页
  • 8Watanabe T,IEEE Trans Pattern Anal Machine Intell,1995年,7卷,4期,432页
  • 9Wang D,Proc lst lnternational Conference on Document Analysis and Recognition. AFCET- IRlSA / INRIA,1991年,181页

共引文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部