期刊文献+

一种印刷体文档内嵌数学公式提取方法的研究

RESEARCH ON AN EXTRACTION METHOD FOR MATHEMATICAL FORMULAS EMBEDDED IN PRINTED DOCUMENTS
下载PDF
导出
摘要 针对目前光学字符识别技术(OCR)较难实现对中文文献中的数学公式进行识别,提出一种改进算法来解决印刷体内嵌数学公式的识别问题。通过添加新的特征值进行文本行分类,对内嵌公式行按字符逐一分割,再从分类后的文本行中依次提取出数学公式。实验结果表明,该算法具有识别率高、高效特点,与现有同类算法比较,在解决中文印刷体的数学公式识别问题方面的优势明显。 Its difficult for optical character recognition( OCR) technology to recognise the mathematical formulas from Chinese electronic literatures at present. In light of this,we put forward an improved algorithm to solve the recognition problem with regard to the mathematical formula embedded in printed files. It classifies the lines of text by adding new eigenvalue,and segments the embedded formulas line to characters one by one,then extracts the mathematical formulas from the classified text lines in turn. Experimental results show that the new algorithm has the characteristics of high recognition rate and efficiency. Compared with existing similar algorithms,it has clear predominance in solving the problem of mathematical formulas recognition from Chinese prints.
出处 《计算机应用与软件》 CSCD 北大核心 2014年第4期102-105,110,共5页 Computer Applications and Software
关键词 数学公式提取 字符分割 公式定位 Mathematical formula extraction Character segmentation Formula positioning
  • 相关文献

参考文献7

二级参考文献37

  • 1R.H. ANDERSON. Syntax - directed Recognition of Hand - Printed Two - Dimensional mathematics[R]. In M. Klerer and J. Reinfelds, editors, Interactive Systems for Experimental Applied Matheties, Academic Press, New York,1968: 436 - 459
  • 2H.J. LEE and M. C. LEE, Understanding Mathematical Expression in a Printed Document [A]. In Proceedings of ICDAR' 93 [C]. Japan, 1993: 502 -505
  • 3A. KACEM, A. BELALD, and M. BEN AHMED, Extraction Automatique de Formules a partir d' images de Documents Scientifiques[A]. In Proceedings of RFIA,00,Paris- Fumce[C] ,2000.
  • 4J HA, R. M. HARALICK, and I. T. PHILLIPS. Understanding mathematical expressions from document images [A]. In Proceedings of ICDAR' 95[C]. Canada, 1995: 956 - 959
  • 5R.H.ANDERSON.Two- Dimensional Mathematical Notation[A] .In proceedings of Syntactic Pattern Recognition Applications[C]. K. S. Fu, Ed. Springer Verlag, New York, 1977:147- 177
  • 6H.J.LEE and J.S.WANG. Design of mathematical expression recognition system[A]. In Proceedings of ICDAR' 95 [C]. Canada, 1995:1084 - 1087
  • 7H. - J. LEE and J. S. WANG. Design of a mathematical expression recognition system[J]. Pattern Recognition Letters. 1997(18) :289 - 298
  • 8R. FATEMAN, T. TOKUYASU, B. BERMAN and N. MITCHELL. Optical Character Recognition and Parsing of Typeset Mathematics[J]. In J. of Visual Commun. And Image Representation, 1996,7 ( 1 ): 2 - 15
  • 9R Fateman,T Tokuyasu,B Berman.Optical Character Recognition and Parsing of Typeset Mathematics[J].Joumal of Visual Communication and Image Representation, 1996;7(1 ) :2-15.
  • 10A KACEM,A BELAID,M Ben AHMED.Automatic extraction of printed mathematical formulas using fuzzy logic and propagation of context[J]. IJDAR(4) ,2001 ; (2) :97-108.

共引文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部