期刊文献+

数学公式识别系统:MatheReader 被引量:13

Mathematical Expression Recognition System:MatheReader
下载PDF
导出
摘要 数学公式广泛存在于各类文献之中,但是公式的识别远比文字段落的识别困难.文章介绍了一个数学公式图像识别系统MatheReader,重点阐述了其在公式定位及公式分析方面的技术方案.在公式定位方面,抽取版式特征,采用Parzen分类器区分独立公式和普通文字行,在普通文字行内检测二维结构定位内嵌公式.在公式分析方面,定义十一种基本公式类型,并用产生式规则限定每类公式的唯一分解方法,提出先识别公式类型,然后分解为子表达式的公式分析方法.和已有系统比较,MatheReader的功能更加强大,能够处理的公式更加丰富. Numerous mathematical expressions exist in all kinds of documents, but expression recognition is far more difficult than ordinary text recognition. A mathematical expression recognition system, MatheReader, is presented in this paper, and the detail schemes of expression extraction and expression analysis are described. For expression extraction, isolated expressions and normal text lines are distinguished by Parzen classifier based on layout features and embedded expressions are extracted by 2 D structures detection. For expression analysis, eleven basic expression types are defined, and the unique decomposition way for each type is defined by a set of production rules. The expression analysis scheme is proposed with recognizing expression type at first, and then decomposing the expression into sub-expressions according to the expression type. MatheReader is more powerful and can recognize more kinds of expressions than former systems.
出处 《计算机学报》 EI CSCD 北大核心 2006年第11期2018-2026,共9页 Chinese Journal of Computers
基金 国家自然科学基金天元基金(TY10026002-04-04-01)资助.
关键词 公式定位 公式识别 公式分析 自动性能评估 文档图像处理 expression extraction expression recognition expression analysis automatic performance evaluation document image processing
  • 相关文献

参考文献20

  • 1靳简明,江红英,王庆人.数学公式图像处理综述[J].模式识别与人工智能,2005,18(4):429-440. 被引量:7
  • 2Lee Hsi-Jian,Lee Min-Chou.Understanding mathematical expressions using procedure-oriented transformation.Pattern Recognition,1994,27(3):447~457
  • 3Lee Hsi-Jian,Wang Jiumn-Shine.Design of a mathematical expression recognition system.Pattern Recognition Letters,1997,18(8):289~298
  • 4Berman B.P.,Fateman R.J..Optical character recognition for typeset mathematics.In:Proceedings of the 1994 International Symposium on Symbolic and Algebraic Computation,Oxford,UK,1994,348~353
  • 5Fateman R.J.,Tokuyasu Taku,Berman B.P.,Mitchell Nicholas.Optical character recognition and parsing of typeset mathematics.Journal of Visual Communication and Image Representation,1996,7(1):2~15
  • 6Fateman R.J.,Tokuyasu Taku.Progress in recognizing typeset mathematics.In:Proceedings of the SPIE Conference on Document Recognition Ⅲ,San Jose,CA,1996,2600:37~50
  • 7Twaakyondo H.M.,Okamoto Masayuki.Structure analysis and recognition of mathematical expressions.In:Proceedings of the 3rd International Conference on Document Analysis and Recognition,Montréal,Canada,1995,430~437
  • 8Okamoto Masayuki,Imai Hiroki,Takagi Kazuhiko.Performance evaluation of a robust method for mathematical expression recognition.In:Proceedings of the 6th International Conference on Document Analysis and Recognition,Seattle,Washington,USA,2001,121~128
  • 9Toumit J.Y.,Garcia-Salicetti S.,Emptoz H..A hierarchical and recursive model of mathematical expressions for automatic reading of mathematical documents.In:Proceedings of the 5th International Conference on Document Analysis and Recognition,Bangalore,India,1999,119~122
  • 10Kacem A.,Belaid A.,Ahmed M.B..EXTRAFOR:Automatic EXTRAction of mathematical FORmulas.In:Proceedings of the 5th International Conference on Document Analysis and Recognition,Bangalore,India,1999,527~530

二级参考文献49

  • 1Anderson R H. Syntax-Directed Recognition of Hand-Printed Two-Dimensional Mathematics. In: Klerer M, Reinfelds J, eds. Interactive Systems for Experimental Applied Mathematics. New York, USA: Academic Press, 1968, 436-459.
  • 2Nouzumi S, Inoue K, Miyazaki R, Suzuki M. Optical Recognitlon System of Printed Japanese Mathematical Documents. In:Proc of the 3rd IAPR Workshop on Document Analysis Systems. Nagano, Japan, 1998, 197-200.
  • 3Anderson R H. Syntax-Directed Recognition of Hand-Printed Two-Dimensional Mathematics. Ph. D Dissertation. Harvard University, Cambridge. USA, 1968.
  • 4Chang S K. A Method for the Structural Analysis of Two-Dimensional Mathematical Expressions. Information Sciences,1970, 2(3): 253-272.
  • 5Anderson R H. Two-Dimensional Mathematical Notations. In:Fu K S, ed. Syntactic Pattern Recognition Applications. New York, USA: Springer-Verlag, 1977. 147-177.
  • 6Belaid A, Haton J P. A Syntactic Approach for Handwritten Mathematical Formula Recognition. IEEE Trans on Pattern Analysis and Machine Intelligence, 1984, 6(I): 105-111.
  • 7Wang Z X, Faure C. Structural Analysis of Handwritten Mathematical Expressions. In: Proc of the 9th International Conference on Pattern Recognition. Rome, Italy. 1988, 32-34.
  • 8Chou P A. Recognition of Equations Using a Two-Dimensional Stochastic Context-Free Grammar. In: Proc of SPIE Conference on Visual Communicational and Image Processing. Philadelphia,USA, 1989, Ⅳ:852-863.
  • 9Faure C, Wang Z X. Automatic Perception of the Structure of Handwritten Mathematical Expressions. In: Plamondon R,Leedham C G, eds. Computer Processing of Handwriting. Singapore: World Scientific Publishing Company, 1990. 337-361.
  • 10Toumit J Y, Emptoz H. From the Segmentation to the Reading of a Mathematical Document. In: Proc of the Conference on Machine Graphics and Vision. Borki, Poland, 1998, 483-504.

共引文献6

同被引文献122

引证文献13

二级引证文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部