期刊文献+

欧洲文字识别方法研究

Research on European Character Recognition
原文传递
导出
摘要 其它欧洲文字识别系统与英文OCR系统的主要差异在于字符集。因此,在当前英文OCR系统已经非常成熟的情况下,欧洲文字识别系统构造的关键在于欧洲文字识别。本文将欧洲文字的字符集分为两部分:英文字符和特殊字符。如何避免英文字符和特殊字符之间的混识以及如何提高特殊字符的识别率是本文的主要贡献。实验结果表明,本文提出的解决方案是行之有效的,系统识别率明显高于以往的欧洲文字识别系统。另外,本文提出的一些思想可以推广到任何相似符号的区分上。 The main difference between English OCR system and other European OCR systems is character set . Therefore , European OCR system construction mainly depends on European character recognition . European character set is divided into two parts in this paper : English characters and special characters. Two key problems are considered, i.e. how to decrease the misclassification rate between English characters and special characters, and how to improve the recognition accuracy for special characters. Experimental result shows that the new system is more effective than the previous ones . Furthermore , the ideas proposed in this paper can be generalized to distinguish any similar symbols.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2006年第4期491-496,共6页 Pattern Recognition and Artificial Intelligence
关键词 人工智能 文档图像处理 光学字符识别 Artificial Intelligence, Document Image Processing, Optical Character Recognition
  • 相关文献

参考文献12

  • 1Rice S V, Jenkins F R, Nartker T A. The Fifth Annual Test of OCR Accuracy. Technical Report, TR-96-01, Information Science Research Institute, University of Nevada, Las Vegas,USA, 1996
  • 2王恺,王庆人.中英文混合文章识别问题[J].软件学报,2005,16(5):786-798. 被引量:18
  • 3Spitz A L. Determination of the Script and Language Content of Document Images. IEEE Trans on Pattern Analysis and Machine Intelligence, 1997, 19(3): 235-245
  • 4Takehiro N; Spitz A L. European Language Determination from Image. In: Proc of the International Conference on Document Analysis and Recognition. Tsukuba, Japan, 1993, 159-162
  • 5Korkmaz S U, Akinci G K Y, Atalay V. A Character Recognizer for Turkish Language. In: Proc of the International Conference on Document Analysis and Recognition. Edinburgh, UK,2003, Ⅱ: 1238-1241
  • 6Baird H S. Anatomy of a Versatile Page Reader. Proc of the IEEE, 1992, 80(7): 1059-1065
  • 7Baird H S, Gilbert D, Ittrier D J. A Family of European Page Readers. In: Proc of the 12th IAPR International Conference on Pattern Recognition. Jerusalem, Israel, 1994, Ⅱ: 540-543
  • 8吕岳,施鹏飞.一种实用并行细化算法及其实现[J].计算机工程与设计,2000,21(4):53-56. 被引量:12
  • 9Wang Q R. Decision Tree Approach to Pattern Recognition Problems in a Large Character Set. Ph. D Dissertation. Department of Computer Science, Concordia University, Mortreal,Canada, 1984
  • 10张炘中 闫昌德 刘秀英.汉字识别的特征点法及其一种应用[J].中文信息学报,1987,11(3):13-19.

二级参考文献35

  • 1.ExperVision公司研发OCR高科技产品,在国际同类产品20项评比中19项得第一,副标题王庆人在南开大学研发成功技术转移来美·大放异彩[N].美国:世界日报,1993-9-17(头版头条).
  • 2Rice SV, Kanai J, Nartker TA. An evaluation of OCR accuracy. Technical Report, Las Vegas: Information Science Research Institute, University of Nevada, 1993.9-33.
  • 3Rice SV, Kanai J, Nartker TA. The 3rd annual test of OCR accuracy. Technical Report, Las Vegas: Information Science Research Institute, University of Nevada, 1994. 11-38.
  • 4Kanai J, Liu YC, Rice SV, Nartker TA. A preliminary evaluation of Chinese OCR systems. Technical Report, Las Vegas:Information Science Research Institute, University of Nevada, 1994.41-47.
  • 5Guo H, Ding XQ, Zhang Z, Guo FX. Realization of a high-performance bilingual Chinese-English OCR system. In: Kavanaugh M,Storms P, eds. ICDAR'95: the 3rd Int'l Conf. on Document Analysis and Recognition. Los Alamitos: IEEE Computer Society Press,1995. 978-981.
  • 6Feng ZD, Huo Q. Confidence guided progressive search and fast match techniques for high performance Chinese/English OCR. In:Kasturi R, Laurendeau D, Suen C, eds. ICPR 2002: the 16th Int'l Conf. on Pattern Recognition. Los Alamitos: IEEE Computer Society Press, 2002. 89-92.
  • 7Huo Q, Feng ZD. Improving Chinese/English OCR performance by using MCE-based character-pair modeling and negative training. In: Antonacopoulos A, ed. ICDAR 2003: the 7th Int'l Conf. on Document Analysis and Recognition. Los Alamitos: IEEE Computer Society Press, 2003. 364-368.
  • 8靳简明 王庆人.多语言字符识别系统集成研究[J].软件学报,2002,13:225-230.
  • 9Pan WM, Jin JM, Shi GS, Wang QR. A system for automatic Chinese business card recognition. In: Antonacopoulos A, ed. ICDAR2003: the 7th Int'l Conf. on Document Analysis and Recognition. Los Alamitos: IEEE Computer Society Press, 2003.1138-1141.
  • 10Zheng YF, Liu CS, Ding XQ. Single character type identification. In: Kantor PB, Kanungo T, Zhou JY, eds. Proc. of the SPIE Document Recognition and Retrieval IX. Bellingham: SPIE-the Int'l Society for Optical Engineering, 2002,4670:49-56.

共引文献73

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部