
Using Layout Information in Image to Improve Automated Classification for Text in Business Cards

Original Chinese title: 利用名片文本图像版面信息的辅助分类方法
Abstract: Building on a text classification algorithm that combines knowledge engineering with statistical learning, this paper proposes using the position of each piece of text within the business card image as layout information to assist classification. The method exploits the spatial relationships among the different kinds of text on the card and significantly improves the accuracy of business card information classification.
Source: Video Engineering (《电视技术》), PKU Core Journal, 2004, No. 8, pp. 67-70 (4 pages)
Funding: National 863 High-Tech Program of China (2001AA114081); National Natural Science Foundation of China (60241005)
Keywords: text classification; layout information in images; business card; OCR system