A Method of Characters Segmentation and Its Application in Digital Textual Material Repairment (cited by: 1)
Abstract: This paper studies a model for repairing digital textual material and proposes a two-stage method that combines projection-based pre-segmentation with a second segmentation pass based on character connectivity, so that the basic English characters in English digital documents are segmented accurately. Experiments verify that the method is effective and practical. The method is highly extensible and can also be used to segment single Chinese characters.
Author: Wang Wenzhe (王文哲)
Source: New Technology of Library and Information Service (《现代图书情报技术》), CSSCI, Peking University Core Journal, 2010, No. 3, pp. 82-85 (4 pages)
Funding: One of the research outcomes of the Xi'an International Studies University 2009 research fund project "Research on the Informatization and Digitization of Foreign Language University Libraries" (project No. 09XWC19)
Keywords: Material repairment; Digital textual image; Characters segmentation; Digital library
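
The abstract names two stages, projection-based pre-segmentation followed by a connectivity-based second pass, but this record does not give the paper's actual algorithm. The sketch below is therefore only one plausible reading, written in plain Python under stated assumptions: the input is a binarized text line (1 = ink), the first stage cuts at blank columns of the vertical projection, and the second stage separates 8-connected components inside each cut. The helper names, the `merge_stacked` step (which rejoins pieces such as the dot and stem of an "i"), and the 0.5 overlap threshold are all hypothetical and are not taken from the paper.

```python
from collections import deque


def projection_segments(img):
    """Stage 1: cut a binary text line (rows of 0/1) at blank columns."""
    width = len(img[0]) if img else 0
    profile = [sum(row[x] for row in img) for x in range(width)]
    segments, start = [], None
    for x, ink in enumerate(profile):
        if ink > 0 and start is None:
            start = x
        elif ink == 0 and start is not None:
            segments.append((start, x))
            start = None
    if start is not None:
        segments.append((start, width))
    return segments


def connected_components(img, x0, x1):
    """Stage 2: column ranges of 8-connected components within columns [x0, x1)."""
    height, seen, boxes = len(img), set(), []
    for y in range(height):
        for x in range(x0, x1):
            if not img[y][x] or (y, x) in seen:
                continue
            queue, lo, hi = deque([(y, x)]), x, x
            seen.add((y, x))
            while queue:
                cy, cx = queue.popleft()
                lo, hi = min(lo, cx), max(hi, cx)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < height and x0 <= nx < x1
                                and img[ny][nx] and (ny, nx) not in seen):
                            seen.add((ny, nx))
                            queue.append((ny, nx))
            boxes.append((lo, hi + 1))
    return sorted(boxes)


def merge_stacked(boxes, min_overlap=0.5):
    """Assumed heuristic: rejoin components that mostly share the same columns."""
    merged = []
    for lo, hi in sorted(boxes):
        if merged:
            plo, phi = merged[-1]
            overlap = min(hi, phi) - max(lo, plo)
            narrower = min(hi - lo, phi - plo)
            if overlap >= min_overlap * narrower:
                merged[-1] = (min(lo, plo), max(hi, phi))
                continue
        merged.append((lo, hi))
    return merged


def segment_characters(img):
    """Run both stages and return one (start, end) column range per character."""
    result = []
    for x0, x1 in projection_segments(img):
        result.extend(merge_stacked(connected_components(img, x0, x1)))
    return result


# Toy line: an "i" (dot plus stem), a gap, then two glyphs that overlap in
# one column (as kerned letters can) without touching each other.
rows = ["#.###..",
        "..#....",
        "#.#.###",
        "#.#.###",
        "#.#.###"]
image = [[1 if c == "#" else 0 for c in row] for row in rows]
print(segment_characters(image))
```

In this toy line the projection alone cannot separate the last two glyphs because no column between them is blank. The connectivity pass is what splits them, which is presumably the motivation for combining the two stages.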


