摘要
研究数字文本资料修复模型,提出基于投影预分割和基于字符连通性二次分割组合的方法,实现对英文数字文档中基本英文字符的准确分割,并通过实验验证该方法的有效性和实用性。该方法具有很强的可扩展性,也可用于中文单字的分割。
In this paper, the model of repairing digital textual matcrial is studied firstly. In order to accurately extract each single character from English textual material, a segmentation method based on projection and characters connectivity is presented. Experimental results show that this method is effective and practical, which can be used to extract each Chinese character because of its extendibility.
出处
《现代图书情报技术》
CSSCI
北大核心
2010年第3期82-85,共4页
New Technology of Library and Information Service
基金
西安外国语大学2009年度科研基金项目"外语院校图书馆信息化
数字化建设研究"(项目编号:09XWC19)的研究成果之一
关键词
资料修复
数字文本图像
字符分割
数字图书馆
Material repairment Digital textual image Characters segmentation Digital library