Abstract
Chinese characters come in a wide variety of fonts, and different fonts differ markedly in character structure. Current OCR technology focuses on recognizing the characters themselves and pays comparatively little attention to font recognition. This paper proposes a character-dependent, single-character font recognition method that exploits prior knowledge of the character's identity together with structural features of the font, and measures font similarity with a vector space model. Experiments on 66 commonly used Simplified Chinese fonts yield a good average recognition rate.
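As an illustration of how a vector space model can rank candidate fonts once the character itself is known, here is a minimal sketch. It is not the paper's implementation: the abstract does not specify the structural features or the exact similarity measure, so the three-dimensional feature vectors, the use of cosine similarity, the font names, and the helper names `cosine_similarity` and `recognize_font` are all assumptions made for this example.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # treat an all-zero vector as matching nothing
    return dot / (norm_u * norm_v)


def recognize_font(char, features, templates):
    """Return the font whose template for `char` is most similar to `features`.

    `templates` maps a character to {font_name: feature_vector}. Looking up
    templates by character is what makes the matching character-dependent:
    only reference vectors for the same character are compared.
    """
    candidates = templates[char]
    return max(candidates,
               key=lambda font: cosine_similarity(features, candidates[font]))


# Hypothetical reference vectors for one character in three fonts.
# The numbers are invented for illustration, not measured features.
TEMPLATES = {
    "永": {
        "SimSun": [0.9, 0.1, 0.3],
        "SimHei": [0.2, 0.8, 0.5],
        "KaiTi": [0.4, 0.3, 0.9],
    }
}

# Feature vector of an unknown sample already recognized as "永".
query = [0.25, 0.75, 0.45]
print(recognize_font("永", query, TEMPLATES))  # prints SimHei
```

In this toy setup the query vector is almost parallel to the SimHei template, so SimHei is returned. A real system would replace the invented vectors with features extracted from glyph images.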
Source
Computer Engineering and Applications (《计算机工程与应用》)
CSCD
PKU Core Journal (北大核心)
2011, Issue 10, pp. 158-160 (3 pages)
Keywords
font recognition (字体识别)
vector space model (向量空间模型)
Chinese character features (汉字特征)