期刊文献+

基于小波分析及改进二次鉴别函数的民族文种识别 被引量:2

Chinese minority script identification method based on wavelet feature and MQDF
下载PDF
导出
摘要 为了能够对文档中的少数民族文字种类进行正确地识别分类,提出一种基于小波分析与改进的二次分类函数(MQDF)的少数民族文字种类识别方法。该方法采用多辨识小波分解,从而获得小波能量和小波能量比例分布的特征描述,利用MQDF分类器对少数民族文种进行识别。构建藏文、西双版纳傣文、纳西象形文、维吾尔文、德宏傣文和彝文6种常用的少数民族文字及汉字、英语共8种文字的样本库,采用该方法对少数民族的样本库进行了进行训练和测试。实验结果显示,该方法在多层小波分解的情况下,对于少数民族文种识别的精度好于传统的贝叶斯和KNN。 In order to classify the type of the Chinese minority scripts, the method of identifying the kinds of Chinese minority scripts based on wavelet analysis and Modified Quadratic Discriminant Function (MQDF) was presented. Using wavelet energy and wavelet energy distribution proportion as features by wavelet multi-resolution transform, muhivariate classifier in MQDF was constructed. A sample data set was built which contained six common Chinese minority scripts: Tibetan, Tai Lue, Naxi Pictographs, Uighur, Tai Le, Yi and Chinese and English in total, some samples were used for training, others were for testing, and the proportions of the training samples in dataset were variant. Obviously, the experimental result shows that, in muhi-level decomposition, the method is better than the traditional Bayes and K-Nearest Neighbor (KNN) classification in recognition rate.
作者 郭海 赵晶莹
出处 《计算机应用》 CSCD 北大核心 2009年第12期3360-3362,3365,共4页 journal of Computer Applications
基金 国家自然科学基金资助项目(60803096) 国家民委项目(07DL07)
关键词 中国少数民族文字 文种识别 小波分析 改进的二次分类函数 Chinese minority script script identification wavelet analysis Modified Quadratic Discriminant Function (MQDF)
  • 相关文献

参考文献10

  • 1王维兰,丁晓青,祁坤钰.藏文识别中相似字丁的区分研究[J].中文信息学报,2002,16(4):60-65. 被引量:14
  • 2王华,丁晓青,哈力木拉提.多字体多字号印刷维吾尔文字符识别[J].清华大学学报(自然科学版),2004,44(7):946-949. 被引量:18
  • 3李振宏,高光来,侯宏旭,李伟.印刷体蒙古文文字识别的研究[J].内蒙古大学学报(自然科学版),2003,34(4):454-457. 被引量:9
  • 4GUO HAI, ZHAO JING-YING. The design and realization of the Naxi pictographs information processing system [J]. WSEAS Transactions on Systems, 2009, 6(2) : 302 -311.
  • 5郭海,车文刚,聂娟,李斌,许剑锋.纳西象形文Web植入技术[J].计算机工程,2005,31(17):203-204. 被引量:8
  • 6GUO HAI, ZHAO JING-YING, LIU YONG-KUI, et al. Naxi pictographs information processing based on Web embedding fonts technology [ J]. Journal of Computational Information Systems, 2009, 5 (1) :495 -501.
  • 7HOCHBERG J, KELLY P, THOMAS T, et al. Automatic script identification from document images using cluster-based templates [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997, 19(2): 176-181.
  • 8SUEN C Y, BERGLER S, NOBILE N, et al. Automatic identification of oriental and other scripts in image documents [ J]. Intemational Journal of Computer Processing of Oriental Languages, 2005, 18(2): 77-94.
  • 9PATI P B, RAMAKRISHNAN A G. Word level multi-script identification [ J]. Pattern Recognition Letters, 2008, 29(9) : 1218 - 1229.
  • 10张振宇,黄崇林,谭恒松.基于小波变换的图像识别算法[J].计算机应用,2007,27(B12):97-99. 被引量:6

二级参考文献26

  • 1陈友斌.非特定人脱机手写汉字识别方法的研究.清华大学电子系博士学位论文[M].,1997(6).56-63.
  • 2Al-Badr B, Mahmoud A. Survey and bibliography of Arabic optical text recognition [J]. Signal Processing, 1995, 41(1): 49-77.
  • 3Al-Yousefi H, Udpa S. Recognition of Arabic characters [J]. IEEE Trans on PAMI, 1992, 14(8): 853-858.
  • 4Hou H, Andrews H. Cubic splines for image interpolation and digital filtering [J]. IEEE Trans on Acoustics, Speech, and Signal Processing, 1978, 26(6): 508-517.
  • 5Fukunaga K. Introduction to Statistical Pattern Recognition (2nd Edition) [M]. New York: Academic Press, 1990.
  • 6Kimura F, Takashina K, Tsuruoka S. Modified quadratic discriminant functions and the application to Chinese character recognition [J]. IEEE Trans on PAMI, 1987, 9(1): 149-153.
  • 7LIN Xiaofan, DING Xiaoqing, CHEN Ming, et al. Adaptive confidence transform based classifier combination for Chinese character recognition [J]. Pattern Recognition Letters, 1998, 19(10): 975-988.
  • 8Kato N, Suzuki M, Omachi S, et al. A handwritten character recognition system using directional element feature and asymmetric Mahalanobis distance [J]. IEEE Trans on PAMI, 1999, 21(3): 258-262.
  • 9马少平,夏莹,朱小燕.基于模糊方向线素特征的手写体汉字识别[J].清华大学学报(自然科学版),1997,37(3):42-45. 被引量:37
  • 10Microsoft. TrueType Open Font Specification. 1995-07.

共引文献44

同被引文献25

引证文献2

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部