期刊文献+

基于贝斯准则和待定词集模糊矩阵的满文识别后处理 被引量:1

Manchu Character Recognition Post-Processing Based on Bayes Rules and Substitution Set Confusion Matrix
下载PDF
导出
摘要 将满文单词识别系统的识别信息和满文的词组信息有机地结合起来,建立满文词组和待定词集统计信息库,利用贝叶斯准则,综合满文待定词的后验概率和词组的先验概率信息,建立合理有效便于实现的数据结构,对满文单词识别系统输出存在的拒识词和错识词进行检测和纠正,从而有效地提高满文识别系统的识别率·实验表明:后处理性能除取决于语言模型外,还取决于后概率的精确估计·另外,在单词识别系统识别率高的情况下,后处理的纠错能力会增强· After combining of organically the recognition information on single Manchu characters from relevant system with the information on phrases to set up a statistical information database of Manchu phrases and underdetermined word sets, Bayes rules are used to synthesize the prior probability of underdetermined Manchu word sets and posterior probability of phrases. A data construction is thus developed to improve efficiently the recognition rate, which is rational and easy to implement especially available to detect and correct those rejected and incorrectly recognized words output from the SCR single character recognition system. Experiment shows that the post-processing performance depends on not only the language model but the accurate estimate of posterior probability. In addition, the higher the recognition rate of SCR, the stronger the rectifiability of post-processing.
作者 李晶皎 赵骥
出处 《东北大学学报(自然科学版)》 EI CAS CSCD 北大核心 2004年第11期1061-1064,共4页 Journal of Northeastern University(Natural Science)
基金 辽宁省自然科学基金资助项目(2001113)
关键词 满文 后处理 待定词集 模糊矩阵 贝叶斯准则 特征矢量 词组库 Manchu post-processing underdetermined word set confusion matrix Bayes rules features vector phrase database
  • 相关文献

参考文献6

  • 1[3]Chang C H. Word class discovery for postprocessing Chinese handwriting recognition[A]. Proceedings of COLING[C]. Choankia, 1994.1221-1225.
  • 2[5]Lin X F, Ding X Q, Chen M, et al. Adaptive confidence transform based classifier combination for Chinese character recognition[J]. Pattern Recogn Lett, 1998,19(10):975-988.
  • 3[7]Gu H Y, Tseng C Y, Lee L S. Markov modeling of mandarin Chinese for decoding the phonetics sequence into Chinese characters[J]. Comput Speech Lang, 1991,5(4):363-377.
  • 4[8]Lee H J, Tung C H, Chang C H. A Markov model in handwritten Chinese text recognition[A]. Proceedings of ICDAR[C]. Durham, 1993.72-75.
  • 5[9]Chang J S, Chen S D. The postprocessing of optical character recognition based on statistical noisy channel and language model[A]. Proceedings of PACLIC[C]. Beijing, 1995.127-131.
  • 6[10]Wong P K, Chan C. Post-processing statistical language models for a handwritten Chinese character recognizer[J]. IEEE Trans Syst Man Cybern, 1999,29(2):286-291.

同被引文献10

引证文献1

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部