期刊文献+

基于互信息的语言模型回退平滑算法

A back-off smoothing algorithm of language model based on mutual information
下载PDF
导出
摘要 针对二元模型,提出了一种基于互信息的回退(MI Back-off)平滑算法.从互信息的角度,分析词之间的搭配关系,根据模型中每个二元对的互信息对其概率进行不同程度的折扣,并利用低阶模型对零概率事件进行补偿,通过极小化困惑度的原则体现新算法的合理性.在不同类别测试集下,该平滑算法与传统Katz平滑算法相比,模型困惑度下降均超过20%. A back-off smoothing algorithm based on mutual information for bigram model was presented. This algorithm not only analyzes the coupling relations between words from the perspective of mutual information, discounts the probabilities differently according to the mutual information of each bigram in the model, but also takes advantage of the low-order model to compensate for zero-probability case. Based on a very small degree of confusion prin- ciple, this algorithm was proved to be reasonable. For unseen events, the probabilities were back off to low-order model. Furthermore, the model parameters were estimated by minimizing the perplexity. In testing corpus of different domains, all the perplexities of the proposed smoothing algorithm decline more than 20% compared with the traditional Katz algorithm.
出处 《应用科技》 CAS 2009年第4期28-31,35,共5页 Applied Science and Technology
关键词 中文信息处理 统计语言模型 平滑算法 互信息 困惑度 information processing of Chinese characters statistical language model smoothing algorithm mutual information perplexity
  • 相关文献

参考文献10

  • 1LIDSTONE G J. Note on the general case of the Bayes-Laplace formula for inductive or a posteriori probabilities [ J ]. Transactions of the Faculty of Actuaries, 1920, 8 : 182-192.
  • 2KATZ S M. Estimation of probabilities from sparse data for the language model component of a speech recognizer [ J ]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1987, 35(3) : 400-401.
  • 3GOODMAN J, CHEN S F. An empirical study of smoothing techniques for language modeling [ J ]. Computer Speech and Language, 1999, 13(4):359-393.
  • 4CHURCH K W, GALE W A. A comparison of the enhanced good-turing and deleted estimation methods for estimating probabilities of English bigrams [ J ]. Computer Speech and Language, 1991,5(1):19-54.
  • 5JELINEK F, MERCER R L. Interpolated estimation of Markov source parameters from sparse data[ C]// Proceedings of the Workshop on Pattern Recognition in Practice. Amsterdam, 1980: 381-397.
  • 6徐望,王炳锡.N-gram语言模型中的插值平滑技术研究[J].信息工程大学学报,2002,3(4):13-15. 被引量:13
  • 7江铭虎,朱小燕,袁保宗.一种适应域的汉语N-gram语言模型平滑算法[J].清华大学学报(自然科学版),1999,39(9):99-102. 被引量:9
  • 8杨琳,张建平,颜永红.特定领域的汉语语言模型平滑算法比较研究[J].计算机工程与应用,2006,42(32):14-16. 被引量:5
  • 9黄永文,何中市.基于互信息的统计语言模型平滑技术[J].中文信息学报,2005,19(4):46-51. 被引量:8
  • 10傅祖芸.信息论基础理论与应用[M].北京:电子工业出版社,2005.

二级参考文献12

  • 1周强.基于语料库和面向统计学的自然语言处理技术[J].计算机科学,1995,22(4):36-40. 被引量:25
  • 2Zhou M,IEICE Trans Inf Syst,1996年,E79卷,4期,333页
  • 3Schenand J Goodman. An empirical study of smoothing techniques for language modeling[J]. Proceedings of the 34th meeting of the ACL, 1996:310-318.
  • 4Slava M. Katz. Estimation of probabilities from sparse data for the language model component of a speech recognizer[J]. IEEE transactions on acoustics, speech, and signal processing, March 1987, vol. ASSP-35(3):400-401.
  • 5ROSENFELD R.Two decades of statistical language modeling:where do we go from here?[C]//Proceedings of the IEEE,2000,88:1270~1278.
  • 6Rosenfeld R.A maximum entropy approach to adaptive statistical language modeling[J].Computer Speech and Language,1996,10:187~228.
  • 7CHEN S F,GOODMAN J.An empirical study of smoothing techniques for language modeling[J].Computer speech and language,1999,13:359~394.
  • 8Kneser R,Ney H.Improved backing-off for m-gram language modeling[C]//Proceedings of the IEEE International Conference on Acoustics:Speech and Signal Processing,1995:181~184.
  • 9CHEN S,ROSENFELD R.A survey of smoothing techniques for ME models[J].IEEE Trans Speech and Audio Processing,2000,8:37~50.
  • 10徐志明,王晓龙,关毅.N-gram语言模型的数据平滑技术[J].计算机应用研究,1999,16(7):37-39. 被引量:10

共引文献28

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部