期刊文献+

基于后验概率解码段模型的汉语语音数字串识别 被引量:12

Mandarin Digit String Recognition Based on Segment Model Using Posterior Probability Decoding
下载PDF
导出
摘要 通过对语音解码的分析指出了基于似然概率解码的连续语音识别的局限性,并给出了三种基于后验概率段模型(Segment Model,SM)的语音解码方法.这三种方法成功地运用于随机段模型(Stochastic Segment Model,SSM),使误识率比基线系统下降了11%;与此同时还给出了段模型的快速算法,使算法的计算复杂度降到了与隐马尔可夫模型(Hidden Markov Model,HMM)相同的数量级,满足了实用要求. The decoding algorithms of most continuous speech recognition systems are based on the likelihood score now. However, the likelihood score is only an approximate of the posterior probability and will lead to a suboptimal solution in continuous speech recognition task. In this paper, three Segment Model(SM) decoding methods based on posterior probability are introduced and successfully implemented on a Stochastic Segment Model(SSM) based system. SSM is one kind of segment models. The new decoding methods achieve 11% error rate reduction compared with the baseline system. In the meantime, a fast algorithm for SM is also proposed, which can reduce the computation complexity of the above algorithms to the same level as that of HMM and meet the requirement of real-time applications.
出处 《计算机学报》 EI CSCD 北大核心 2006年第4期635-641,共7页 Chinese Journal of Computers
基金 国家自然科学基金(60172055 60121302) 北京市自然科学基金(4042025) 国家"九七三"重点基础研究发展规划项目基金(2004CB318105)资助
关键词 后验概率 段模型 汉语数字串 语音识别 模式识别 posterior probability segment model mandarin digit string speech recognition pattern recognition
  • 相关文献

参考文献17

  • 1Huang X.D,Acero A,Hon H.W..Spoken Language Processing:A Guide to Theory,Algorithm and System Development.New Jersey:Prentice Hall,2001
  • 2Juang B,Furi S..Automatic recognition and understanding of spoken language-A first step toward natural human-machine communication.Proceedings of the IEEE,2000,88(8):1142~1165
  • 3Rabiner L,Juang B.H..Fundamentals of Speech Recognition.New Jersey:Prentice Hail,1993
  • 4Ostendorf M,Digalakis V.V,Kimball O.A..From HMM's to segment models:A unified view of stochastic modeling for speech recognition.IEEE Transactions on Speech Audio Processing,1996,4(5):360~378
  • 5Gong Y..Stochastic trajectory modeling and sentence searching for continuous speech recognition.IEEE Transactions on Speech Audio Processing,1997,5(1):33~44
  • 6Dugakakis V.V,Ostendorf M,Rohlicek J.R..Fast algorithms for phone classification and recognition using segment-based models.IEEE Transactions Speech Audio Processing,1992,40(12):2885~2896
  • 7Lee C,Glass R..Real-time probabilistic segmentation for segment-based speech recognition.In:Proceedings of the International Conference on Spoken Language Processing,Sydney,Australia,1998,1803~1806
  • 8Ostendorf M,Roukos S..A stochastic segment model for phoneme based continuous speech recognition.IEEE Transactions on Acoustics,Speech and Signal Processing,1989,37(12):1857~ 1869
  • 9Gish H,Ng K,Rohlicek R..Secondary processing using speech segments for an HMM word spotting system.In:Proceedings of the International Conference on Spoken Language Processing,Alberta,Canada,1992,1:17~20
  • 10Rueber B..Obtaining confidence measures from sentence probabilities.In:Proceedings of the 5th European Conference on Speech Communication and Technology,Rhodes,Greece,2001,739~742

二级参考文献3

共引文献8

同被引文献144

引证文献12

二级引证文献34

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部