期刊文献+

基于VQ/CDHMM的噪声环境下汉语口令识别研究 被引量:2

Chinese Spoken Password Recognition in Noise Based on VQ /HMM
下载PDF
导出
摘要 该文研究了基于改进VQ/HMM模型的语音识别方法,设计实现了基于该模型的汉语口令识别系统;研究了鲁棒性特征参数问题,提出了一些新的基于MFCC和LPCC的高维动态参数;分别进行了纯净语音和不同信噪比语音的识别实验,分析比较了不同类型特征参数、训练状态数和高斯混合度对该系统识别性能的影响。在此基础上得出了以下结论:在加性白噪声的情况下,使用高维动态参数明显提高了系统的鲁棒性;在汉语两字组的短语音(口令)识别中,状态数取4,混合度取3时实验结果较好;利用不同特征参数的优势,进行信息融合,是提高系统性能的一个很好选择。 In this paper an effective Chinese order recognition system is constructed using improved VQ/HMM model.The robustness of feature parameters is also studied here and some new dynastic parameters with high dimension are presented based on MFCC and LPCC.Influence of parameter types,trained states and Gaussian mixture degrees on sys-tem performance is analyzed and compared on the basis of voice recognition experiment in clean and noisy environ-ment.The conclusions of this paper are shown as follows :the robustness of system is obviously improved by the means of dynastic parameters with high dimension in the additive white noisy environment ;performance of Chinese spoken password recognition system is superior when state number is four and Gaussian mixture degree is three;Information fu-sion using different parameters is an effective approach to improve the recognition performance of system.
作者 黄玲 潘孟贤
出处 《计算机工程与应用》 CSCD 北大核心 2003年第28期106-108,161,共4页 Computer Engineering and Applications
关键词 语音识别 连续隐马尔可夫模型 特征参数 矢量量化 Speech Recognition,CDHMM,Feature Parameter,Vector Quantization
  • 相关文献

参考文献5

  • 1张焱,张杰,黄志同.语音识别中隐马尔可夫模型状态数的研究[J].南京理工大学学报,1998,22(3):208-211. 被引量:5
  • 2L Rabiner,B H Juang.Fundamentals of Speech Recognition[M].Prentice Hall Press, 1993 : 112-121,348-349,125-128.
  • 3Michael Kleinschmidt,Jurgen Tchorz et al.Combining Speech Enhancement and Auditory Feature Extraction for Robust Speech Recognition[J].EISEVIER Speech Communication,2001:75-92.
  • 4Charles A Micchelli,Peder Olsen.Penalized maximum-likelihood estimation,the Baum-Welch algorithm,diagonal balancing of symmetric matrices and application to training acoustic data[J].EISEVIER,Journal of Computational and Applied Mathematics, 2000; 119 : 301-331.
  • 5Montri Karnjanadecha,Stephen A Zahorian.Signal Modeling for High-Performance Robust Isolated Word Recognition[J].IEEE TRANSACTION ON SPEECH AND AUDIO PROCESSING,2001;9(6).

二级参考文献1

共引文献4

同被引文献10

  • 1SADAOKI F. Neural network based HMM adaptation for noisy speech[J]. IEEE,2001:365-368.
  • 2BURSHTEIN D. Robust parametric modeling of durations in hidden markov models [A]. Processings of IEEE ICASSP [C].Berlin,1995.
  • 3杨行峻 迟惠生.语音信号数字处理[M].北京:电子工业出版,2000..
  • 4BONAFONTE A, VIDAL J, NOGUEIRAS A. Duration modeling with expanded HMM applied to speech recognition[A].Proceedings of the Fourth International Conference on Spoken Language[C].Philadelphia, 1996.
  • 5RABINER L R. A Tutorialon hidden markov models and selected applications in speech recognition[A]. Proceedings of the IEEE[C].1989.
  • 6CHULHEE L, DONGHOON H, EUISUN C,et al Optimizing feature extraction for speech recognition[J]. IEEE Trans on Speech And Audio Processing, 2003, 11(1):80-87.
  • 7Sadaoki Furui,and Daisuke Itoh.Neural-Network-Based HMM Adaptation for Noisy Speech. . 2001
  • 8L. R. Rabiner.A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of Tricomm . 1989
  • 9Chulhee Lee,Donghoon Hyun,Euisun Choi,Jinwook Go,and Chungyong Lee.Optimizing Feature Extraction for Speech Recognition EM]. IEEE Transactions on Speech and Audio Processing . 2003
  • 10李晶皎,孙杰,张俐,姚天顺.语音识别中HMM与自组织神经网络结合的混合模型[J].东北大学学报(自然科学版),1999,20(2):144-147. 被引量:10

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部