Research on Chinese Spoken Password Recognition in Noisy Environments Based on VQ/CDHMM (cited by: 2)

Chinese Spoken Password Recognition in Noise Based on VQ/HMM

Abstract: This paper studies a speech recognition method based on an improved VQ/HMM model, and designs and implements a Chinese spoken password recognition system built on that model. It also examines the robustness of feature parameters and proposes several new high-dimensional dynamic parameters derived from MFCC and LPCC. Recognition experiments were carried out on clean speech and on speech at different signal-to-noise ratios, and the effects of feature parameter type, number of trained states, and number of Gaussian mixtures on recognition performance were analyzed and compared. The conclusions are as follows: under additive white noise, high-dimensional dynamic parameters clearly improve the robustness of the system; for short utterances (passwords) made up of two-character Chinese words, the experimental results are better with 4 states and 3 Gaussian mixtures; and information fusion that exploits the strengths of different feature parameters is an effective way to improve system performance.
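The abstract does not say how the noisy test speech or the dynamic parameters were actually produced, so the following is only a minimal sketch under stated assumptions. It mixes white Gaussian noise into a signal at a chosen SNR and appends first- and second-order delta coefficients to a matrix of static features (for example MFCC or LPCC frames). The function names `add_white_noise`, `delta`, and `stack_dynamic`, the regression window `width=2`, and the common regression formula for deltas are illustrative choices, not details taken from the paper.

```python
import numpy as np


def add_white_noise(signal, snr_db, rng=None):
    """Return signal plus white Gaussian noise scaled to the requested SNR in dB."""
    rng = np.random.default_rng() if rng is None else rng
    signal = np.asarray(signal, dtype=float)
    signal_power = np.mean(signal ** 2)
    # SNR_dB = 10 * log10(P_signal / P_noise), solved for P_noise.
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)
    return signal + noise


def delta(features, width=2):
    """Regression-based delta coefficients; features has shape (frames, dims)."""
    features = np.asarray(features, dtype=float)
    n = features.shape[0]
    # Repeat the first and last frames so every frame has a full window.
    padded = np.pad(features, ((width, width), (0, 0)), mode="edge")
    denom = 2.0 * sum(k * k for k in range(1, width + 1))
    out = np.zeros_like(features)
    for k in range(1, width + 1):
        ahead = padded[width + k: width + k + n]   # frame t + k
        behind = padded[width - k: width - k + n]  # frame t - k
        out += k * (ahead - behind)
    return out / denom


def stack_dynamic(static):
    """Concatenate static, delta, and delta-delta features along the last axis."""
    d1 = delta(static)
    d2 = delta(d1)
    return np.hstack([static, d1, d2])
```

With 12 static coefficients per frame, `stack_dynamic` yields 36-dimensional vectors. Whether this matches the dimensionality or construction of the paper's proposed parameters cannot be determined from the abstract.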

Authors: 黄玲 (Huang Ling), 潘孟贤 (Pan Mengxian)

Affiliation: School of Computer Science and Information Engineering, Hefei University of Technology (合肥工业大学)

Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list, 2003, No. 28, pp. 106-108, 161 (4 pages in total)

Keywords: speech recognition; continuous hidden Markov model (CDHMM); feature parameter; vector quantization

Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]


