多种前端滤波器的ZCPA对语音多变性的鲁棒性研究

Different Front-end Filter Banks Used for ZCPA in Variability Robustness Research

下载PDF

导出

摘要针对语音多变性的鲁棒性问题,分别将FIR滤波器、Gammatone(GT)滤波器、Laguerre滤波器以及弯折滤波器(Warped Filter Banks,WFBs)用于过零峰值幅度(Zero Crossing Peak Amplitude,ZCPA)特征提取,并使用支持向量机(Support Vector Machine,SVM)作为后端识别系统,通过实验得到了不同滤波器下ZCPA的识别结果。结果表明在多变性语音的识别中,SVM系统较常用的HMM系统,更适于ZCPA特征;并且在SVM系统下,ERB尺度的弯折滤波器较其它前端滤波器识别效果更好,明显优于常用的MFCC特征。 In order to solve the variability robustness in speech recognition systems,different front-end filter banks such as FIR filter bank,Gammatone（GT） filter bank,Laguerre filter bank and Warped Filter Banks（WFBs） were used to extract Zero Crossing Peak Amplitude（ZCPA） feature.They were all based on the Support Vector Machine（SVM） system.The experiments show that the SVM was much more suitable for ZCPA than HMM in variability recognition tasks.Moreover,in the SVM system,the ERB-scale WFBs had the best recognition results compared with the other front-end filter banks.It outperformed significantly than MFCC.

作者黄丽霞张雪英刘雪艳

机构地区太原理工大学信息工程学院

出处《太原理工大学学报》 CAS 北大核心 2011年第3期215-218,223,共5页 Journal of Taiyuan University of Technology

基金国家自然科学基金项目(61072087) 山西省研究生立项优秀创新项目(20093048)

关键词 FIR滤波器 Gammatone(GT)滤波器 Laguerre滤波器弯折滤波器(WFBs) 过零峰值幅度(ZCPA) 支持向量机(SVM) FIR filter bank Gammatone（GT） filter bank Laguerre filter bank Warped Filter Banks（WFBs） Zero Crossing Peak Amplitude（ZCPA）

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献12

1Lockwood P, Boudy J. Experiments with a Nonlinear Spectral Subtractor (NHS), Hidden Markov Models and the Projec- tions, for Robust Speech Recognition in Cars[J].Speech Communication, 1993, 11(2):215-228.
2P J Moreno. Speech Recognition in Noisy Environments[D]. Pennsy lvania:Carnegie Mellon University, 1996.
3姚文冰,姚天任,韩涛.稳健语音识别技术发展现状及展望[J].信号处理,2001,17(6):484-484. 被引量：14
4Lixia Huang, Xueying Zhang, Gianpaolo Evangelista. Speaker Independent Recognition on OI.LO French Corpus by Using Different Features[C]// The First International Conference on Pervasive Computing, Signal Processing and Applications Harbin Chiro, Harbin, China, 2010 : 332 335.
5Wesker T, Meyer B, Wagener K, et al. Oldenburg Logatome Speech Corpus (OLLO) for Speech Recognition Experiments with Humans and Machines[C]//European Conference on Speech Cormmunication and Technology, Lisbon, Portugal, 2005 1273-1276.
6Kim D S, Lee S Y, Kil R M. Auditory Processing of Speech Signal for Robust Speech Recognition in Real-World Noisy Environments[J]. IEEETransSpeechand AudioProc, 1999, 7(2) 55- 69.
7Johannesma P I M. The Pre-response Stimulus Ensemble of Neurons in the Cochlear Nucleus[C]//Proc Symposium on Hearing Theory, Eindhoven, Netherlands, 1972 : 58-69.
8Lixia Huang, Xueying Zhang, Xueyan Liu. Different Channels in Gammatone Filter Bank Based on ZCPA for Speaker-lnde- pendent Recognition Task[C] // International Asia Conference on Optical Instrument and Measurement, Shenzhen, China, 2010.
9黄丽霞,张雪英.Laguerre滤波器在抗噪语音识别特征提取中的应用[J].计算机工程与应用,2008,44(18):21-24. 被引量：1
10Cusack R, Carlyon RP. Perceptual Asymmetries in Audition[J]. J Exp Psych Human Percept Perform, 2003, 29(3) 713- 725.

二级参考文献14

1黄高勇,张家树.一种抑制直扩通信窄带干扰的新型非线性自适应预测滤波器[J].电子与信息学报,2007,29(6):1328-1331. 被引量：17
2Kim D S,Lee S-Y,Kil R M.Auditory processing of speech signals for robust speech recognition in real-world noisy environments[J]. IEEE Transaction on Speech and Speech Audio Processing,1999, 7( 1 ) :55-69.
3Masnadi-Shirazi M,Aleshams M.Laguerre discrete-time filter dcsign[J].Computers and Electrical Engineering, 2003,29 : 173-192.
4Silva Toe.On the determination of the optimal pole position of Laguerre filters[J].IEEE Signal Process, 1995,4(9 ) : 2079-2087.
5Oded Ghitza.Auditory models and human performance in tasks related to speech coding and speech reeognition[J].IEEE Transactions on Speeeh and Audio Processing, 1994,2( 1 ) : 113-131.
6Muller K R, Mika S, Rtsch G, et al. An Introduction to Kernel-based Learning Algorithms[J]. IEEE Transactions on Neural Networks, 2001, 12(2):181-201.
7Vapnik V. The Nature of Statistical Learning Theory[M]. New York: Springer-Verlag, 1995.
8Chapelle,V Vapnik,et al. Choosing Multiple Parameters for Support Vector Machines[J]. Machine Learning, 2002:46:131-159.
9Xueying Zhang, Jing Bai, Wuzhou Liang. The Speech Recognition System Based On Bark Wavelet MFCC[C]. 8th International Conference on Signal Processing, 2006:16-20.
10Debnath R, Takahashi H. A Decision Based On One-Against One Method for Multi-Class Support Vector Machine[J]. Pattern Anal Applie, 2004,7:164-175.

共引文献19

1赵贤宇,王作英.用于语音识别的鲁棒自适应麦克风阵列算法[J].清华大学学报（自然科学版）,2004,44(10):1433-1436. 被引量：5
2邱洪,吴淑珍.噪声补偿应用于与文本无关的说话人辨认研究[J].北京大学学报（自然科学版）,2005,41(1):115-121.
3王新民,雷丽,徐智辉.基于线性预测的自适应语音增强技术[J].孝感学院学报,2005,25(3):31-33. 被引量：1
4赵贤宇,欧智坚,王作英.基于VTS的稳健语音识别[J].清华大学学报（自然科学版）,2005,45(7):892-895.
5马治飞,王炳锡.一种基于概率模型的特征补偿算法[J].微计算机信息,2005,21(11S):100-101.
6熊燕.抗噪声语音识别技术研究[J].中国科技信息,2006(7):204-205. 被引量：5
7蔡妍,陈苗苗.语音识别和语音合成在航管雷达模拟系统中的应用[J].中国民航飞行学院学报,2007,18(3):53-56. 被引量：3
8杨旭方,李慧.基于凌阳单片机实现的办公电器语音控制系统[J].科教文汇,2008(8):197-197.
9王军,种兰祥.麦克风阵列声源定位与跟踪性能改进[J].计算机工程与应用,2008,44(19):235-237.
10邱作春.麦克风阵列语音增强用于抗噪说话人识别[J].大众科技,2008,10(12):35-37.

1黄丽霞,张雪英.Laguerre滤波器在抗噪语音识别特征提取中的应用[J].计算机工程与应用,2008,44(18):21-24. 被引量：1
2刘成城,刘亚奇,赵拥军,杨静.基于Laguerre滤波器等价设计的IIR宽带波束形成[J].电子学报,2015,43(2):399-404. 被引量：1
3赵姝彦,张雪英,焦志平.基于ZCPA和DHMM的孤立词语音识别系统[J].太原理工大学学报,2005,36(3):246-249. 被引量：4
4张晓辉,李辉.基于ZCPA特征参数的口令识别系统[J].电子技术（上海）,2010(7):27-29.
5贺双赤.用Laguerre滤波器实现多径衰落信道自适应均衡[J].电讯技术,2004,44(1):82-86. 被引量：3
6朱海涛.基于神经网络的语音识别鲁棒性研究[J].中国科技信息,2008(5):276-277. 被引量：1
7傅洪亮,酆广增.一种基于天线阵列预处理盲多用户检测算法及其对DOA估计误差的鲁棒性研究[J].电子学报,2006,34(10):1884-1887.
8袁小刚,黄国策,刘剑,郭兴阳.用Laguerre滤波器实现自适应跳频同址干扰抵消[J].计算机科学,2009,36(11):93-96. 被引量：3
9孙颖,张雪英.情感语音特征对语料库依赖性的统计分析[J].噪声与振动控制,2011,31(4):132-136. 被引量：3
10赵海全,张家树.非线性通信信道的神经FIR自适应幅值Laguerre均衡器[J].中国科学（F辑:信息科学）,2009,39(10):1095-1103.

太原理工大学学报

2011年第3期

浏览历史

内容加载中请稍等...

多种前端滤波器的ZCPA对语音多变性的鲁棒性研究

参考文献12

二级参考文献14

共引文献19

相关作者

相关机构

相关主题

浏览历史