A robust speech recognition method by combining noise cancelling and cepstral mean subtraction
Original title: 一种基于噪声对消与倒谱均值相减的鲁棒语音识别方法
Cited by: 3

Abstract: A noise-robust speech recognition method based on a speech enhancement algorithm is proposed. In the preprocessing stage of speech recognition, adaptive noise cancelling (ANC) is applied to suppress noise and raise the signal-to-noise ratio (SNR). Mel-frequency cepstral coefficients (MFCC) are then extracted from the enhanced speech, and cepstral mean subtraction (CMS) is applied in the cepstral domain to compensate for the distortion components and residual noise in the enhanced speech. Experimental results indicate that at low SNR (-12 to 0 dB) the method achieves a good recognition rate for digital speech recognition and clearly outperforms a basic MFCC recognizer, conventional spectral subtraction (SS), and ANC speech enhancement alone.
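As a rough illustration of the two processing stages named in the abstract, the sketch below pairs an adaptive noise canceller with per-utterance cepstral mean subtraction. The abstract does not say which adaptive algorithm, filter length, or step size the authors used, so the normalized LMS (NLMS) update, the `taps` and `mu` values, the synthetic sine-plus-filtered-noise signal, and all function names here are assumptions made for illustration only. Random vectors stand in for real MFCC frames.

```python
import numpy as np

def nlms_noise_cancel(primary, reference, taps=16, mu=0.05):
    """Adaptive noise cancelling with a normalized LMS filter.

    NLMS is an assumed choice; the abstract only says 'adaptive noise cancelling'.
    primary   : noisy speech (speech + noise correlated with the reference)
    reference : noise-only reference input
    """
    w = np.zeros(taps)                 # adaptive filter weights
    out = np.zeros(len(primary))       # enhanced-speech estimate
    for n in range(taps - 1, len(primary)):
        x = reference[n - taps + 1:n + 1][::-1]   # newest reference sample first
        e = primary[n] - w @ x                    # error signal = speech estimate
        w += mu * e * x / (x @ x + 1e-8)          # normalized LMS weight update
        out[n] = e
    return out

def cepstral_mean_subtraction(cepstra):
    """Subtract each coefficient's mean over the utterance (frames x coefficients)."""
    return cepstra - cepstra.mean(axis=0, keepdims=True)

def snr_db(clean, estimate):
    """SNR of an estimate against the known clean signal, in dB."""
    residual = estimate - clean
    return 10 * np.log10(np.sum(clean ** 2) / np.sum(residual ** 2))

# Synthetic test signal (hypothetical stand-in for speech).
rng = np.random.default_rng(0)
n = 8000
clean = np.sin(2 * np.pi * 0.01 * np.arange(n))
reference = rng.standard_normal(n)
path = np.array([0.8, -0.4, 0.2])                  # assumed noise path to the primary input
noise = 2.0 * np.convolve(reference, path)[:n]
primary = clean + noise

enhanced = nlms_noise_cancel(primary, reference)

# Compare SNR on the second half, after the filter has had time to converge.
half = n // 2
before_db = snr_db(clean[half:], primary[half:])
after_db = snr_db(clean[half:], enhanced[half:])
print(f"SNR before: {before_db:.1f} dB, after: {after_db:.1f} dB")

# CMS removes any offset that is constant across frames in the cepstral domain.
cepstra = rng.standard_normal((50, 13))            # placeholder for MFCC frames
offset = rng.standard_normal(13)                   # constant cepstral bias
normalized = cepstral_mean_subtraction(cepstra + offset)
```

On this toy signal the enhanced output should show a higher SNR than the noisy input once the filter has converged, and the constant cepstral offset disappears after mean subtraction. Real speech and real MFCC features would behave less cleanly.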

Authors: Wang Zhenli (王振力), Pei Lingbo (裴凌波), Yu Yuanbin (于元斌)

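Affiliations: Postdoctoral Research Station, Nanjing Institute of International Relations (南京国际关系学院博士后流动站); Training Department, Engineering Corps Command College (工程兵指挥学院训练部)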

Source: CAAI Transactions on Intelligent Systems (《智能系统学报》), 2008, No. 6, pp. 552-556 (5 pages)

Funding: Jiangsu Province Postdoctoral Research Fund (0701008C); China Postdoctoral Science Foundation (20070420561)

Keywords: adaptive noise cancelling; speech enhancement; spectral subtraction; noise robust speech recognition; cepstral mean subtraction

Classification: TN912.34 [Electronics and Telecommunications: Communication and Information Systems]

References (7)

1. [1] BOLL S F. Suppression of acoustic noise in speech using spectral subtraction[J]. IEEE Trans on Acoustics, Speech, and Signal Processing, 1979, 27(2): 113-120.
2. [5] MAMMONE R J, ZHANG Xiaoyu, RAMACHANDRAN R P. Robust speaker recognition: a feature-based approach[J]. IEEE Signal Processing Magazine, 1996, 13(5): 58.
3. [6] DAVIS S B, MERMELSTEIN P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences[J]. IEEE Trans on Acoustics, Speech, and Signal Processing, 1980, 28(4): 357-366.
4. [7] HERMANSKY H, MORGAN N. RASTA processing of speech[J]. IEEE Trans on Speech and Audio Processing, 1994, 2(4): 578-589.
5. [8] HERMANSKY H. Perceptual linear predictive (PLP) analysis of speech[J]. J Acoust Soc Am, 1990, 87(4): 1738-1752.
6. [9] LIU F H, ACERO A, STERN R. Efficient joint compensation of speech for the effects of additive noise and linear filtering[C]//IEEE International Conference on Acoustics, Speech, and Signal Processing. San Francisco, USA, 1992(1): 257-260.
7. [11] VIIKKI O, BYE D, LAURILA K. A recursive feature vector normalization approach for robust speech recognition in noise[C]//Proceedings ICASSP'98. Seattle, WA, USA: IEEE Acoustics, Speech and Signal Processing Society, 1998: 733-736.
