Novel Model Compensation Based on Non-uniform Spectral Compression Features

Abstract: Building on the SNR-dependent Non-uniform Spectral Compression (SNSC) robust feature extraction technique and the Vector Taylor Series (VTS) approach, this paper proposes a new model compensation algorithm, MC-SNSC. SNSC is a spectral magnitude transformation and noise suppression technique that resembles the human intensity-to-loudness conversion process and de-emphasizes noisy bands. The MC-SNSC compensation procedure is derived from a mismatch function that models the combined effect of noise in the log-spectral domain and of SNSC feature extraction on the speech features. Experiments show that the new algorithm gives clearly higher recognition rates than the VTS and PMC algorithms in different additive noise environments, at the cost of only a slight increase in computational complexity relative to VTS.
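For orientation, the VTS step that the abstract builds on can be sketched as follows. This is a minimal illustration of the standard first-order VTS compensation for additive noise in the log-spectral domain, assuming that speech and noise power spectra add and that all Gaussians have diagonal covariances. It does not include the SNSC compression term, so it is not the MC-SNSC mismatch function derived in the paper. The function name `vts_compensate` and its arguments are hypothetical.

```python
import math


def vts_compensate(mu_x, var_x, mu_n, var_n):
    """First-order VTS compensation of one diagonal Gaussian (log-spectral domain).

    mu_x, var_x: clean-speech mean and variance per channel.
    mu_n, var_n: noise mean and variance per channel.
    Assumes the standard additive-noise mismatch function
        y = x + log(1 + exp(n - x)),
    which is an assumption of this sketch, not the paper's MC-SNSC function.
    """
    mu_y, var_y = [], []
    for mx, vx, mn, vn in zip(mu_x, var_x, mu_n, var_n):
        # Zeroth-order term: mismatch function evaluated at the means.
        mu_y.append(mx + math.log1p(math.exp(mn - mx)))
        # g = dy/dx at the expansion point; dy/dn is then 1 - g.
        g = 1.0 / (1.0 + math.exp(mn - mx))
        # First-order variance propagation through the linearized function.
        var_y.append(g * g * vx + (1.0 - g) ** 2 * vn)
    return mu_y, var_y
```

At high SNR the compensated Gaussian stays close to the clean one, and at equal speech and noise means the two variances are weighted equally, which is the expected behavior of the linearization.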

Authors: Ning Gengxin (宁更新), Wei Gang (韦岗), Kong Xiangzhu (孔祥祝)

Affiliation: School of Electronic and Information Engineering, South China University of Technology

Source: Journal of Electronics & Information Technology (《电子与信息学报》), 2007, No. 6, pp. 1384-1388 (5 pages). Indexed in EI, CSCD, and the Peking University Core Journals list.

Funding: Supported by the National Natural Science Foundation of China (60101002, 60172048).

Keywords: speech recognition; model compensation; non-uniform spectral compression

Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]
