Novel Model Compensation Based on Non-uniform Spectral Compression Features
Abstract: Building on the SNR-dependent Non-uniform Spectral Compression (SNSC) robust feature extraction technique and the Vector Taylor Series (VTS) approach, this paper proposes a new model compensation algorithm, MC-SNSC. SNSC is a spectral magnitude transformation that resembles the human intensity-to-loudness conversion process and de-emphasizes noisy bands. The MC-SNSC procedure is derived from a mismatch function that models, in the log-spectral domain, the combined effect of noise and SNSC feature extraction on clean speech features. Experiments show that the new algorithm gives a marked improvement in recognition rate over the PMC and VTS methods in different additive-noise environments, at the cost of only a slight increase in computational complexity relative to VTS.
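The abstract does not give the MC-SNSC mismatch function itself, so the sketch below only illustrates the baseline it builds on: standard first-order VTS compensation of a diagonal Gaussian in the log-spectral domain under purely additive noise. The function names and the per-band list representation are assumptions made for illustration, and the SNSC compression step is deliberately left out.

```python
import math

def mismatch(x, n):
    """Standard log-spectral mismatch y = log(exp(x) + exp(n)).

    x: clean-speech log-spectral value, n: noise log-spectral value.
    This is the plain additive-noise form, NOT the SNSC-aware function
    derived in the paper (which the abstract does not state).
    """
    m = max(x, n)  # factor out the larger term for numerical stability
    return m + math.log(math.exp(x - m) + math.exp(n - m))

def vts_compensate(mu_x, var_x, mu_n, var_n):
    """First-order VTS compensation of a diagonal Gaussian, band by band.

    Expands the mismatch function around (mu_x, mu_n) and returns the
    approximate mean and variance of the noisy-speech Gaussian.
    """
    mu_y, var_y = [], []
    for mx, vx, mn, vn in zip(mu_x, var_x, mu_n, var_n):
        d = mn - mx
        # g = dy/dx = 1 / (1 + exp(n - x)), computed without overflow
        if d >= 0:
            e = math.exp(-d)
            g = e / (1.0 + e)
        else:
            g = 1.0 / (1.0 + math.exp(d))
        mu_y.append(mismatch(mx, mn))
        # dy/dn = 1 - g; speech and noise are assumed independent
        var_y.append(g * g * vx + (1.0 - g) * (1.0 - g) * vn)
    return mu_y, var_y
```

With equal speech and noise means the Jacobian term is 0.5, so each source contributes a quarter of its variance; when the noise lies far below the speech, the compensated Gaussian reduces to the clean one.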
Source: Journal of Electronics & Information Technology (《电子与信息学报》; indexed in EI, CSCD, and the Peking University Core Journals list), 2007, No. 6, pp. 1384-1388 (5 pages)
Funding: Supported by the National Natural Science Foundation of China (60101002, 60172048)
Keywords: Speech recognition; Model compensation; Non-uniform spectral compression