期刊文献+

噪声环境下畸变模型线性化处理的顽健语音识别方法

Linearized distortion model for robust speech recognition in noisy environments
下载PDF
导出
摘要 针对噪声环境下语音识别的顽健性问题,考虑到梅尔倒谱系数(MFCC,Mel-frequency cepstral coefficient)域的畸变模型高度非线性且难以处理,用分段线性插值函数代替对数函数,提出了一种新的线性畸变模型。在此基础上,导出了噪声参数估计和声学模型补偿方法,无需采用矢量泰勒级数(VTS,vector Taylor series)展开作近似处理,有效避免了模型误差的引入,增强了系统在噪声环境下的顽健性。 The robustness of speech recognition system in noisy environments was investigated.The distortion model in Mel-frequency cepstral coefficient(MFCC) domain is highly non-linear and difficult to deal with.A new linear distortion model was proposed by replacing the logarithm operation with its piecewise linear interpolation function.Then the esti-mation of noise parameters and compensation of acoustic models were provided.The proposed method can avoid model error introduced by utilizing linearization methods based on vector Taylor series(VTS) expansion,and significantly im-prove the robustness of recognizer in noisy environments.
出处 《通信学报》 EI CSCD 北大核心 2010年第9期8-14,共7页 Journal on Communications
基金 国家高技术研究发展计划("863"计划)基金资助项目(2006AA010103) 国家重点基础研究发展计划("973"计划)基金资助项目(2007CB311100)~~
关键词 语音识别 顽健性 畸变模型 线性化 speech recognition robustness distortion model linearization
  • 相关文献

参考文献14

  • 1YUSUKE S,MASANMI A.Bayesian feature enhancement using mixture of unscented transformations for uncertainty decoding of noisy speech[A].Proceedings of ICASSP[C].Taiwan,China,2009.4569-4572.
  • 2ACERO A,DENG L,KRISTJANSSON T,et al.HMM adaptation using vector Taylor series for noisy speech recognition[A].Proceed-ings of ICSLP[C].Beijing,China,2000.869-872.
  • 3GONG Y F.A method of joint compensation of additive and convolu-tive distortions for speaker-independent speech recognition[J].IEEE Transaction on Speech Audio Processing,2005,13(5):975-983.
  • 4LI J Y,DENG L,YU D.A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions[J].Computer Speech and Language,2009,23(3):389-405.
  • 5VAN D,GALES M.Extended VTS for noise-robust speech recogni-tion[A].Proceedings of ICASSP[C].Taiwan,China,2009.3829-3832.
  • 6GALES M,FLEGO F.Combining VTS model compensation and support vector machines[A].Proceedings of ICASSP[C].Taiwan,China,2009.3821-3824.
  • 7LIAO H,GALES M.Joint Uncertainty Decoding for Robust Large Vocabulary Speech Recognition[R].Technical Report CUED/TR552.University of Cambridge,2006.
  • 8KING-ASR-009.A Chinese speech database for speech recogni-tion[EB/OL].http://www.speechocean.com/productdetail.asp?id=King-ASR-009,2010.
  • 9STEVEN F B,DENNIS C P.Feature and score normalization for speaker verification of cellular data[A].Proceedings of ICASSP[C].Hong Kong,China,2003.49-52.
  • 10HERMANSKY H,MORGAN N,BAYYA A.RASTA-PLP speech analysis technique[A].Proceedings of ICASSP[C].San Francisco,USA,1992.1121-1124.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部