期刊文献+

Maximum likelihood polynomial regression for robust speech recognition

Maximum likelihood polynomial regression for robust speech recognition
原文传递
导出
摘要 The linear hypothesis is the main disadvantage of maximum likelihood linear re- gression (MLLR). This paper applies the polynomial regression method to model adaptation and establishes a nonlinear model adaptation algorithm using maximum likelihood polynomial regression (MLPR) for robust speech recognition. In this algorithm, the nonlinear relationship between training and testing Gaussian means in every Mel channel is approximated by a set of polynomials and the polynomial coefficients are estimated from adaptation data in test envi- ronment using the expectation- maximization (EM) algorithm and maximum likelihood (ML) criterion. The experimental results show that the second-order polynomial can approximate the actual nonlinear function better and in noise compensation and speaker adaptation, the word error rates of MLPR are significantly lower than those of MLLR. The proposed MLPR algorithm overcomes the limitation of linear hypothesis well and can decrease the impact of noise, speaker and other factors simultaneously. It is especially suitable for joint adaptation of speaker and noise. The linear hypothesis is the main disadvantage of maximum likelihood linear re- gression (MLLR). This paper applies the polynomial regression method to model adaptation and establishes a nonlinear model adaptation algorithm using maximum likelihood polynomial regression (MLPR) for robust speech recognition. In this algorithm, the nonlinear relationship between training and testing Gaussian means in every Mel channel is approximated by a set of polynomials and the polynomial coefficients are estimated from adaptation data in test envi- ronment using the expectation- maximization (EM) algorithm and maximum likelihood (ML) criterion. The experimental results show that the second-order polynomial can approximate the actual nonlinear function better and in noise compensation and speaker adaptation, the word error rates of MLPR are significantly lower than those of MLLR. The proposed MLPR algorithm overcomes the limitation of linear hypothesis well and can decrease the impact of noise, speaker and other factors simultaneously. It is especially suitable for joint adaptation of speaker and noise.
出处 《Chinese Journal of Acoustics》 2011年第3期358-370,共13页 声学学报(英文版)
基金 supported by the 973 Program of China(2002CB312102) the National Natural Science Foundation of China(60672094)
  • 相关文献

参考文献4

二级参考文献36

  • 1国立新,莫福源,李昌立.基于连续高斯混合密度HMM的汉语全音节语音识别研究[J].声学学报,1995,20(5):321-329. 被引量:11
  • 2孙暐,吴镇扬,刘海滨,周琳.并行子带HMM最大后验概率自适应非线性类估计算法[J].电路与系统学报,2005,10(6):20-24. 被引量:1
  • 3Ivandro Sanches. Noise-compensated hidden Markov models. IEEE Trans on Speech and Audio Processing, 2000;8(5): 533-540.
  • 4Hwang T H, Lee L M, Wang H C. Cepstral behavior due to additive noise and a compensation scheme for noisy speech recognition. IEEProc of Vis Image Signal Process, 1998;145(5): 316-321.
  • 5Mansour D, Juang B H. The short-time modified coherence representation and its application for noisy speech recognition. IEEE Trans Acoust , Speech, Signal Processing,1980; 28(4): 357-366.
  • 6Javier Hernando, Climent Nadeu. Linear prediction of the one-sided autocorrelation sequence for noisy speech recognition. IEEE Transactions on Speech and Audio Processing, 1997; 5(1): 80-84.
  • 7Davis S B, Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentence. IEEE Trans Acoust , Speech,Signal Processing, 1989; 37(6): 795-804.
  • 8Yoon Kim, Smith J O. A speech feature based on bark frequency warping-the non-uniform linear prediction cepstrum. Proc of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New York, 1999(10):17-20.
  • 9Rabiner L. Fundamentals of speech recognition. Prentice Hall, 1993.
  • 10Smith J O, Abel J S. Bark and ERB bilineax transform. IEEE Trans on Speech and Audio Processing, 1999; 7(6):697-708.

共引文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部