期刊文献+

基于最大似然线性回归的随机段模型说话人自适应研究

Research of speaker adaptation of stochastic segment models using maximum likelihood linear regression
下载PDF
导出
摘要 提出了一种随机段模型系统的说话人自适应方法。根据随机段模型的模型特性,将最大似然线性回归方法引入到随机段模型系统中。在"863-test"测试集上进行的汉语连续语音识别实验显示,在不同的解码速度下,说话人自适应后汉字错误率均有明显的下降。实验结果表明,最大似然线性回归方法在随机段模型系统中同样能取得较好的效果。 A speaker adaptation method of Stochastic Segment Model (SSM) is proposed.According to the SSM's characteristics,the theory of Maximum Likelihood Linear Regression (MLLR) method is introduced into the SSM-based systems.Continuous Chinese speech recognition experiment on " 863test" test suite shows that the proposed method makes the error rate of Chinese characters decrease obvi ously under different decoding speeds.Experiment results indicate that the proposal can also improve the recognition performance on the SSM-based systems.
出处 《计算机工程与科学》 CSCD 北大核心 2014年第8期1604-1608,共5页 Computer Engineering & Science
基金 国家自然科学基金资助项目(91120303 90820303 90820011) 国家973计划资助项目(2004CB318105) 国家863计划资助项目(20060101Z4073 2006AA01Z194)
关键词 语音识别 说话人自适应 最大似然线性回归 随机段模型 speech recognition speaker adaptation maximum likelihood linear regression stochastic segment model
  • 相关文献

参考文献3

二级参考文献26

  • 1张昊天.[D].北京:清华大学电子工程系,2000.
  • 2Chengalvarayan R,LI Deng.A maximum a posteriori approach to speaker adaptation using the trended hidden Markov model [J].IEEE Trans on Speech and Audio Processing,2001,9(5):549-557.
  • 3Lee C-H,Lin C-H,Juang B-H.Speaker adaptation of continuous density HMM's using linear regression [A].Proc 3rd Int Conf on Spoken Language Processing (ICSLP'94) [C].Yokohama:IEEE Press,1994.451-454.
  • 4Kuhn R,Junqua J-C,Nguyen P,et al.Rapid speaker adaptation in eigenvoice space [J].IEEE Trans on Speech and Audio Processing,2000,8(6):695-707.
  • 5Botterweck H.Very fast adaptation for large vocabulary continuous speech recognition using eigenvoices [A].Proc 6th Int Conf on Spoken Language Processing (ICSLP'00) [C].Piscataway,NJ,USA:IEEE Press,2000.354-357.
  • 6Jolliffe I T.Principal Component Analysis [M].Berlin:Springer-Verlag,1986.
  • 7CHEN Kuan-ting,LIAU Wen-wei,WANG Hsin-min,et al.Fast speaker adaptation using eigenspace-based maximum likelihood linear regression [A].Proc 6th Int Conf on Spoken Language Processing (ICSLP'00) [C].Piscataway,NJ,USA:IEEE Press,2000.742-745.
  • 8Lee C-H,Lin C-H,Juang B-H.A study on speaker adaptation of the parameters of continuous density hidden Markov models [J].IEEE Trans on Signal Processing,1991,39(4):806-814.
  • 9Dugakakis V.V,Ostendorf M,Rohlicek J.R..Fast algorithms for phone classification and recognition using segment-based models.IEEE Transactions Speech Audio Processing,1992,40(12):2885~2896
  • 10Lee C,Glass R..Real-time probabilistic segmentation for segment-based speech recognition.In:Proceedings of the International Conference on Spoken Language Processing,Sydney,Australia,1998,1803~1806

共引文献43

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部