期刊文献+

快速口音自适应的动态说话人选择性训练 被引量:1

Dynamic speaker selected training for rapid speaker adaptation
原文传递
导出
摘要 为解决语音识别系统实用中的说话人口音快速自适应问题,提出了一种动态说话人选择性训练方法。基于说话人选择性训练方法,采用基于Gauss混合模型似然分数计算的置信测度选择训练用说话人,改变训练用说话人的绝对数目选取方式,提高了选取的效能并拓展了选取标准的推广性。根据各个训练用说话人同被适应说话人的不同似然程度,加权地合成动态说话人选择性训练的语音模型,提高了自适应训练的效果。实验表明:该方法使识别率从80.16%提高到84.12%,相对误识率降低了19.96%,在实用中提高了基线系统的识别性能。 Practical speech recognition systems need rapid speaker adaptation to be effective with a wide variety of speakers. A dynamic speaker selected training method developed for rapid speaker adaptation improves the basic speaker selected training method by replacing the absolute number selection method used in the basic method with a confidence measure calculated from the Gaussian mixture model likelihood. The new method enhances both the training speaker selecting efficiency and the selecting adaptability. The dynamic acoustic model, which uses different weightings for each training speaker so that they resemble the adapted speaker, further increases the recognition accuracy rate. Simulation show that the dynamic method improves the baseline recognition accuracy rate from 80.1% to 84.1%, with a decrease of 19.96% in the relative error rate. Thus, the dynamic method rapidly increases practical speech recognition system performance.
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2005年第7期912-915,共4页 Journal of Tsinghua University(Science and Technology)
基金 国家自然科学基金资助项目(60272016)
关键词 语音识别 说话人快速自适应 置信测度 speech recognition rapid speaker adaptation confidence measure
  • 相关文献

参考文献6

  • 1Hazen Timothy J .A comparison of novel techniques for rapid speaker adaptation [J].Speech Communication,2000,31:15-33.
  • 2Gauvain Jean-Luc,Lee Chin-Hui.Maximum a posteriori estimation for multivariate gaussian mixture observations of Markov chains [J].IEEE Trans SAP,1994,2:291-298.
  • 3Leggetter C J,Woodland P C.Maximum likely- hood linear regression for speaker adaptation of continuous density hidden Markov models [J].Computer Speech and Language,1995,9(2):171-185.
  • 4Padmanabhan M,Bahl L R,Nahamoo D,et al.Speaker clustering and transformation for speaker adaptation in speech recognition systems [J].IEEE Trans on Speech and Audio Processing,1998,6(1):71-77.
  • 5WU Jian,CHANG Eric.Cohorts based Custom models for rapid speaker and dialect adaptation [A].Proc Eurospeech [C].Aalborg,Denmark:ISCA Press,2001,2:1261-1264.
  • 6HUANG Chao,CHEN Tao,CHANG Eric.Speaker selection training for large vocabulary continuous speech recognition [A].Proceedings of IEEE International Conference on Acoustics Speech and Signal Processing [C].Orlando,Florida:IEEE Press,2002.1:609-612.

同被引文献3

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部