Research on the Application of Speaker Adaptation Technology in Uyghur Speech Recognition (cited by: 4)

Speaker Adaptation Technology in Uyghur Continuous Speech Recognition
Abstract: Pronunciation differences between Uyghur speakers degrade the performance of Uyghur speech recognition systems to some degree. To address this, the paper studies speaker adaptation and applies three commonly used methods, MLLR, MAP, and combined MAP+MLLR, to acoustic model training for Uyghur continuous speech recognition. The acoustic models adapted with each of the three methods are then evaluated in recognition experiments on the test set. The experimental results show that MLLR, MAP, and MAP+MLLR adaptation reduce the word error rate of the baseline recognition system by 0.6%, 2.34%, and 2.57%, respectively.
Source: Journal of Chinese Information Processing (《中文信息学报》), indexed in CSCD and the Peking University Core Journals list, 2016, No. 3, pp. 79-84 (6 pages)
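Funding: National Natural Science Foundation of China (61363064); Xinjiang Uyghur Autonomous Region Science and Technology Program project (201312104); Tsinghua University-Tencent Joint Laboratory for Internet Innovation Technology innovation project (2012-04)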
Keywords: Uyghur; speech recognition; speaker adaptation; MLLR; MAP
