
Native and Non-Native Speech Recognition Based on Acoustic Model Merging

Original title: 基于模型融合的母语与非母语语音识别
Cited by: 3
Abstract: Native and non-native English pronunciation differ inherently, so a speech recognition model trained on native speech adapts poorly to non-native speakers and its recognition rate drops. An effective remedy is to build a compensation mechanism into the model so that it tolerates the pronunciation variation between native and non-native speakers. Starting from a baseline native English recognition system, this paper first analyzes how the English pronunciation of Chinese speakers changes under the influence of their mother tongue. Phonetic confusions and acoustic confusions are then modeled separately, using a multi-pronunciation dictionary for the former and acoustic model merging for the latter. The resulting model achieves a 15% absolute reduction in word error rate (WER) on Chinese-accented English, while the recognition rate on native English falls by less than 1%.
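Authors: 曾定 (Zeng Ding), 刘加 (Liu Jia)
Source: 《电子测量技术》 (Electronic Measurement Technology), 2009, No. 6, pp. 81-83, 115 (4 pages)
Funding: Joint project of the National Natural Science Foundation of China and Microsoft Research Asia (60776800); National High Technology Research and Development Program of China (863 Program), projects 2006AA010101, 2007AA04Z223, and 2008AA02Z414
Keywords: speech recognition; non-native; model merging; multi-pronunciation dictionary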

