摘要
母语与非母语英语发音方式通常存在固有的差别,这导致基于母语发音训练的语音识别模型不能适应非母语说话人。一种有效的方法是建立模型的补偿机制,来容忍母语与非母语说话人之间的发音变化。分析了中国人受母语的影响带来的英语发音变化,针对音素变化和声音变化,分别采用多发音字典和模型融合技术,实现了中国人说英语的语音识别率提高了15%,但母语英语的语音识别率下降不到1%。
The inherent differences between native and non-native language pronunciation can lead to non-native language rate of decline using the model trained with native language speech. The confusions between Native and non-Native speaker lead the rate of decline. , which need to create a new model to tolerance this change. Set up on baseline Native English recogntion system, the character of Chinese people speaking English is firstly analyzed in this paper. We propose to analyze and model the phonetic and acoustic confusuons separately, using pronunciation dictionary and acoustic model merging technology to create a new model, with a significant 15% absolute WER reduction on the Chinese English, which only sacrifics 1% recognition rate on the native English.
出处
《电子测量技术》
2009年第6期81-83,115,共4页
Electronic Measurement Technology
基金
国家自然科学基金委员会与微软亚洲研究院联合资助项目60776800
国家高技术研究发展计划(863计划):项目2006AA010101
项目2007AA04Z223
项目2008AA02Z414
关键词
语音识别
非母语
模型融合
多发音字典
speech recognition
non-native
model merging
pronunciation dictionary