Development of a Mandarin-English bilingual speech recognition system (Chinese title: 中英双语混合语音识别研究)
Abstract: This paper introduces a Mandarin-English bilingual speech recognition system developed to handle the Mandarin-English code-mixing that occurs in song retrieval. Bilingual speech recognition for real-world applications faces two main difficulties: (1) balancing performance on inter- and intra-sentential language switching while keeping system complexity under control; and (2) effectively handling the non-native accent that the matrix language induces in the embedded language. To handle intra-sentential language switching and to reduce the amount of data needed to estimate the statistical models robustly, a single compact bilingual acoustic model is built by merging and clustering the phone sets of the two languages, instead of using a separate monolingual model for each language. For the clustering step, a novel two-pass phone clustering method based on a confusion matrix (TCM) is proposed and compared with clustering based on an acoustic log-likelihood measure. Experiments show that TCM-based phone clustering gives better recognition performance than likelihood-based clustering. On the English-only test set, the phrase error rate (PER) of the resulting bilingual system (MESRS) is 7.19% lower, in relative terms, than that of the baseline monolingual English system. On the bilingual mixed test set, the PER is 13.78% lower, in relative terms, than that of the baseline mixed model. On the Mandarin-only test set, performance is comparable to that of the baseline monolingual Mandarin system.
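The abstract names a confusion-matrix criterion for phone clustering but does not describe the two passes of TCM, so the sketch below is not the paper's algorithm. It only illustrates the general idea under stated assumptions: a confusion matrix from a phone recognizer is turned into a symmetric similarity, and each English phone is mapped to its most confusable Mandarin phone when that similarity exceeds a threshold. The function names, the `en:`/`zh:` phone labels, the threshold, and all counts are hypothetical.

```python
from typing import Dict, List, Tuple

Counts = Dict[Tuple[str, str], int]  # (reference phone, recognized phone) -> count


def confusion_similarity(counts: Counts, a: str, b: str) -> float:
    """Average of the rates at which a is recognized as b and b as a."""
    total_a = sum(n for (ref, _), n in counts.items() if ref == a)
    total_b = sum(n for (ref, _), n in counts.items() if ref == b)
    if total_a == 0 or total_b == 0:
        return 0.0  # no observations for one phone: treat as dissimilar
    a_as_b = counts.get((a, b), 0) / total_a
    b_as_a = counts.get((b, a), 0) / total_b
    return 0.5 * (a_as_b + b_as_a)


def merge_phones(counts: Counts, mandarin: List[str], english: List[str],
                 threshold: float) -> Dict[str, str]:
    """Map each English phone to a Mandarin phone, or to itself if none is close.

    Single greedy pass (an assumption; the paper's method is two-pass).
    """
    mapping: Dict[str, str] = {}
    for e in english:
        best = max(mandarin, key=lambda m: confusion_similarity(counts, e, m))
        if confusion_similarity(counts, e, best) >= threshold:
            mapping[e] = best   # share one acoustic unit across languages
        else:
            mapping[e] = e      # keep a language-specific unit
    return mapping


# Hypothetical toy counts, not data from the paper.
counts: Counts = {
    ("en:s", "en:s"): 60, ("en:s", "zh:s"): 40,
    ("zh:s", "zh:s"): 70, ("zh:s", "en:s"): 30,
    ("en:th", "en:th"): 95, ("en:th", "zh:s"): 5,
    ("zh:a", "zh:a"): 100,
}
mapping = merge_phones(counts, ["zh:s", "zh:a"], ["en:s", "en:th"], threshold=0.2)
print(mapping)
```

In this toy setup the frequently confused pair is merged into one unit while the rarely confused English phone keeps its own model, which is the trade-off between model compactness and discrimination that the abstract describes.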
Source: Journal of Chongqing University of Posts and Telecommunications (Natural Science Edition) 《重庆邮电大学学报(自然科学版)》, 2008, No. 4, pp. 391-396 (6 pages)
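Funding: National High Technology Research and Development Program of China ("863" Program; 2006AA010102, 2006AA01Z195); National Basic Research Program of China ("973" Program; 2004CB318106); National Natural Science Foundation of China (10574140, 60535030)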
Keywords: bilingual speech recognition; clustering algorithm; adaptation