混合双语语音识别的研究被引量：1

Research on Chinese-English bilingual speech recognition

下载PDF

导出

摘要随着现代社会信息的全球化,双语以及多语混合的语言现象日趋普遍,随之而产生的双语或多语语音识别也成为语音识别研究领域的热门课题。在双语混合语音识别中,主要面临的问题有两个:一是在保证双语识别率的前提下控制系统的复杂度;二是有效处理插入语中原用语引起的非母语口音现象。为了解决双语混合现象以及减少统计建模所需的数据量,通过音素混合聚类方法建立起一个统一的双语识别系统。在聚类算法中,提出了一种新型基于混淆矩阵的两遍音素聚类算法,并将该方法与传统的基于声学似然度准则的聚类方法进行比较;针对双语语音中非母语语音识别性能较低的问题,提出一种新型的双语模型修正算法用于提高非母语语音的识别性能。实验结果表明,通过上述方法建立起来的中英双语语音识别系统在有效控制模型规模的同时,实现了同时对两种语言的识别,且在单语言语音和混合语言语音上的识别性能也能得到有效保证。 In recent years, bilingual communication becomes a common phenomenon as a result of globalization. It presents a new challenge to the real world applications of speech recognition technology. The main difficulties to handle the bilingual speech recognition for real world application are focused on two aspects： the first is to balance the performance on inter- and intra-sentential language switching and to reduce the complexity of the bilingual speech recognition system; the second is to effectively deal with the matrix language accents in embedded language. In order to process the intra-sentential language switching and reduce the amount of data required to robustly estimate statistical models, instead of using two separate monolingual models for each language, a compact single set of bilingual acoustic model derived by phone set merging and clustering is developed. In our study, a novel Two-pass phone clustering method based on Confusion Matrix （TCM） is presented and compared with the log-likelihood measure method. In order to deal with the nonnative accents in the bilingual speech recognition, a novel bilingual model modification approach is presented to improve nonnative speech recognition, considering these great variations of accented pronunciations. Experiments testify that with these proposed methods, the Chinese-English bilingual speech recognition system can handle the bilingual speech recognition effectively and efficiently.

作者张晴晴潘接林颜永红

机构地区中国科学院声学研究所

出处《声学学报》 EI CSCD 北大核心 2010年第2期270-275,共6页 Acta Acustica

基金国家高技术研究发展计划(863计划,2006AA010102) 国家科技支撑计划(2008BAI50B00) 国家重点基础研究发展规划项目计划(973计划,2004CB318106) 国家自然科学基金(10874203,60875014,60535030)资助项目

关键词语音识别系统混合语言双语识别性能聚类方法聚类算法控制模型控制系统 Feature extraction Linguistics Telephone sets

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献17

1Wang Z, Topkara U, Schultz T, Waibel A. Towards universal speech recognition. In: Proc. ICMI 2002.
2Martincic-Ipsic S, Zibert J, Ipsic I, Mihelic F, Pavesic N. Bilingual speech recognition for a weather information retrieval dialog system. EUROCON 2003, Ljubljana, Slovenia. The IEEE Region 8, 2003.
3Yu S, Zhang S, Xu S. Chinese-English bilingual phone modeling for cross-language speech recognition. International Conference on Natural Language Processing and Knowledge Engineering, 2003:603-609.
4Tomokiyo L M, Waibel A. Adaptation methods for nonnative speech, in Proceedings of Multilinguality in Spoken Language Processing, 2001.
5Myers-Scotton . Duelling languages: Grammatical structure in codeswitching. (1997 edition with a new After- word). Oxford: Clarendon Press, 1993.
6Ye H, Young S. Improving the speech recognition performance of beginners in spoken conversational interaction for language learning. Interspeech 2005, Lisbon, Portugal.
7Zhang Q, Pan J, Yan Y. Mandarin-English bilingual speech recognition for real world music retrieval. In ICASSP-2008, 2008:4253-4256.
8Humphries J, Woodland P, Pearce D. Using accent-specific pronunciation modeling for robust speech recognition. In: Proc. ICSLP'96, Philadelphia, PA, 1996:2324-2327.
9Livescu K. Analysis and modeling of non-native speech for automatic speech recognition. Master's thesis, MIT, 1999.
10Wang Z, Schultz T, Waibel A. Comparison of acoustic model adaptation techniques on non-native speech. In: Proc. ICASSP 2003.

同被引文献4

1汤玲,戴斌.抗噪声语音识别及语音增强算法的应用[J].计算机仿真,2006,23(9):80-82. 被引量：5
2吕丹桔,Mei-Yuh Huang,B Hoffmeister.汉语连续语音识别之音素声学模型的改进[J].计算机仿真,2010,27(5):355-358. 被引量：7
3李宁,徐守坤,马正华,石林.自适应语音识别算法仿真研究[J].计算机仿真,2011,28(8):181-185. 被引量：9
4郑展恒.数字语音识别系统[J].桂林电子科技大学学报,2011,31(6):439-441. 被引量：4

引证文献1

1李梓钰,林子明,程晓东,杨洁.基于中英文数字语音登陆系统的仿真研究[J].电子产品世界,2012,19(6):53-55.

1张晴晴,潘接林,颜永红.中英双语混合语音识别研究[J].重庆邮电大学学报（自然科学版）,2008,20(4):391-396.
2苟建兵,倪维斗.基于DLL的混合语言编程[J].软件产业,1996(9):12-15. 被引量：2
3余军.一种高速数据传输中的软硬件解决方法[J].工业控制计算机,2002,15(5):26-29. 被引量：1
4朱大勇,许毅,冯山.Java和Lisp接口问题的研究[J].计算机应用,2003,23(4):84-85.
5朱培民,屠万生.Fortran和Pascal语言的混合编程方法[J].计算机应用与软件,2002,19(1):25-28.
6HAYS.,R,邵惠玲.一种用于构造分布式混合语言的简单系统[J].软件,1989,10(1):38-53.
7宋志宏.用C和FORTRAN开发Windows应用软件的基本方法[J].微小型计算机开发与应用,1998(5):13-16.
8谢壮宁.Microsoft　FORTRAN的鼠标编程方法[J].计算机应用研究,1995,12(6):52-55.
9董斌,熊刚,邵惠鹤.用混合编程技术实现可固化在EPROM中的控制软件[J].自动化仪表,1997,18(6):11-13.
10冯山,朱大勇,许毅.C#和Lisp编程接口问题研究[J].电讯技术,2003,43(3):126-128.

声学学报

2010年第2期

浏览历史

内容加载中请稍等...

混合双语语音识别的研究被引量：1

参考文献17

同被引文献4

引证文献1

相关作者

相关机构

相关主题

浏览历史

混合双语语音识别的研究 被引量：1

参考文献17

同被引文献4

引证文献1

相关作者

相关机构

相关主题

浏览历史

混合双语语音识别的研究被引量：1