摘要
介绍了针对歌曲检索中出现的中英混合现象所开发的中英双语识别系统。在双语混合语音识别中,主要面临的2个问题:①在保证双语识别率的前提下控制系统的复杂度;②有效处理插入语中原用语引起的非母语口音现象。为了解决双语混合现象以及减少统计建模所需的数据量,通过音素混合聚类方法建立起一个统一的双语识别系统。在聚类算法中,提出了一种新型基于混淆矩阵的两遍音素聚类算法(TCM),并将该方法与基于声学似然度准则的聚类方法进行了比较。实验结果表明:利用TCM进行音素聚类的识别性能优于基于声学似然度音素聚类的性能,最终得到的中英双语识别系统在纯英文测试集上的短语错误率(PER)相对基线单英文识别系统下降7.19%;在双语混合测试集上PER相对基线混合模型下降13.78%;同时在纯中文测试集上保持了基线单中文识别系统的性能。
The Mandarin-English bilingual speech recognition system which has been developed for the Mandarin-English phenomenon in song retrieval is introduced, The main difficulties to handle the bilingual speech recognition for real world application are focused on two aspects : the first is to balance the performance on inter and intra-sentential language switching and to reduce the complexity of the bilingual speech recognition system; the second is to effectively deal with the ma- trix language accents in embedded language, In order to process the intra-sentential language switching and reduce the amount of data required to robustly estimate statistical models, instead of using two separate monolingual models for each language, a compact single set of bilingual acoustic model derived by phone set merging and clustering is developed, Hence, a novel Two-pass phone clustering method based on Confusion Matrix (TCM) is presented and compared with the log-likeli- hood measure method. Experiments testify that TCM can achieve better performance. The phrase error rate (PER) of MESRS for English utterances was reduced by 7.19% relatively compared to the baseline monolingual English system while the PER on Mandarin utterances was comparable to that of the baseline monolingual Mandarin system. The performance for bilingual utterances achieved 13.78% relative PER reduction.
出处
《重庆邮电大学学报(自然科学版)》
2008年第4期391-396,共6页
Journal of Chongqing University of Posts and Telecommunications(Natural Science Edition)
基金
国家高技术研究发展计划("863"计划
2006AA010102
2006AA01Z195)
国家重点基础研究发展规划项目计划("973"计划
2004CB318106)
国家自然科学基金资助(10574140
60535030)
关键词
双语识别
聚类算法
自适应
bilingual speech recognition
clustering algorithm
adaptation