集成语种辨识的中英文LVCSR系统

LVCSR system for english and mandarin integrated with language identification

下载PDF

导出

摘要为了在未知一段语音所属语言种类的情况下将其转换为正确的字符序列,将语种辨识(language identification,LID)同语音识别集成在一起建立了中、英文大词汇量连续语音识别(large vocabulary continuous speech recognition,LVCSR)系统。为了在中、英文连续语音识别系统中能够尽早的对语音所属的语言种类做出判决以便进行识别,从而降低解码的计算量,对语种辨识过程中的语种剪枝进行了研究,表明采用合理的语种剪枝门限在不降低系统性能的情况下,可以有效的降低系统的计算量及识别时间。 In order to transfer the speech into the correspond text without knowing the language, the language identification （LID） is integrated into speech recognition and then the large vocabulary continuous speech recognition （LVCSR） system is developed which support English and mandarin. The language pruning during the LID is discussed for making decision which language the sp6ech belong to early, then the speech can be recognized and the calculation is reduced in decoding. The experiments show that, if the pruning threshold is set reasonable, it could decrease the calculation, and so the system output the recognition result more quickly without losing the performance.

作者孙健王作英

机构地区清华大学

出处《计算机工程与设计》 CSCD 北大核心 2007年第8期1931-1933,共3页 Computer Engineering and Design

基金国家863高技术研究发展计划基金项目(2001AA114071)

关键词连续语音识别语种辨识段长分布非齐次隐含马尔科夫模型语种剪枝 continuous speech recognition language identification duration distribution inhomogeneous hidden Markov model language pruning

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1Waibel Alex,Geutner Petra,Laura Mayfield,et al.Mulitilinguality in speech and spoken language systems[J].Proceedings of the IEEE,2000,88(8):1297-1313.
2Azevedo J,Beires N,Charpentier F,et al.Multilinguality in voice activated information services:The P502 EURESCOM project[J].Speech Communication,2000,31:369-379.
3Uebler Ulla.Multilingual speech recognition in seven languages[J].Speech Communication,2001,35:53-69.
4王作英,肖熙.基于段长分布的HMM语音识别模型[J].电子学报,2004,32(1):46-49. 被引量：42
5Liu X,Gales MJG,Sim K C,et al.Investigation of acoustic modeling techniques for LVCSR system[C].Philadelphia,USA:IEEE ICASSP'05,2005.849-852.
6Graciarena Martin,Franco Horacio,Zheng Jing,et al.Voicing feature integration in SRI's decipher LVCSR system[C].Montreal,Quebec,Canada:IEEE ICASSP'04,2004.921-924.
7SantoshKumar S A,Ranmasubramanian V.Automatic language identification using Ergodic HMM[C].Philadelphia,USA:IEEE ICASSP'05,2005.609-612.
8Obuchi Y,Sato N.Language identification using phonetic and prosodic HMMs with feature normalization[C].Philadelphia,USA:IEEE ICASSP'05,2005.569-572.
9Povey D.Phone duration modeling for LVCSR[C].Montreal,Quebec,Canada:IEEE ICASSP'04,2004.829-832.
10肖熙.DDBHMM语音识别模型的训练和识别算法[D].北京:清华大学,2002.

二级参考文献2

1齐士钤张家禄.汉语普通话辅音音长分析[J].声学学报,1982,(1):8-13.
2王作英.基于段长分布的HMM语音识别模型 [A]..第二届全国汉字汉语识别会议 [C].庐山,1989.9.

共引文献41

1曹剑芬,李爱军,胡方,张利刚.语音学知识在语音识别中的应用:案例分析[J].清华大学学报（自然科学版）,2008,48(S1):748-753. 被引量：3
2李明琴,李涓子,王作英,陆大.语义分析和结构化语言模型[J].软件学报,2005,16(9):1523-1533. 被引量：7
3刘敬伟,王作英,肖熙.基于自回归模型的加性噪声环境稳健语音识别[J].清华大学学报（自然科学版）,2006,46(1):50-53. 被引量：2
4陈立伟,张晔.基于改进的隐马尔可夫和神经网络混合模型的语音识别[J].应用声学,2006,25(2):90-95.
5王宏,郭艳丽,贾新民.基于HMM的孤立字识别[J].昌吉学院学报,2006(1):94-98. 被引量：3
6范斐斐,李振波,陈佳品.基于K均值分段的语音识别在微机器人控制系统中的应用[J].电子技术应用,2006,32(5):4-6. 被引量：2
7赵蕤,王作英.语音识别中信道和噪音的联合补偿[J].声学学报,2006,31(5):466-470. 被引量：11
8贺无名.语音识别技术及其研究进展[J].中国科技信息,2006(18):157-158. 被引量：3
9孙健,王作英.融合段长信息的中、英文语种辨识[J].模式识别与人工智能,2006,19(5):567-571.
10王作英,孙健.一般拓扑结构的非齐次隐含马尔科夫模型及其在中、英文语种辨识中的应用[J].电子与信息学报,2007,29(4):867-869. 被引量：1

1王作英,孙健.一般拓扑结构的非齐次隐含马尔科夫模型及其在中、英文语种辨识中的应用[J].电子与信息学报,2007,29(4):867-869. 被引量：1
2孙健,王作英.融合段长信息的中、英文语种辨识[J].模式识别与人工智能,2006,19(5):567-571.
3孙健,王作英.基于DDBHMM的LVCSR系统的单步搜索算法[J].清华大学学报（自然科学版）,2006,46(10):1735-1738.
4倪崇嘉,刘文举,徐波.汉语大词汇量连续语音识别系统研究进展[J].中文信息学报,2009,23(1):112-123. 被引量：39
5罗骏,欧智坚,王作英.基于拼音图的两阶段关键词检索系统[J].清华大学学报（自然科学版）,2005,45(10):1356-1359. 被引量：1
6缪炜,侯丽敏.基于倒谱距离窗移最小失真分割的语种辨识[J].上海大学学报（自然科学版）,2007,13(2):116-120. 被引量：2
7飞龙,高光来,闫学亮,王炜华.基于分割识别的蒙古语语音关键词检测方法的研究[J].计算机科学,2013,40(9):208-211. 被引量：2
8陈雷,杨俊安,王一,王龙.LVCSR系统中一种基于区分性和自适应瓶颈深度置信网络的特征提取方法[J].信号处理,2015,31(3):290-298. 被引量：9
9吴治国,刘玉宇,王作英.基于段长分布的HMM的资源受限语音识别系统[J].计算机应用,2003,23(z2):316-318.
10单煜翔,邓妍,刘加.一种联合语种识别的新型大词汇量连续语音识别算法[J].自动化学报,2012,38(3):366-374. 被引量：10

计算机工程与设计

2007年第8期

浏览历史

内容加载中请稍等...

集成语种辨识的中英文LVCSR系统

参考文献10

二级参考文献2

共引文献41

相关作者

相关机构

相关主题

浏览历史