LVCSR System for English and Mandarin Integrated with Language Identification
Abstract: To convert speech into the correct character sequence when its language is not known in advance, language identification (LID) is integrated with speech recognition to build a large vocabulary continuous speech recognition (LVCSR) system for Mandarin and English. To decide the language of the speech as early as possible during recognition, and thereby reduce the decoding computation, language pruning during LID is studied. Experiments show that a reasonably chosen language pruning threshold effectively reduces the system's computation and recognition time without degrading recognition performance.
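The abstract does not state the exact pruning criterion, so the following is only a minimal sketch of one plausible form of language pruning. It assumes the decoder runs one search per language in parallel and exposes, after each frame, the cumulative log-likelihood of the best path for that language. The function name `prune_languages`, the score layout, and the demo values are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Optional, Tuple


def prune_languages(
    frame_scores: List[Dict[str, float]],
    threshold: float,
) -> Tuple[Optional[str], int, int]:
    """Drop a language once its best path trails the leader by more than `threshold`.

    frame_scores[t][lang] is an assumed cumulative log-likelihood of the best
    decoding path for `lang` after frame t (hypothetical interface).
    Returns (decided language or None, frame index of the decision or -1,
    number of per-language decoder steps that were evaluated).
    """
    if not frame_scores:
        return None, -1, 0
    active = set(frame_scores[0])
    decided_at = -1
    work = 0
    for t, scores in enumerate(frame_scores):
        work += len(active)  # one decoder step per language still alive
        if len(active) > 1:
            best = max(scores[lang] for lang in active)
            # Keep only languages whose best path is within the pruning threshold.
            active = {lang for lang in active if best - scores[lang] <= threshold}
            if len(active) == 1:
                decided_at = t
    language = next(iter(active)) if len(active) == 1 else None
    return language, decided_at, work


# Made-up scores for four frames; "en" pulls ahead of "zh" over time.
demo_scores = [
    {"en": -10.0, "zh": -11.0},
    {"en": -20.0, "zh": -24.0},
    {"en": -30.0, "zh": -39.0},
    {"en": -40.0, "zh": -55.0},
]

print(prune_languages(demo_scores, threshold=5.0))
print(prune_languages(demo_scores, threshold=float("inf")))
```

With a finite threshold the trailing language is dropped as soon as the gap exceeds it, so later frames cost one decoder step instead of two. An infinite threshold disables pruning and leaves the language undecided, in which case a real system would presumably fall back to the language with the best final score. A threshold that is too small risks dropping the correct language early, which is the trade-off the abstract refers to.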
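Authors: Sun Jian (孙健), Wang Zuoying (王作英)
Affiliation: Tsinghua University
Source: Computer Engineering and Design (《计算机工程与设计》), CSCD, Peking University Core Journal, 2007, No. 8, pp. 1931-1933 (3 pages)
Funding: National High Technology Research and Development Program of China (863 Program), grant 2001AA114071
Keywords: continuous speech recognition; language identification; duration distribution; inhomogeneous hidden Markov model; language pruning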