期刊文献+

汉语大词汇量连续语音识别系统研究进展 被引量:39

Research on Large Vocabulary Continuous Speech Recognition System for Mandarin Chinese
下载PDF
导出
摘要 大词汇量连续语音识别(LVCSR)技术近年来发展迅速,并在许多领域得到了广泛的应用,国内外许多大公司加大了对语音识别技术的研究,不少商业化的语音识别系统已经面世,并得到较为广泛的使用。该文综述了近年来大词汇量连续语音识别技术的研究进展,描述了汉语大词汇量连续语音识别系统,主要是基于统计方法的语音识别系统的框架与设计方法,对语音识别系统的一些关键技术和原理进行了分析,并对近年来国内外对语音识别研究发展动向进行了讨论。 The technology of large vocabulary continuous speech recognition(LVCSR)has developed quickly and achieved broad application in recent years. Many big companies has reinforced the speech recognition research and various commercial systems have appeared in the market, This paper reviews the recent research progresses of LVCSR and describes the main frames and designs of current mandarin Chinese LVCSR systems. The key issues and principles in LCVSR are analyzed in detail. The prospects and research trends for LVCSR at home and abroad are also discussed.
出处 《中文信息学报》 CSCD 北大核心 2009年第1期112-123,128,共13页 Journal of Chinese Information Processing
基金 国家重点基础研究发展计划(973)资助项目(2004CB318105) 国家高技术研究发展计划(863)资助项目(2006AA01Z194,20060101Z4073) 国家自然科学基金资助项目(60675026,60121302,90820011)
关键词 计算机应用 中文信息处理 综述 语音识别 模型自适应 搜索技术 computer application Chinese information processing overview speech recognition model adaptation search technology
  • 相关文献

参考文献80

  • 1RABINER L R,JUANG B H.Fundamentals of speech recognition[M].北京:清华大学出版社,1999.
  • 2T. K. Vintsyuk, Speech recognition by dynamic programming [J]. Kibernetika, 1968, (1): 11 -18.
  • 3MAKHOUL, J., Linear Prediction: A Tutorial Re view[C]// Proc. IEEE, 1975, 63(4).
  • 4F. Jelinek, Continuous speech recognition by statistical methods[C]// Proc IEEE, 1976, 64(4): 532 -556.
  • 5Lee, K.-F., Automatic speech recognition.. The development of the SPHINX system [M].Boston: Kluwer Academic Publishers 1989.
  • 6PRICE, P.J., A database for continuous speech recog nition in a 1000-word domain[C]// Pro. ICASSP, 1988. 11: 651-654.
  • 7钱跃良,林守勋,刘群,刘宏.2005年度863计划中文信息处理与智能人机接口技术评测回顾[J].中文信息学报,2006,20(B03):1-6. 被引量:4
  • 8Davis, S.B. and P. Mermelstein, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences [C]// IEEE Trans. on Acoustic, Speech and Signal Processing, 1980, 28(4): 357-366.
  • 9Hermansky, H., Perceptual linear predictive (PLP) analysis of speech [J]. Journal of the Acoustical Society of America, 1990, 87(4): 1738-1752
  • 10Viiki, O., D. Bye, and K. Laurila, A recursive feature vector normalization approach for robust speech recognition in noise [C]// Pro. ICASSP, 1998: 733-736.

二级参考文献9

  • 1黄昌宁.统计语言模型能做什么?[J].语言文字应用,2002(1):77-84. 被引量:31
  • 2863评测网站[EB].http://www.863data.org.cn.英文版:http://www.863data.org.cn/english.
  • 3NIST语音类评测网站[EB].http://www.nist.gov/speech/tests/index.htm.
  • 4NIST机器翻译评测网站[EB].http://www.nist.gov/speech/tests/mt/index.htm.
  • 5TREC网站[EB].http://trec.nist.gov/.
  • 6CLEF评测网站[EB].http://www.clef-campaign.org/.
  • 7NTCIR评测网站[EB].http://research.nii.ac.jp/ntcir/workshop/.
  • 8MUC7 [EB]: http://www. itl. nist. gov/iaui/894.02/related_projects/muc/proceedings/muc_7_toc. html.
  • 9SIGHAN网站[EB].http://www.sighan.org/.

共引文献5

同被引文献423

引证文献39

二级引证文献212

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部