期刊文献+

基于HMM/SVM两级结构的汉语易混淆语音识别 被引量:4

Confusable Chinese Speech Recognition Based on HMM/SVM Two-Level Architecture
原文传递
导出
摘要 基于 HMM 的汉语语音识别中,易混淆语音的识别率仍然不高.在分析 HMM 固有缺陷的基础上,本文提出一种使用 SVM 在 HMM 系统上进行二次识别来提高易混淆语音识别率的方法.通过引入置信度估计环节,提高系统性能和效率.通过充分利用 Viterbi 解码获得的信息来构造新的分类特征,从而解决标准 SVM 难以处理可变长数据的问题.详细探讨这种两级识别结构中置信度估计、分类特征提取和 SVM 识别器构造等问题.语音识别实验的结果显示,与采用 HMM/SVM 混合结构的模型相比,本文方法在对识别速度影响很小的情况下可以使识别率有明显提高.这表明所提出的具有置信估计环节的 HMM/SVM 两级结构用于易混淆语音识别是可行的. The recognition rate for confusable speech is still low in state-of-the-art Chinese speech recognition systems based on HMM. The inherent defects of HMM are analyzed, then a two-level-architecture recognition framework combining HMM and SVM is proposed. A confidence estimation module is adopted to improve the performance and efficiency of the system . The information obtained by Viterbi decoding is utilized to construct new classes of feature for SVM, which solves the problem that the conventional SVM cannot directly process variable length sequences. The relevant issues, such as confidence estimation, classification feature extraction and SVM recognizer construction, are addressed. The experimental results of confusable Chinese speech show that compared with the hybrid HMM/SVM based system the proposed method can highly improve the recognition rate with little impact on the running speed.
出处 《模式识别与人工智能》 EI CSCD 北大核心 2006年第5期578-584,共7页 Pattern Recognition and Artificial Intelligence
基金 国家自然科学基金项目(No.60575030) 黑龙江省留学归国基金项目(No.LC03C10) 教育部跨世纪优秀人才培养计划项目资助
关键词 语音识别 易混淆语音 隐马尔可夫模型(HMM) 支持向量机 Speech Recognition, Confusable Speech, Hidden Markov Model (HMM), Support Vector Machine (SVM)
  • 相关文献

参考文献11

  • 1Ganapathiraju A, Hamarker J, Pieone J. Support Vector Machines for Speech Recognition// Proc of the International Conference on Spoken Language Processing. Sydney, Australia,1998:2923-2926
  • 2Aldebaro K. Speech Recognition Using Discriminative Classifiers. Ph. D Dissertation. San Diero, USA: University of California, 2003
  • 3Ganapathiraju A, Hamaker J E, Picone J. Applications of Support Vector Machines to Speech Recognition. IEEE Trans on Signal Processing, 2004, 52(8): 2348-2355
  • 4Smith N, Gales M. Speech Recognition Using SVMs // Dietterich T G, Becker S, Ghahramani Z, eds. Advances in Neural Information Processing Systems 14. Cambridge, USA: MIT Press, 2002:117-129
  • 5Shimodaira H, Noma K, Nakai M, et al. Dynamic Time-Alignment Kernel in Support Vector Machine//Dietterich T G, Becker S, Ghahramani Z, eds. Advances in Neural Information Processing Systems 14. Cambridge, USA: MIT Press, 2002,Ⅱ: 921-928
  • 6Fine S, Saon G, Gopinath R A. Digit Recognition in Noisy Environments via a Sequential GMM/SVM System// Proe of the International Conference on Acoustics, Speech, and Signal Proeessing. Orlando, USA, 2002:2242-2246
  • 7Salomon J, King S, Osborne M. Framewise Phone Classification Using Support Vector Machines// Proe of the International Conferenee onSpoken Laoguage Processing. Denver, USA, 2002:2645-2648
  • 8Platt J C. Probabilities for SV Machines // Smola A J,Scholkopf B, Bartlett P L, et al, eds. Advances in Large Margin Classifiers. Cambridge, USA: MIT Press, 2000:61-74
  • 9Hsu C W, Lin C J. A Comparison of Methods for Multi-Class Support Vector Machines. IEEE Trans on Neural Networks,2002, 13(2): 415-425
  • 10Chang C C, Lin C J. LIBSVM: A Library for Support Vector Machines [EB/OL]. [2001-04-01] http://www.csie. ntu. edu. tw/- cjlin/libsvm

二级参考文献2

共引文献19

同被引文献59

引证文献4

二级引证文献197

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部