期刊文献+

一种引入帧间相关信息的HMM语音识别方法 被引量:2

A METHOD OF HMM SPEECH RECOGNITION INTRODUCED INTER-FRAME CORRELATION
下载PDF
导出
摘要 该文提出了一种基于复数帧段输入HMM的语音识别方法,它采用相继的复数帧组成的特征参数向量作为语音识别HMM的输入,能有效地在语音识别HMM中引入帧间相关信息。为了进一步改善复数帧段输入HMM的输出概率分布函数,作者还提出了用MGDF和RBF函数作为复数帧段输入HMM的输出概率分布函数的方法。通过对非特定人汉语孤立数字和连续数字语音识别试验,证实了该文提出的引入帧间相关信息方法的有效性。 This paper applies segmental unit into HMM for speech recognition. In this model, several successive frames are combined and treated as an input vector. It expects that segmental unit input HMM would be effective to describe the inter-frame correlation information and has also proposed the MGDF and RBF to further improve output probability function. By comparing them with the traditional HMMs based on their speech recognition performance rates through the experiments of speaker-independent spoken digit (isolated/connected) recognition, the validity of the proposed appraoch could be verified.
出处 《电子与信息学报》 EI CSCD 北大核心 2001年第4期327-331,共5页 Journal of Electronics & Information Technology
关键词 语音识别 隐马尔可夫模型 帧间相关信息 复数帧段输入 Speech recognition, Hidden Markov modei, Inter-frame correlation information, Segmental unit input
  • 相关文献

参考文献7

  • 1[1]V.N. Gupta, M. Lennig, P. Mermelstein, Integration of acoustic information in a large vocabulary word recognizer, ICASSP-87, Dallas, USA, 1987.2, 697-700.
  • 2[3]L. Deng, M. Aksmanoric, X. Sun, C. F. J. Wu, Speech recognition using hidden Markov models with polynomial regression functions as stationary states, IEEE Trans. on Speech & Audio Processing, 1994, (4), 507-520.
  • 3[4]C.J. Wellekens, Explicit correlation in hidden Maarkov model with optimized inter-frame dependence, ICASSP-95, Detroit, USA, 1995.1,209-212.
  • 4[7]M. Ostendorf, S. Roukos, A stochastic segment model for phoneme-based continuous speech recognition, IEEE Trans. on Acoust., Speech & Signal Processing, 1989, ASSP-37(12), 1857-1869.
  • 5[8]T. Wakabayashi, S. Tsuruokaet, ed al., On the size and variable transformation of feature vector for handwritten character, IEICE, J76-D- Ⅱ (12), 2495-2503.
  • 6[9]L. Zhao, H. Suzuki, S. Nakagawa, A comparison study of probability functions in HMMs through spoken digit recognition, IEICE, TRANS.INF and SYST., 1995, E78-D(6), 669-675.
  • 7[10]S. Nakagawa, Estimation of probability density function and a posteriori probability and evaluation by vowel recognition, IEICE, Technical Report, 1992, SP92-24, 61-72.

同被引文献15

引证文献2

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部