期刊文献+

用于语音识别的基于频谱调整的信道自适应方法 被引量:2

Channel adaptation method based on spectral adjusting for speech recognition
原文传递
导出
摘要 语音识别系统在实际应用时,其性能会因各种因素而下降,其中重要的一个因素是信道的不匹配。该文提出了一种新的信道自适应方法——频谱调整法。该方法在频域上定义一个分段线性信道归一化函数,根据最大似然准则利用梯度投影法求其最优参数后,对语音的幅度频谱进行归一化。实验表明,该方法可以利用很少的自适应数据使识别的字错误率下降10%左右。 Channel mismatch is one of the main causes of degradation in speech recognition performance. This paper presents a channel adaptation method named spectral adjusting (SA) normalizes the distorted speech spectrum with a piecewise linear normalization function. The function parameters were obtained using the Gradient Projection algorithm based on the maximum likelihood criterion. Tests show that the method is able to reduce the word error rate by about 10% even with very short utterance.
作者 赵蕤 王作英
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2005年第4期441-444,共4页 Journal of Tsinghua University(Science and Technology)
基金 国家"八六三"高技术项目(2001AA114071)
关键词 信息处理 语音识别 稳健性 信道自适应 information processing speech recognition robust channel adaptation
  • 相关文献

参考文献6

  • 1王作英.基于段长分布的HMM语音识别模型[A]..第二届全国汉字?汉语识别会议[C].庐山,1989..
  • 2Zhao Y. An Acoustic-phonetic-based speaker adaptation technique for improving speaker-independent continuous speech recognition [J].IEEE Trans Speech Audio Processing, 1994, (2): 380-394.
  • 3Rahim M G, Juang B J. Signal bias removal by maximum likelihood estimation for robust telephone speech recognition[J]. IEEE Trans Speech Audio Processing, 1998, (4): 19-30.
  • 4Kim D Y, Un C K, Kim N S. Speech recognition in noisy environments using first-order vector Taylor series [J].Speech Communication, 1998, (24): 39-49.
  • 5Zhao Y. Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises [J]. IEEE Trans Speech Audio Processing, 2000,(8): 255 - 266.
  • 6Leggetter N S, Woodland P C. MLLR for speaker adaptation of continuous density hidden Markov modes [J]. Computer Speech and Language, 1995, (9): 171 - 186.

共引文献2

同被引文献5

  • 1Heck L P,Weintraub M.Handset-dependent background models for robust text-independent speaker recognition[J].ICASSP,1997,2:1071-1074.
  • 2Quatieri T F,Reynolds D A,O'Leary G C.Estimation of handset nonlinearity with application to speaker recognition[J].IEEE Transaction On Speech And Audio Processing,2000,8(5):567-584.
  • 3Westphal M.The use of cepstral means in conversational speech recognition[C]∥ Proceedings of Eurospeech 97,Berlin:[s.n.],1997:1143-1146.
  • 4Reynolds D A.Channel robust speaker verification via feature mapping[J].ICASSP,2003,2:53-56.
  • 5Reynolds D A,Quatieri T F,Dunn R B.Speaker verification using adapted Gaussion mixture models[J].Digital Signal Processing,2000,10:19-41.

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部