期刊文献+

鲁棒语音识别技术综述 被引量:4

Review of robust speech recognition
下载PDF
导出
摘要 鲁棒语音识别是为了解决噪声环境所引起的语音识别系统识别和训练不匹配的情况.依据噪声对语音识别系统的影响,从信号空间、特征空间及模型空间3个层面上分别对语音增强技术、特征增强技术及语音模型补偿、增强技术进行了总结,并分析了不同方法的特点、实现及应用. To solve the mismatch between the training and recognition environment, some robust speech recognition methods were proposed. Based on the influence of noise on Automatic Speech Recognition (ASR) system, some classified and summarized robust speech recognition technologies in the aspects of speech enhancement, feature enhancement and model compensation / enhancement aiming at the signals space, feature space and model space of ASR system were presented in this paper. Furthermore, some main ideas of these approaches were analyzed.
出处 《安徽大学学报(自然科学版)》 CAS 北大核心 2013年第5期17-24,共8页 Journal of Anhui University(Natural Science Edition)
基金 国家自然科学基金资助项目(61271352) 安徽大学校学术与技术带头人引进工程基金资助项目(02303203)
关键词 鲁棒 语音识别 语音增强 特征增强 语音模型补偿 增强 robust speech recognition speech enhancement feature enhancement model compensation/enhancement
  • 相关文献

参考文献22

  • 1Dautrich B, Rabiner L, Martin T. On the effects of varying filter bank parameters on isolated word recognition [ J ]. Acoustics, Speech and Signal Processing, IEEE Transactions on, 1983,31 (4) :793-807.
  • 2Lockwood, P A. Experiments with a nonlinear spectral subtract and hidden Markov models and the projection for robust speech recognition in cars [ J ]. Speech Communication, 1992,11 (2/3) :215-228.
  • 3Das S, Bakis R, N6das A, et al. Influence of background noise and microphone on the performance of the IBM TANGORA speech recognition system[ C ]. Acoustics, Speech, and Signal Processing, 1993 ICASSP-93,1993 IEEE International Conference on IEEE, 1993,2:71-74.
  • 4Preuss R. A frequency domain noise cancelling preprocessor for narrowband speech communications systems [ C ]. Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP"/9 IEEE, 1979,4:212-215.
  • 5Berouti M, Schwartz R, Makhoul J. Enhancement of speech corrupted by acoustic noise [ C 1. Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'79IEEE, 1979,4:208-211.
  • 6Agarwal A, Cheng Y M. Two-stage mel-warped Wiener filter for robust speech recognition[ C ]. Proc ASRU, 1999, 99:67-70.
  • 7Lira J, Oppenheim A. All-pole modeling of degraded speech[ J]. Acoustics, Speech and Signal Processing, IEEE Transactions on, 1978,26 ( 3 ) : 197-210.
  • 8Musicus B, Lira J. Maximum likelihood parameter estimation of noisy data [ C ]. Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP79 IEEE, 1979,4:224-227.
  • 9Hansen J H, Clements M A. Constrained iterative speech enhancement with application to speech recognition [ J ]. Signal Processing, IEEE Transactions on, 1991,39 (4) :795-805.
  • 10Goh Z, Tan K C, Tan B T. Speech enhancement based on avoieed-unvoiced speech model[ C ]. Acoustics, Speech and Signal Processing, 1998 Precedings of the 1998 IEEE International Conference on IEEE, 1998,1:401-404.

二级参考文献43

  • 1包永强,赵力,邹采荣.采用归一化补偿变换的与文本无关的说话人识别[J].声学学报,2006,31(1):55-60. 被引量:13
  • 2刘波,戴礼荣,王仁华,杜俊,李锦宇.基于双高斯GMM的特征参数规整及其在语音识别中的应用[J].自动化学报,2006,32(4):519-525. 被引量:4
  • 3Moreno P J, Raj B, and Stern R M. A vector Taylor series approach for environment- independent speech recognition[C] Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Atlanta, Georgia, USA, 7-10 May 1996: 733-736.
  • 4Moreno P J. Speech recognition in noisy environments[D]. [Ph.D. dissertation], Carnegie Mellon University, 1996.
  • 5Sasou A, Asano F, and Nakamura S, et al.. HMM-based noise-robust feature compensation[J]. Speech Communication, 2006, 48(9): 1100-1111.
  • 6Kim W and Hansen J H L. Feature compensation in the cepstral domain employing model combination[J]. Speech Communication, 2009, 51(2): 83-96.
  • 7Gauvain J L and Lee C H. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains[J]. IEEE Transactions on Speech and Audio Processing 1994, 2(2): 291-298.
  • 8Leggetter C J and Woodland P C. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models[J]. Computer Speech and Language, 1995, 9(2): 171-185.
  • 9Gales M J F and Woodland P C. Mean and variance adaptation within the MLLR framework[J]. Computer Speech and Language, 1996, 10(4): 249-264.
  • 10Gales M J F and Young S J. Robust speech recognition in additive and convolutional noise using parallel model combination[J]. Computer Speech and Language, 1995, 9(4): 289-307.

共引文献8

同被引文献66

引证文献4

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部