期刊文献+

一种对加性噪声和信道函数联合补偿的模型估计方法 被引量:5

An algorithm of Model Compensation based on the estimation of additive noise and channel function for speech recognition
原文传递
导出
摘要 语音识别系统在面对实际环境中多变的加性噪声和信道差异的影响时性能急剧下降,抑制这些噪声和差异所造成的性能下降具有重要意义。作者提出了一种模型补偿算法,使用句子中的非语音段估计加性噪声,然后利用EM算法估计信道函数,从而在倒谱域上对失配的声学模型进行联合补偿。实验表明,相比基线系统,采用该算法的系统的平均性能相对提升幅度超过50%。算法可以动态跟踪环境的变化,性能表现优于一些传统的语音识别稳健性处理算法。 Speech recognition systems declined roughly in performance when they were facing the impacts of various additive noise and channel distortions in the actual environment, so it was of great significance for the speech recognition system to alleviate these impacts of the noise and distortions. An algorithm of model compensation was proposed, which computed the additive noise from the non-speech segments of the sentence, estimated the channel function using the EM algorithm, and jointly compensated the mismatched acoustics HMM models in the cepstral domain with them. Experiments employing this algorithm showed the significant improvement more than 50 percent relatively. The algorithm tracked the changes in the environment dynamically and it provided better performance than the traditional robust speech recognition algorithms.
出处 《声学学报》 EI CSCD 北大核心 2008年第3期238-243,共6页 Acta Acustica
  • 相关文献

参考文献16

  • 1Y.Gong. Speech recognition in noisy environments: A survery. Speech Commun., 1995; 16(3): 261- 291.
  • 2Chia-Ping Chen, Jeff Bilmes, and Katrin Kirchhoff. Low- resource noise-robust feature post processing on aurora 2.0. In: Proceedings of International Conference on Spoken Language Processing (ICSLP), 2002:2445 -2448.
  • 3Lim J S, Oppenheim A V. Enhancement and bandwidth compression of noisy speech. Proceedings IEEE, 1979; 67: 1586-1604.
  • 4Ephraim Y. Statistical-Model-Based Speech Enhancement Systems. In: Proc. IEEE, 1992; 80(10): 1526- 1555.
  • 5蒋文建,韦岗.基于掩蔽特性的噪声环境下语音识别新特征[J].声学学报,2001,26(6):516-520. 被引量:10
  • 6陶智,赵鹤鸣,龚呈卉.基于听觉掩蔽效应和Bark子波变换的语音增强[J].声学学报,2005,30(4):367-372. 被引量:39
  • 7Koehler J, Morgan N, Hermansky H, Gunter-Hirsh H, Tong G. Integrating RASTA-PLP into speech recognition. In: Proceedings ICASSP, 1994; 1:421-424.
  • 8Gales M J F. Model-based techniques for noise robust speech recognition. Ph.D. dissertation, Cambridge Univ., Cambridge, U.K., 1995.
  • 9Liu F, Moreno P J, Stern R M, Acero A. Signal processing for robust speech recognition. In: Proceedings ARPA Workshop on Human Language Technology, 1994: 309- 314.
  • 10Gales M, Young S. A fast and flexible implementation of parallel model combination. In: Proc. IGASSP'95, 1995; 1:133-136.

二级参考文献41

  • 1Xu Jinfu,Electron Lett,2000年,36卷,14期,1247页
  • 2Bu L,IEEE Trans SAP,2000年,8卷,2期,105页
  • 3沈--,电声技术,1997年,10卷,2页
  • 4Pan Davis,IEEE Trans Multi Media,1995年,2卷,2期,60页
  • 5Ephraim Y, Malah D. Speech enhancement using a minimum mean-square error log-spectral amplitude estimator.IEEE Trans Acoust Speech Signal Processing. 1985; 33(2):443-445.
  • 6Lockwood P, Boudy J. Experiments with a nonlinear spectral subtractor (NSS), hidden Markov models and projection,for robust recognition in cars. Speech Commun, 1992;11(6): 21,5-228.
  • 7Tsoukalas D E, Mourjopoulos J N, Kokkinakis G. Speech enhancement based on audible noise suppression.IEEE Transactions on SPEECH and Processing. 1997; 5(6):497-514.
  • 8Seok J W,Bae K S.Speech enhancement with reduction of noise components in the wavelet domain.Copyright 1997 IEEE:1323—1326.
  • 9Johnston J D.Transform coding of audio signal using perceptual noise criteria.IEEE J.Select Areas Commun,1983;6(2):314—323.
  • 10Evangelista G,Cavaliere S.Discrete frequency warped wavelets:theory and applications.IEEE Trans.on Signal Processing,1998;46(4):874—885.

共引文献57

同被引文献61

引证文献5

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部