期刊文献+

基于动态单边自相关序列和频率规整线性预测的抗噪声语音识别 被引量:5

Robust speech recognition based on dynamic one-sided autocorrelation sequence and frequency warped linear predictive coding
原文传递
导出
摘要 提出了一种既符合人耳听觉特性又具有良好抗噪性的语音特征分析方法。首先将单边自相关函数序列进行时间方向的平滑处理,提高单边自相关函数的抗噪性,然后用平滑后的单边自相关函数序列代替原信号进行频率规整的LPC分析,最后经倒谱变换得到该特征参数。数字语音识别实验证明:利用该特征参数的语音识别系统的识别性能优于MEL倒谱系数、LPC倒谱系数等传统的语音特征参数。 A representation of speech that invariant to noise is introduced. The idea is to filter the temporal trajectories of short time One-Sided Autocorrelation Sequence (OSAS) of speech such that the noise effect is removed. The filtered sequences are denoted as Dynamic Autocorrelation Sequences (DAS). Then frequency warped LPC (WLPC) algorithm is applied to the DAS instead of the original speech. This speech feature set, which not only corresponds to the performance of human auditory property, but also improves the noise robustness of speech recognition, is denoted as DAS-WLPCC. Chinese digit recognition experiment based on continuous density HMM shows the effectiveness of DAS-WLPCC features in presence of white noise and color noise.
出处 《声学学报》 EI CSCD 北大核心 2004年第2期182-186,共5页 Acta Acustica
基金 国家自然科学基金(69871009和60272044)
关键词 动态单边自相关序列 频率规整线性预测 抗噪声语音识别 语音特征分析 自相关函数 倒谱变换 语音识别系统 Acoustic noise Audition Robustness (control systems) Speech analysis Speech coding
  • 相关文献

参考文献10

  • 1Ivandro Sanches. Noise-compensated hidden Markov models. IEEE Trans on Speech and Audio Processing, 2000;8(5): 533-540.
  • 2Hwang T H, Lee L M, Wang H C. Cepstral behavior due to additive noise and a compensation scheme for noisy speech recognition. IEEProc of Vis Image Signal Process, 1998;145(5): 316-321.
  • 3Mansour D, Juang B H. The short-time modified coherence representation and its application for noisy speech recognition. IEEE Trans Acoust , Speech, Signal Processing,1980; 28(4): 357-366.
  • 4Javier Hernando, Climent Nadeu. Linear prediction of the one-sided autocorrelation sequence for noisy speech recognition. IEEE Transactions on Speech and Audio Processing, 1997; 5(1): 80-84.
  • 5Davis S B, Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentence. IEEE Trans Acoust , Speech,Signal Processing, 1989; 37(6): 795-804.
  • 6Yoon Kim, Smith J O. A speech feature based on bark frequency warping-the non-uniform linear prediction cepstrum. Proc of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New York, 1999(10):17-20.
  • 7Rabiner L. Fundamentals of speech recognition. Prentice Hall, 1993.
  • 8Smith J O, Abel J S. Bark and ERB bilineax transform. IEEE Trans on Speech and Audio Processing, 1999; 7(6):697-708.
  • 9Aki Harma, Laine U K. A comparison of warped and conventional linear predictive coding. IEEE Trans on Speech and Audio Processing, 2001; 9(5): 579-588.
  • 10杨行竣 迟惠生.语音信号数字处理[M].北京:电子工业出版社,1999..

同被引文献57

引证文献5

二级引证文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部