High frequency weighted MFCC extraction for noise robust speaker verification

Cited by: 15
Abstract: This paper proposes a high-frequency weighting method for speech MFCC parameters that improves the recognition rate of speaker verification in noisy environments. Because Mel frequency is logarithmically related to linear frequency, the resolution of spectral energy decreases progressively toward high frequencies. Speech that has been windowed pitch-synchronously with variable-length windows avoids harmonic leakage to some extent, so more high-order harmonic information is retained. Weighting the high-frequency part of the speech spectral energy therefore enhances the speech and improves its robustness. The method was applied to MFCC parameter extraction with pitch-synchronous preprocessing, and speaker verification experiments were conducted. The results show that the method improves the speaker verification recognition rate to some degree in several kinds of noise environments, even when the SNR is low.
Source: Chinese Journal of Scientific Instrument (《仪器仪表学报》), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2008, No. 3, pp. 668-672 (5 pages)
Keywords: high-frequency weighting; speaker verification; pitch synchronous; robustness; MFCC
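
The weighting idea summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the weighting function, so the linear gain ramp (`high_freq_weights`, `max_gain`) is a hypothetical choice, the Mel formula is the commonly used 2595·log10(1 + f/700) form, and a fixed-length Hamming window stands in for the pitch-synchronous variable-length window described in the paper. All function names are illustrative.

```python
import numpy as np

def hz_to_mel(f_hz):
    # Commonly used Mel formula (assumed; the abstract gives no formula).
    return 2595.0 * np.log10(1.0 + np.asarray(f_hz, dtype=float) / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=float) / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    # Triangular filters spaced uniformly on the Mel scale, so their
    # bandwidth in Hz grows (resolution drops) toward high frequencies.
    hz_edges = mel_to_hz(np.linspace(0.0, hz_to_mel(sample_rate / 2.0), n_filters + 2))
    bin_freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)
    bank = np.zeros((n_filters, bin_freqs.size))
    for i in range(n_filters):
        lo, mid, hi = hz_edges[i], hz_edges[i + 1], hz_edges[i + 2]
        rising = (bin_freqs - lo) / (mid - lo)
        falling = (hi - bin_freqs) / (hi - mid)
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

def high_freq_weights(n_bins, max_gain=2.0):
    # Hypothetical weighting: gain rises linearly from 1 at DC to max_gain at Nyquist.
    return np.linspace(1.0, max_gain, n_bins)

def weighted_log_mel_energies(frame, sample_rate, n_filters=24, max_gain=2.0):
    # Fixed Hamming window used here for simplicity (the paper uses
    # pitch-synchronous variable-length windows instead).
    n_fft = len(frame)
    power = np.abs(np.fft.rfft(frame * np.hamming(n_fft))) ** 2
    weighted = power * high_freq_weights(power.size, max_gain)
    return np.log(mel_filterbank(n_filters, n_fft, sample_rate) @ weighted + 1e-12)

def weighted_mfcc(frame, sample_rate, n_coeffs=13, **kwargs):
    # Unnormalized DCT-II of the weighted log Mel energies.
    log_e = weighted_log_mel_energies(frame, sample_rate, **kwargs)
    k = np.arange(n_coeffs)[:, None]
    m = np.arange(log_e.size)[None, :]
    return np.cos(np.pi * k * (m + 0.5) / log_e.size) @ log_e
```

Calling `weighted_mfcc(frame, 8000)` on a 256-sample frame returns 13 coefficients; setting `max_gain=1.0` disables the weighting and gives a plain MFCC-style baseline for comparison.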

References (8)

  • [1] GALES M F J. Predictive model-based compensation schemes for robust speech recognition [J]. Speech Communication, 1998, 25(1-3): 49-74.
  • [2] WEINSTEIN E, OPPENHEIM A V, FEDER M, et al. Iterative and sequential algorithms for multisensor signal enhancement [J]. IEEE Trans. on Signal Processing, 1994, 42(4): 846-859.
  • [3] XU T, CAO Z G. Combination of feature weight and speech enhancement for robust ASR at low SNRs [C]. Proceedings of IEEE TENCON'02, 2002: 441-444.
  • [4] DAVIS S B, MERMELSTEIN P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences [J]. IEEE Trans. Acoustics, Speech and Signal Processing, 1980, ASSP-28(4): 357-366.
  • [5] KIM S, ERIKSSON T. A pitch synchronous feature extraction method for speaker recognition [C]. IEEE, Acoustics, Speech and Signal Processing Proceedings, 2004, 1: 405-408.
  • [6] YI K C, TIAN B, FU Q. Speech signal processing (语音信号处理) [M]. Beijing: National Defense Industry Press, 2003.
  • [7] YANG L P, GONG W G. Multi-SNR GMMs-based noise-robust speaker verification using 1/f "Noises" [C]. IEEE, The 18th International Conference on Pattern Recognition, 2006, 4: 241-244.
  • [8] BAO C C, FAN C X. Pitch detection algorithm based on the normalized cross-correlation function [J]. Journal on Communications (通信学报), 1998, 19(10): 27-31. Cited by: 42

Co-citing documents: 51

Co-cited documents: 113

Citing documents: 15

Second-level citing documents: 116