期刊文献+

噪声谱估计算法对语音可懂度的影响 被引量:3

Effects of noise spectrum estimation algorithms on speech intelligibility
下载PDF
导出
摘要 噪声谱估计是单通道语音增强算法的关键步骤,当前大部分语音增强算法旨在提高语音质量,提高语音可懂度的算法却很少。在传统的单通道语音增强算法中,语音质量的提高往往是以牺牲语音的可懂度为代价的。对目前主流的几种噪声谱估计算法对语音可懂度影响进行分析。在不同噪声背景、不同信噪比情况下进行噪声谱估计,并采用谱减法对含噪语音信号作去噪处理,对比分析不同噪声、不同信噪比下增强前后语音的短时客观可懂度(Short-Time Objective Intelligibility,STOI)值,最后根据信噪比,对比分析了不同噪声环境下,语音增强前后语音能量高于噪声能量的时频块所占比例。实验表明,相比其他噪声估计算法,最小统计(Minima Statistics,MS)算法由于保留了更多的以语音能量为主的时频块,使得去噪后的语音有较高的可懂度。 Noise spectrum estimation is a key step in single channel speech enhancement algorithms. Most of current speech enhancement algorithms are designed to improve speech quality, however, algorithms for increasing speech intelligibility are few. The traditional speech enhancement algorithms improve speech quality, while sacrificing speech intelligibility. In this paper, classical noise spectrum estimation algorithms are evaluated for their effects on speech intelligibility. Noise spectrum is estimated in different noise environments with SNRs between ?9 d B and 3 d B. The spectral subtraction is thereafter used for speech denoising. The STOI(Short-Time Objective Intelligibility) value of the enhanced speech is computed. At last, according to the signal-to-noise ratio, the proportions of speech dominated time-frequency blocks under different noise environments are analyzed. Experimental results show that, compared with other noise estimation algorithms, the minimum statistics(MS) obtains high speech intelligibility because it retains more speech dominated time-frequency blocks after speech denoising.
出处 《声学技术》 CSCD 北大核心 2015年第5期424-430,共7页 Technical Acoustics
基金 国家自然科学基金(61301219 61003131) 安徽省自然科学基金(1408085MF113)资助项目
关键词 噪声谱估计 谱减法 时频块 最小统计 短时客观可懂度 语音可懂度 noise spectrum estimation spectrum subtraction time-frequency blocks Minima Statistics(MS) Short-Time Objective Intelligibility(STOI) speech intelligibility
  • 相关文献

参考文献27

  • 1Yuan W, Lin J, An W, et al. Noise estimation based on time-frequency correlation for speech enhancement[J]. Applied Acoustics, 2013, 74(5): 770-781.
  • 2Lu Ching-Ta. Noise reduction using three-step gain factor and iterative-directional-median filter[J]. Applied Acoustics, 2014, 76(1): 249-261.
  • 3Ming Ji. Crookes, Danny. An iterative longest matching segment approach to speech enhancement with additive noise and channel distortion[J]. Computer Speech and Language, 2014, 28(6): 1269-1286.
  • 4Lim J. Evaluation of a correlation subtraction method for enhanc- ing speech degraded by additive noise[J]. IEEE Transactions on Acoustics, Speech and Sinai Processing, 1978, 37(6): 471-472.
  • 5Hu Y, Loizou P. A comparative intelligibility study of sin-gle-microphone noise reduction algorithms[J]. J. Acoust. Soc. Am., 2007, 122(3): 1777-1786.
  • 6Loizou P, Kim G. Reasons why current speech-enhancement algo- rithms do not improve speech intelligibility and suggested solu- tions[J]. IEEE Transactions on Audio, Speech, and Language Processing, 2011, 19(1): 47-56.
  • 7McAulay R, Malpass M. Speech enhancement using a soft-decision noise suppression filter[J]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1980, 28(2): 137-145.
  • 8McKinley B, Whipple G. Model based speech pause detection[C]// Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on. 1997, 2: 1179-1182.
  • 9Meyer J, Simmer K, Kammeyer K. Comparison of one and two-channel noise-estimation techniques[C]// Proc. 5th Interna- tional Workshop on Acoustics Echo and Noise Control, IEAENC-97. 1997, 137-145.
  • 10Solm J, Kim N, Sung W. A statistical model-based voice activity detection[J]. Signal Processing Letters, IEEE, 1999, 6(1): 1-3.

同被引文献19

引证文献3

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部