
Speech enhancement through voice activity detection using speech absence probability based on Teager energy (Cited by: 2)

Abstract: In this work, a novel voice activity detection (VAD) algorithm that uses speech absence probability (SAP) based on Teager energy (TE) is proposed for speech enhancement. Instead of the conventional local SAP (LSAP), the proposed method uses an LSAP computed from the TE of the noisy speech as the feature parameter for VAD in each frequency subband. Results show that the TE operator improves the ability to discriminate speech from noise and further suppresses noise components. The TE-based LSAP therefore provides a better representation of LSAP, which improves the VAD used to estimate noise power in a speech enhancement algorithm. In addition, the method uses a TE-based global SAP (GSAP), derived in each frame, as a weighting parameter that modifies the adopted TE operator and improves its performance. The proposed algorithm was evaluated with objective and subjective quality tests under various environments and was shown to produce better results than the conventional method.
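The abstract does not give the exact form of the TE operator used in the paper. As a rough illustration only, the sketch below implements the standard discrete Teager-Kaiser energy operator, psi[n] = x[n]^2 - x[n-1]*x[n+1], on a time-domain sequence. The function name `teager_energy` and the test tone are assumptions made for this example. The per-subband application, the LSAP/GSAP computation, and the GSAP weighting described above are not reproduced here.

```python
import math


def teager_energy(x):
    """Standard discrete Teager-Kaiser energy operator (illustrative sketch).

    psi[n] = x[n]^2 - x[n-1] * x[n+1], evaluated at interior samples only,
    so the result has len(x) - 2 entries (empty if fewer than 3 samples).
    This is not claimed to be the exact operator used in the paper.
    """
    if len(x) < 3:
        return []
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


# Hypothetical test signal: a pure tone A*cos(w*n).
# For such a tone the operator is constant and equals (A*sin(w))^2,
# i.e. it reflects both amplitude and frequency of the oscillation.
A, w = 0.5, 0.3
tone = [A * math.cos(w * n) for n in range(64)]
psi = teager_energy(tone)
expected = (A * math.sin(w)) ** 2
max_error = max(abs(p - expected) for p in psi)
```

Because the operator responds to both amplitude and frequency, it tends to emphasize structured, oscillatory components, which is consistent with the abstract's claim that TE helps separate speech from noise.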
Source: Journal of Central South University (中南大学学报(英文版)); indexed in SCIE, EI, CAS; 2013, Issue 2, pp. 424-432 (9 pages)
Funding: Project supported by Inha University Research Grant; Project (10031764) supported by the Strategic Technology Development Program of Ministry of Knowledge Economy, Korea
Keywords: speech enhancement algorithm; detection; probability; energy; noise suppression; VAD; SAP; subband speech; speech enhancement; Teager energy; speech absence probability; voice activity detection
