期刊文献+

改进的基于小波变换的语音活动检测算法 被引量:2

Improved Voice Activity Detector Based on the Wavelet Transform
下载PDF
导出
摘要 提出了一种改进的基于小波变换的语音活动检测算法。这种新算法能够在不同的时间和尺度上计算用于语音活动检测的参数 ,根据这些参数得到稳健的语音活动决策。实验表明 ,新算法与ITUG .72 9附录B相比 ,能够更准确地检测到语音活动 ,语音活动剪切率大为减少 ;同时新算法对于不同类型的背景噪声 ,即使全局信噪比在 10dB以下也具有较好的性能。 Owing to the flexibility of time_frequency resolution of wavelet transform,we present an improved voice activity detector (VAD) based on wavelet transform.Robust parameters in different scale and time resolution are computed for VAD decision,such as silence measure,stability measure of amplitude spectrum between adjacent frames,background noise measure of different frequency band,time_domain stability measure of scale 1. The silence measure is used to detect the existence of silence in the input frame.The stability measure of amplitude spectrum between adjacent frames is adopted to give a rough decision of the detection of stable noise based on the assumption that background noise is stable.If current input frame is noise,the energy of every frequency band is below the average of background noise energy threshold over long time.We divide the signal bandwidth into several scales by wavelet transform,and calculate the background noise measure of different scale.In low scale the input signal changes rapidly,and the variety of short time energy will be removed with long window.We calculate the mean square error of short time energy,and get time_domain stability measure from detail coefficients of scale 1.With these measures,we make the VAD decision. Compared with G.729 Annex B,the authors can detect the voice activity more accurately and reduce the ratio of speech clipping using the new algorithm.And the improved algorithm can achieve robust performance for different background noise,even in serious low signal_to_noise environment about 10dB.
出处 《武汉大学学报(信息科学版)》 EI CSCD 北大核心 2002年第1期85-88,共4页 Geomatics and Information Science of Wuhan University
基金 国家自然科学基金资助项目 ( 4 97710 6 4) 国家测绘局测绘科技发展基金资助项目 ( 970 0 9)
关键词 小波变换 语音活动检测 算法 语音编码 voice activity detection wavelet transform speech coder
  • 相关文献

参考文献5

  • 1[1]Freeman,Cosier G.The Voice Activity Detector for Pan-European Digital Mobile Telephone Service.Proceedings of ICASSP.Glasgow,1989
  • 2[2]Srinivasan K,Gersho A.Voice Activity Dectection for Cellular Networks.In:Proceedings of IEEE Workshop on Speech Coding for Telecommunications,1993
  • 3[3]ITU-T Recommendation G.729 Annex B.A Silence Compression Scheme for G.729 Optimized for Terminals Conforming to Recommendation V.70,1996
  • 4[4]ITU-T Recommendation G.723.1 Annex A.Silence Compression Scheme,1996
  • 5[5]Stegmann J,Schroder G.Robust Voice-activity Detection Based on the Wavelet Transform.In:Proceedings of IEEE Workshop on Speech Coding for Telecommunications,1997

同被引文献9

引证文献2

二级引证文献48

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部