期刊文献+

一种基于噪声估计的语音激活检测算法 被引量:1

An algorithm of voice activity detection based on noise estimation
下载PDF
导出
摘要 针对当前语音激活检测算法在低信噪比和复杂噪声模型的环境下性能损失的问题,提出了一种基于噪声估计的语音激活检测算法,通过对背景噪声进行自适应估计,得到准确的信噪比门限,同时利用估计背景噪声对短时谱进行白化处理,从而使得谱熵判决准则得以适用于复杂噪声模型的环境。实验证明,算法在低信噪比和复杂噪声模型下性能优于G.729B和AMR中的语音激活检测算法。 In the condition of low signal to noise ratio(SNR) and complex noise model,the performance of the voice activity detection(VAD) algorithms always becomes poor.This paper presents a new VAD algorithm based on noise estimation.Through the adaptive noise estimation,the accurate SNR threshold is decided.Then,after applying whitening filter to short time spectra the spectral entropy can be used to VAD algorithm in complex noise model.The experimental results show that the proposed algorithm gets better performance than G.729B and AMR VAD algorithms in the condition of low SNR and complex noise model.
出处 《信息技术》 2011年第10期5-8,共4页 Information Technology
基金 国家自然科学基金项目(60572081)
关键词 语音激活检测 噪声估计 信噪比 谱熵 白化滤波 voice activity detection noise estimation signal to noise ratio spectral entropy whitening filter
  • 相关文献

参考文献8

  • 1Rabiner L R, Sambur M R. An algorithm for determining the end- points of isolated utterances[J]. Bell Syst. Teeh, 1975, 54:297 - 315.
  • 2Renevey P, Drygajlo A. Entropy based voice activity detection in very noise conditions[J]//Proc. Eurospeech,2001:1887 - 1890.
  • 3ITU-T Recommendation G. 729 Annex B. A silence compression scheme for G. 729 optimized for teminals conforming to Recommen- dation V.70[Z]. 1996.
  • 4ETSI EN 301 708. Digital cellular telecommunications systems (Phase 2 + ) ; Voice Activity Detector (VAD) for Adaptive Multi - Rate (AMR) speech traffic channels; General description (GSM06.94 version 7.1.1 Release 1998). V 7.1.1. 1999[Z].
  • 5朱晓晶,侯旭初,崔慧娟,唐昆.基于LPCC和能量熵的端点检测[J].电讯技术,2010,50(6):41-45. 被引量:6
  • 6Martin R. Noise power spectral density estimation based on optimal smoothing and mininum statistics[J]. IEEE Trans. Speech Audio Processing, 2001,9:504 -512.
  • 7Rangaehafi S, Loizou P C. A noise - estimation algorithm for highly non - stationary enviroments[J]. Speech Communication, 2006,48 (2) :220 -231.
  • 8Renevey P, Drygajlo A. Entropy based voice activity detection in very noise conditions[J]//Proc. Eurospeech,2001 : 1887 - 1890.

二级参考文献9

  • 1李晔,张仁智,崔慧娟,唐昆.低信噪比下基于谱熵的语音端点检测算法[J].清华大学学报(自然科学版),2005,45(10):1397-1400. 被引量:37
  • 2Junqua J C,Mak B,Reaves B.A robust algorithm for word boundary detection in the presence of noise[J].IEEE Transactions on Speech and Audio Processing,1994,2(3):406-412.
  • 3Beritelli F,Casale S,Ruggeri G,et al.Performances evaluation and comparision of G.729/AMR/fuzzy voice activity detectors[J].IEEE Signal Processing Letters,2002,9(3):85-88.
  • 4Pencak J,Neloson D.The NP speech activity detection algorithm[C]//Proceedings of 1995 International Conference on Acoustics,Speech and Signal Processing.Detroit,MI,USA:[s.n.],1995:381-384.
  • 5Reynolds D,Rose R.Robust text-independent speaker identification using Gaussian mixture speaker models[J].IEEE Transactions on Speech and Audio Processing,1995,3(1):72-83.
  • 6Reynolds D A,Quatieri T F,Dunn R B.Speaker Verification Using Adapted Gaussian Mixture Models[J].Digital Signal Processing,2000,10(1):19-41.
  • 7Dempster A D,Laird N M,Rubin D B.Maximum likelihood from incomplete data via the EM algorithm[J].Journal of the Royal Statistical Society,1977,39(2):1-37.
  • 8Gish H,Schmid M.Text-Independent Speaker Identification[J].IEEE Signal Processing Magazine,1994,11(4):18-32.
  • 9徐大为,吴边,赵建伟,刘重庆.一种噪声环境下的实时语音端点检测算法[J].计算机工程与应用,2003,39(1):115-117. 被引量:30

共引文献5

同被引文献16

  • 1SOUDEN M,BENESTY J,AFFES S. Broadband source localization from an eigenanalysis perspective[J].IEEE Transactions on Audio Speech and Language Processing,2010,(06):1575-1587.
  • 2GEDALYAHU K,ELDAR Y C. Time-delay estimation from low-rate samples:a union of subspaces approach[J].{H}IEEE Transactions on Signal Processing,2010,(06):3017-3031.
  • 3SO H C,CHAN Y T,CHAN F K W. Closed-form formulae for time-difference-of-arrival estimation[J].{H}IEEE Transactions on Signal Processing,2008,(06):2614-2620.
  • 4LUI K,CHAN F,SO H C. Semidefinite programming approach for range-difference based source localization[J].{H}IEEE Transactions on Signal Processing,2009,(04):1630-1633.
  • 5KNAPP C,CARTER G. The generalized correlation method for esti-mation of time delay[J].{H}IEEE Transactions on Acoustics Speech and Signal Processing,1976,(04):320-327.
  • 6CHAMPAGNE B,BéDARD S,STéPHENNE A. Performance of time-delay estimation in the presence of room reverberation[J].{H}IEEE Transactions on Speech and Audio Processing,1996,(02):148-152.
  • 7DVORKIND T G,GANNOT S. Approaches for time difference of arrival estimation in a noisy and reverberant environment[A].Kyoto,Japan,2003.215-218.
  • 8CORNELIS B,DOCLO S,VAN DAN BOGAERT T. Theoretical analysis of binaural multimicrophone noise reduction techniques[J].IEEE Transactions on Audio Speech and Language Processing,2010,(02):342-355.
  • 9SALVATI D,CANAZZA S. Adaptive time delay estimation using filter length constraints for source localization in reverberant acoustic environments[J].Signal Processing Letters,2013,(05):507-510.
  • 10EPHRAIM Y,MALAH D. Speech enhancement using a mini-mum-mean square error short-time spectral amplitude estimator[J].{H}IEEE Transactions on Acoustics Speech and Signal Processing,1984,(06):1109-1121.

引证文献1

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部