期刊文献+

基于帧间相关性的语音活动检测方法

Voice activity detection method based on inter-frame correlation
下载PDF
导出
摘要 为了提高统计模型似然比测试的语音活动检测(VAD)的检测性能,利用前后语音帧间存在的统计相关特性,提出一种改进VAD算法。通过前帧语音频谱分量对先验信噪比进行递归估计,然后利用前一帧的语音检测状态来设计判决阈值,建立了双阈值隐马尔可夫模型语音活动判决规则。实验表明,此帧间相关性VAD算法的检测指标值优于Sohn算法。 To enhance the detection performance of statistical model-based Voice Activity Detection(VAD) using likelihood ratio test,an improved VAD was proposed by utilizing the correlation between tandem speech frames.First a priori Signal-to-Noise Ratio(SNR) was estimated using recursive estimation method based on the result of the previous speech frame instead of the traditional decision-directed method.Secondly double thresholds were designed by depending on the previous frame's detention result.Finally a detection rule was presented based on two-state Hidden Markov Model(HMM) coupled with double thresholds.The experimental results show that the inter-frame correlation based VAD scheme gets better performance than the Sohn's VAD.
出处 《计算机应用》 CSCD 北大核心 2011年第5期1447-1449,共3页 journal of Computer Applications
基金 国家自然科学基金资助项目(60874060)
关键词 语音活动检测 统计模型 相关性 似然比测试 先验信噪比 阈值 voice activity detection statistical model correlation likelihood ratio test a priori SNR threshold
  • 相关文献

参考文献11

  • 1李宇,陈建铭,谭洪舟,陈明.基于Rayleigh噪声统计分布的有音区检测[J].信号处理,2009,25(11):1809-1813. 被引量:3
  • 2A silence compressionscheme for G.729 optimized for terminals conforming to ITU-T V.70. ITU-T Recommendation G 729 Annex B . 1996
  • 3CHEN S H,WUHT,CHANG YK.Robust voice activity detectionusing perceptual wavelet-packet transform and teager energy operator. Pattern Recognition . 2007
  • 4NEMER E,GOUBRAN R,MAHMOUD S.Robust voice activitydetection using higher-order statistics in the LPC residual domain. IEEE Transactions on Speech Audio Processing . 2001
  • 5SHIN J W,KWONHJ,JINS H.Voice activity detection based onconditional MAP criterion. IEEE Signal Processing Letters . 2008
  • 6S. Gokhum Tanyer,et al.Voice activity detection in nonstationary noise. IEEE Transactions on Speech and Audio Processing . 2000
  • 7S. Gazor,W. Zhang.A soft voice activity detector based on a Laplacian-Gasussian model. IEEE Transactions on Speech and Audio Processing . 2003
  • 8Cohen I.Relaxed statistical model for speech enhancement and apriori SNR estimation. IEEE Trans.on Speech and Audio Pro-cessing . 2005
  • 9Ephraim Y,Malah D.Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator. IEEE Transactions on Acoustics Speech and Signal Processing . 1984
  • 10Jongseo Sohn,Nam Soo Kim,Wonyong Sung.A statistical model-based voice activity detection. IEEE Signal Processing Letters . 1999

二级参考文献11

  • 1ITU-T Recommendation G. 729,Annex B. , 1996.
  • 2F. Beritelli, S. Casale, and A. Cavallaro, "A robust vouce activity detector for wireless communications using soft omputing", IEEE J. Select. Areas Commun. , vol. 16, no. 9, pp. 1818-1829, Dec. 1998.
  • 3J. Sohn, N. S. Kim, and W. Sung, "A statistical modelbased voice activity detection" ,IEEE Signal Process. Lett., vol. 6,no. 1 ,pp. 1-3 ,Jan. 1999.
  • 4S. Gazor, W. Zhang,"A soft voice activity detector based on a Laplacian-Gasussian model", IEEE Trans. ASSP, vol. 11, no. 5, pp. 498-505, Sep. 2003.
  • 5J. -H. Chang, J. W. Shin, and N. S. Kim, "Likelihood ratio test with complex Laplacian model for voice activity detection", in Proc. Enrospeech, Geneva, Switzerland, pp. 1065- 1068, Aug. 2003.
  • 6J. W. Shin, H. J. Kwon, S. H. Jin, and N. S. Kim, " Voice Activity Detection Based on Conditional MAP Criterion", IEEE Signal Processing Letters, Vol. 15, pp. 257- 260, Feb. 2008.
  • 7J. -H. Chang, N. S. Kim, S. K. Mitra," Voice activity detection based on multiple statistical models", IEEE Trans. SP, vol. 54 ,no. 6 ,pp. 1965-1976 ,Jun. 2006.
  • 8A. Davis, S. Nordholm, and R. Togneri, "Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold", IEEE Trans. on Audio, Speech, and Language Processing, pp. 412-424, Mar. 2006.
  • 9Y. Ephraim, D. Malah. "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator" ,IEEE Trans. ASSP,vol. 32,no. 6,pp. 1109 - 1121, Dec. 1984.
  • 10C. Breithaupt and R. Martin, "Voice activity detection in the DFT domain based on a parametric noise model", Proc. of the Int. Worksh. of Acoustic Echo and Noise Control ( IWAENC ), Paris, 1-4 Sep. 2006.

共引文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部