期刊文献+

基于噪声倒谱阈值频谱估计的语音活动检测 被引量:3

Voice Activity Detection Based on Noise Cepstrum Thresholding Spectral Estimation
下载PDF
导出
摘要 针对低方差频谱估计的语音活动检测(VAD)中Welch频谱估计方法计算量大的问题,提出利用倒谱阈值方法估计VAD中的噪声功率谱。该方法在静音时期为噪声的倒谱设置阈值,利用快速傅里叶变换计算频谱,再更新VAD中的判决阈值。算法复杂度分析与仿真结果表明,该方法的检测性能与Welch方法相当,计算量降低约18%,同时降低整个VAD的时间复杂度。 Aiming at high computational complexity in Voice Activity Detection(VAD) which uses low variance spectral estimation, this paper proposes a method using noise cepstrum thresholding spectral estimation to compute noise Power Spectral Density(PSD). In speech absent period, a threshold is set for smoothing the noise cepstrum, then noise PSD is estimated by using Fast Fourier Transform(FFT), and detecting threshold in VAD system is updated. Computational complexity analysis and simulation results indicate that compared with Welch method, the computation of the method is reduced by 18%, and total time complexity of VAD is decreased.
出处 《计算机工程》 CAS CSCD 北大核心 2011年第14期140-142,共3页 Computer Engineering
基金 国家自然科学基金资助项目(60874060)
关键词 语音活动检测 频谱估计 倒谱阈值方法 功率谱密度 快速傅里叶变换 Voice Activity Detection(VAD) spectral estimation cepstrum thresholding method Power Spectral Density(PSD) Fast FourierTransform(FFT)
  • 相关文献

参考文献2

二级参考文献16

  • 1Sohn J, Kim N S, Sung Wonyong. A Statistical Model-based Voice Activity Detection[J]. IEEE Signal Processing Letters, 1999, 6(1): 1-3.
  • 2Cho Yongdtlk, Kondoz A. Analysis and Improvement of a Statistical Model-based Voice Activity Detector[J]. IEEE Signal Processing Letters, 2001, 8(10): 276-278.
  • 3Gazor S, Zhang Wei. Speech Enhancement Employing Laplacian- Gaussian Mixture[J]. IEEE Transactions on Speech and Audio Processing, 2005, 13(5): 896-904.
  • 4Chang Joon-Hyuk, Kim N S, Mitra S K. Voice Activity Detection Based on Multiple Statistical Models[J]. IEEE Transactions on Signal Processing, 2006, 54(6): 1965-1976.
  • 5Gazor S, Zhang Wei. Speech Probability Distribution[J]. IEEE Signal Processing Letters, 2003, 10(7): 204-207.
  • 6ITU-T Recommendation G. 729,Annex B. , 1996.
  • 7F. Beritelli, S. Casale, and A. Cavallaro, "A robust vouce activity detector for wireless communications using soft omputing", IEEE J. Select. Areas Commun. , vol. 16, no. 9, pp. 1818-1829, Dec. 1998.
  • 8J. Sohn, N. S. Kim, and W. Sung, "A statistical modelbased voice activity detection" ,IEEE Signal Process. Lett., vol. 6,no. 1 ,pp. 1-3 ,Jan. 1999.
  • 9S. Gazor, W. Zhang,"A soft voice activity detector based on a Laplacian-Gasussian model", IEEE Trans. ASSP, vol. 11, no. 5, pp. 498-505, Sep. 2003.
  • 10J. -H. Chang, J. W. Shin, and N. S. Kim, "Likelihood ratio test with complex Laplacian model for voice activity detection", in Proc. Enrospeech, Geneva, Switzerland, pp. 1065- 1068, Aug. 2003.

共引文献6

同被引文献16

  • 1Huang H J, Su S L, Chen J H. Design and performance analysis for data transmission in GSM/GPRS system with voice activity detection[J]. IEEE Trans. Veh. Technology, 2002, 51(4): 648-656.
  • 2Solm J, KJm N S, Sung W. A statistical model-based voice activity detection[j]. IEEE Signal Processing Letters, 1999, 6(1): 1-3.
  • 3Cho Y D, Kondoz A. Analysis and improvement of a statistical model-based voice activity detector[J]. IEEE Signal Processing Letters, 2003,8(10) : 276-278.
  • 4Ramirez J, Segura J C, Benitez C, et al. A statistical voice activity detection using a multiple observation likelihood ratio test[J]. IEEE Signal Processing Letters, 12(10): 689-692.
  • 5Gazor S, Zhang W. A sott voice activity detector based on a laplacian-gasussian model[J]. IEEE Trans. on Audio, Speech, and Language Processing, 2003, 11 (5): 498-505.
  • 6Tahrnasbi R, Rezaei R, A sott voice activity detection using GARCH filter and variance gamma distribution[J]. IEEE Trans. on Audio, Speech, and Language Processing,2007,15(4): 1129-1134.
  • 7ITU-T Recommendation G.729, Annex B.[R], 1996.
  • 8Davis A, Nordholm S, Togneri 1L Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold[J]. IEEE Trans. on Audio, Speech, and Language Processing, 2006, 14(2): 412-424.
  • 9Stoica P, Sandgrcn N. Total-variance reduction via thtesholding: application to cepstral analysis[J]. IEEE Transactions on Signal Processing, 2007, 55(1): 66-72.
  • 10刘思伟,吕海波,慕德俊.基于G.729的自适应实时语音活动检测方法研究[J].计算机工程与应用,2007,43(34):57-60. 被引量:4

引证文献3

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部