期刊文献+

基于复高斯混合模型的鲁棒VAD算法 被引量:2

Robust Voice Activity Detection Algorithm Based on Complex Gaussian Mixture Model
下载PDF
导出
摘要 针对语音激活检测的鲁棒性问题,提出在非平稳噪声环境下使用基于复高斯混合模型的鲁棒语音激活检测算法.算法中假设纯净语音谱满足复高斯混合模型,先验信噪比利用预先训练好的复高斯混合模型计算得到.复高斯混合模型的引入一方面提高了语音激活检测的性能,另一方面避免了使用基于最小均方误差语音增强的先验信噪比估计过程.实验中使用NOISEX-92噪声库来验证系统在噪声环境下的性能.结果表明,该种算法在非平稳噪声环境下具有良好的检测性能. In order to improve the robustness of voice activity detection (VAD),the use of an algorithm based on complex Gaussian mixture model under nonstationary noisy environments was presented. In the algorithm,the clean speech distribution was modelled by complex Gaussian mixture model, and the a priori SNR was estimated based on the pre-trained complex Gaussian mixture model. The introduction of complex Gaussian mixture model not only improved the performance of voice activity detection,but also avoided the estimation of a priori SNR using minimum mean square error short spectral amplitude estimator. The system performance under noisy environments was evaluated using NOISEX-92 database. Experimental results show that the algorithm can work more robustly under nonstationary noisy environments.
出处 《天津大学学报》 EI CAS CSCD 北大核心 2009年第4期353-356,共4页 Journal of Tianjin University(Science and Technology)
基金 国家自然科学基金资助项目(60475007) 国家"863"高技术研究发展计划资助项目(2006A010102)
关键词 复高斯混合模型 语音激活检测 似然比测试 complex Gaussian mixture model voice activity detection (VAD) likelihood ratio test
  • 相关文献

参考文献15

  • 1ITU. A Silence Compression Scheme for G.729 Optimized for Terminals Conforming to Recommendation V.70 [S]. ITU-T Recommendation G.729-Annex B, 1996.
  • 2Woo K,Yang T ,Park K ,et al. Robust voice activity detection algorithm for estimating noise spectrum[J]. Electronics Letters,2000,36 (2) : 180-181,
  • 3Junqua J C,Reavers B,Mark B. A study of endpoint detection algorithms in adverse conditions:Incidence on a DTW and HMM recognize[C]//Proceedings of Eurospeech. Genova,Italy, 1991 : 1371-1374.
  • 4Shen Jialin,Hung J W,Lee Linshan. Robust entropy- based endpoint detection for speech recognition in noisy environments [C]// Proceedings of ICSLP. Sydney , Australia, 1998:232-235.
  • 5Jia Chuan,Xu Bo. An improved entropy-based endpoint detection algorithm [C]// Proceedings of ISCSLP. Taiwan, China, 2002 :96-99.
  • 6Rabiner L R, Sambur M R. Voiced-unvoiced-silence detection using the Itakura LPC distance measure [C]// Proceedings of lCASSP. 1977:323-326.
  • 7Haigb J A,Mason J S. Robust voice activity detection using cepstral feature [ C ] // Proceedings of IEEE TELCON. China, 1993:321-324.
  • 8Elias N,Rafik G,Samy M. Robust voice activity detection using higher-order statistics in the LPC residual domain[J]. IEEE Trans on Speech and Audio Processing,2001,9 (3) :217-231.
  • 9Shin W H,Lee B S,Lee Y H,et al. Speech/non-speech classification using multiple features for robust endpoint detection [C] // Proceedings oflCASSP. Istanbul, Turkey, 2000 : 1399-1402.
  • 10Kida Y,Kawahara T. Voice activity detection based on optimal weighted combination of multiple features[C]// Proceedings of lnterspeech. Lisbon,Portugal,2005:2621- 2624.

同被引文献20

  • 1余鹏,封举富,童行伟.一种新的基于高斯混合模型的纹理图像分割方法[J].武汉大学学报(信息科学版),2005,30(6):514-517. 被引量:6
  • 2张仁志,崔慧娟.基于短时能量的语音端点检测算法研究[J].电声技术,2005,29(7):52-54. 被引量:45
  • 3Stauffer C, Grimson W E L. Adaptive Background Mixture Models for Real - time Tracking [ C ]. ICCV, 1999 : 246 - 252.
  • 4Besag J. On the Statistical Analysis of Dirty Pictures[ J]. Journal of the Royal Statistical Society, Series B ( Methodological ), 1986,48 (3) :259 -302.
  • 5Dempster A P, Laird N M, Rubin D B. Maximum Likelihood for In- complete Data Via the EM Algorithm [ J ]. Journal of the Royal Sta- tistical Society, Serious B ( Methodological ), 1977,39 : 1 - 38.
  • 6Mclanchlan G J. The EM Algorithm and Extension [ M ]. New York : Wily & Sons, 1997.
  • 7Sanjay G S, Thomas J H. Bayesian Pixel Classification Using Spa- tially Variant Finite Mixtures and the Generalized EM Algorithm [ J]. IEEE Transactions on Image Processing, 1998,7 (7) : 1014 - 1028.
  • 8Hammersley J M, Cliford P. Markov Field on Finite Graphs and Lattices [ Z ]. Unpublished manuscript, 1971.
  • 9Peter J M L, Emile H L A. Simulated Annealing:Theory and Appli- cations [ M ] Dordrecht, Holland : ReidelPub, 1987.
  • 10Geman S, Gemini D. Stochastic Relaxation, Gibes Distributions, and the Bayesian Restoration of Images [ J ]. IEEE Transactions on Pat- tern Analysis and Machine Intelligence, 1984,6 (6) :721 - 741.

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部