期刊文献+

基于自适应超高斯混合模型的语音增强算法 被引量:2

Speech Enhancement Algorithm Based on Adapted Super-Gaussian Mixture Model
下载PDF
导出
摘要 语音信号的频谱结构复杂性决定了其短时谱分布不能用单一的概率密度函数(Probability density function,PDF)准确描述。据此,提出了一种采用超高斯混合模型对语音信号幅度谱建模以实现语音增强的新方法。首先,采用超高斯混合模型对语音信号幅度谱的先验分布进行建模,相对于传统的单一模型,该模型能更好地描述语音信号的多类特性;然后,在增强过程中自适应更新混合分量的PDF及其权重,从而克服了传统模型难以跟踪语音信号分布动态变化的缺点。仿真结果表明与传统的短时谱估计算法相比,该算法的噪声抑制性能有较大的提升,增强语音的主观感知质量也有明显改善。 The observation of speech spectral structure shows that the statistics of speech signal cannot be well determined by a simple probability density function (PDF). Therefore, a speech enhancement algorithm is presented based on the super-Gaussian mixture model. Firstly, the super Gaussian mixture model is employed to model the speech spectral amplitude, which is more flexible in capturing the statistical behavior of speech signals than the conventional simple speech model. Where after, PDF and weight of the mixture components are further adapted, which can overcome the disadvantage that the traditional simple speech model cannot well track the dynamic characteristics of the speech signal. The simulation results show that the proposed algorithm achieves better noise suppression and lower speech distortion compared with the con- ventional short-time spectral estimation algorithms.
出处 《数据采集与处理》 CSCD 北大核心 2014年第2期232-237,共6页 Journal of Data Acquisition and Processing
关键词 语音增强 超高斯混合模型 自适应 speech enhancement super-Gaussian mixture model adaptation
  • 相关文献

参考文献15

  • 1Ephraim Y, Malah D. Speech enhancement using a minimum mean-square error short-time spectral am- plitude estimator [J]. IEEE Trans Acoust Speech, Signal Process, 1984,32(6) :1109- 1121.
  • 2Gazor S, Zhang W. Speech probability distribution [J]. IEEE Signal Process Lett, 2003,10(7):2042- 207.
  • 3Martin R. Speech enhancement based on minimum mean-square error estimation and super Gaussian pri- orsEJ~. IEEE Trans Speech Audio Process, 2005,13 (5) :845-856.
  • 4Lotter T, Vary P. Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model[J]. Eurasip J Signal Process, 2005, (7) :1110-1126.
  • 5邹霞,陈亮,张雄伟.基于Gamma语音模型的语音增强算法[J].通信学报,2006,27(10):118-123. 被引量:11
  • 6Hendriks R C. Heusdens R, Jensen J. Log-spectral magnitude MMSE estimators under super-gaussian densities[J]. Inter Speech, 2009,10(6) :1319-1322.
  • 7Ephraim Y. A Bayesian estimation approach for speech enhancement using hidden Markov models [J]. IEEE Trans Acoust Speech, Signal Process, 1992, 40(4) :725-735.
  • 8Ding Guohong, Wang Xia, Cao Yang, et al. Speech enhancement based on speech spectral complex Gaussian mixture model[C]//IEEE Int Conf Acous- tic, Speech, Signal Process (ICASSP). Philadephia, USA: IEEE, 2005 :165-168.
  • 9Erkelens J S, Jensen J, Heusdens R. Speech en- hancement based on Rayleigh mixture modeling of speech spectral amplitude distributions[C]//Europe- an Signal Proc Conf (EUSIPCO). Poznan, Poland.. [s. n. ], 2007:65-69.
  • 10Hao Jiucang, Lee Te-Won. Speech enhancement using Gaussian scale mixture models[J]. IEEE Trans on ASLP, 2010,18(6):1127-1136.

二级参考文献25

  • 1Manolakis D G.Statistical and adaptive signal processing[M].New York:McGraw-Hill,2003:21-26.
  • 2Bahoura M,Rouat J.Wavelet speech enhancement based on the teager energy operator[J].IEEE Signal Processing Letters,2001,8(1):10-12.
  • 3Yao J,Zhang Y T.Bionic wavelet transform:A newtime-frequency method based on an auditory model[J].IEEE Trans on Biomedical Engineering,2001,48(8):856-863.
  • 4Loizou H Y,Philipos C.Speech enhancement based on wavelet thresholding the multitaper spectrum[J].IEEE Trans on Speech and Audio Processing,2004,12(1):59-67.
  • 5Chen S H,Wang J F.Speech enhancement using perceptual wavelet packet decomposition and teager energy operator[J].Journal of VLSI Signal Processing,2004,36(2):125-139.
  • 6Lei S F,Tung Y K.Speech enhancement for nonstationary noises by wavelet packet transform and adaptive noise estimation[C] //International Sym on Intelligent Signal Processing and Comm Systems.Hong Kong,China:[s.n.] ,2005:41-44.
  • 7Donoho D L.De-noising soft-thresholding[J].IEEE Trans on Information Theory,1995,51(3);613-627.
  • 8EPHRAIM Y,MALAH D.Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator[J].IEEE Trans Acoustic,Speech,Signal Processing,1984,32(6):1109-1121.
  • 9EPHRAIM Y,MALAH D.Speech enhancement using a minimum mean-square error log-spectral amplitude estimator[J].IEEE Trans Acoustic,Speech,Signal Processing,1985,33(2):443-445.
  • 10SOON I Y,KOH S N,YEO C K.Noisy speech enhancement using discrete cosine transform[J].Speech Communication,1998,24(3):249-257.

共引文献11

同被引文献11

引证文献2

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部