期刊文献+

ADMM稀疏非负矩阵分解语音增强算法 被引量:2

Speech enhancement algorithm using ADMM sparse nonnegative matrix factorization
下载PDF
导出
摘要 提出一种基于交替方向乘子法的(Alternating Direction Method of Multipliers,ADMM)稀疏非负矩阵分解语音增强算法,该算法既能克服经典非负矩阵分解(Nonnegative Matrix Factorization,NMF)语音增强算法存在收敛速度慢、易陷入局部最优等问题,也能发挥ADMM分解矩阵具有的强稀疏性。算法分为训练和增强两个阶段:训练时,采用基于ADMM非负矩阵分解算法对噪声频谱进行训练,提取噪声字典,保存其作为增强阶段的先验信息;增强时,通过稀疏非负矩阵分解算法,从带噪语音频谱中对语音字典和语音编码进行估计,重构原始干净的语音,实现语音增强。实验表明,该算法速度更快,增强后语音的失真更小,尤其在瞬时噪声环境下效果显著。 This paper proposes a speech enhancement algorithm putting the theory of Alternating Direction Method of Multipliers(ADMM) into the algorithm of sparse nonnegative matrix factorization, which can solve the problems such as slow convergence and poor local optima in the traditional speech enhancement based Nonnegative Matrix Factorization(NMF). It mainly consists of a training stage and an enhancement stage. During the training stage, the dictionaries of the noise are constructed as the prior information by using the ADMM based nonnegative matrix factorization. In the enhancement stage, the spectrum of noisy speech is analyzed by the sparse normegative matrix factorization algorithm. After that, the noise dictionary is combined with iterative formulation to evaluate the speech dictionary and the coding matrix of speech. The clean part of the speech is finally reconstructed from the noisy speech. Compared with the traditional speech enhancement methods of NMF, extensive experiments indicate that this algorithm not only has faster speed but also gets better noise suppression performance especially under instantaneous noise environment.
出处 《计算机工程与应用》 CSCD 北大核心 2016年第3期108-112,共5页 Computer Engineering and Applications
基金 国家自然科学基金(No.61471394) 江苏省自然科学青年基金(No.Bk20140074)
关键词 语音增强 稀疏非负矩阵分解 交替方向乘子法 speech enhancement sparse nonnegative matrix factorization alternating direction method of multipliers
  • 相关文献

参考文献18

  • 1张雄伟,等.现代语音处理技术及应用[M].北京:机械丁业出版社,2009.
  • 2So S,Paliwal K K.Suppressing the influence of additive noise on the Kalman gain for low residual noise speech enhancement[J].Speech Communication,2011,53(3):355-378.
  • 3Paliwal K,Wokcicki K,Schwerin B.Single channel speech enhancement using spectral subtraction in the short time modulation domain[J].Speech Communication,2010,52(5):450-475.
  • 4邹霞,陈亮,张雄伟.基于Gamma语音模型的语音增强算法[J].通信学报,2006,27(10):118-123. 被引量:11
  • 5Mohammadiha N,Taghia J,Leijon A.Single channel speech enhancement using Bayesian NMF with recursive temporal updates of prior distributions[C]//Acoustics,Speech and Signal Process(ICASSP),2012:4561-4564.
  • 6Xu Y,Du J,Lee L C H.An experimental study on speech enhancement based on deep neural networks[J].IEEE Signal Processing Letters,2014,21(1):65-68.
  • 7Civier O,Tasko S M,Guenther F H.Overreliance on auditory feedback may lead to sound/syllable repetitions:simulations of stuttering and fluency-inducing conditions with a neural model of speech production[J].Journal of Fluency Disorders,2010,35(3):246-279.
  • 8Lee D D,Seung H S.Learning the parts of objects by non-negative matrix factorization[J].Nature,1999,401(10):788-791.
  • 9黄建军,张雄伟,张亚非,邹霞.时频字典学习的单通道语音增强算法[J].声学学报,2012,37(5):539-547. 被引量:13
  • 10张立伟,贾冲,张雄伟,闵刚,曾理.稀疏卷积非负矩阵分解的语音增强算法[J].数据采集与处理,2014,29(2):259-264. 被引量:13

二级参考文献83

  • 1邹霞,陈亮,张雄伟.基于Gamma语音模型的语音增强算法[J].通信学报,2006,27(10):118-123. 被引量:11
  • 2EPHRAIM Y,MALAH D.Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator[J].IEEE Trans Acoustic,Speech,Signal Processing,1984,32(6):1109-1121.
  • 3EPHRAIM Y,MALAH D.Speech enhancement using a minimum mean-square error log-spectral amplitude estimator[J].IEEE Trans Acoustic,Speech,Signal Processing,1985,33(2):443-445.
  • 4SOON I Y,KOH S N,YEO C K.Noisy speech enhancement using discrete cosine transform[J].Speech Communication,1998,24(3):249-257.
  • 5GAZOR S,ZHANG W.Speech probability distribution[J].IEEE Signal Processing Letters,2003,10(7):204-207.
  • 6MARTIN R.Statistical methods for the enhancement of noisy speech[A].IWAENC'2003[C].2003.1-6.
  • 7MARTIN R.Speech enhancement using MMSE short time spectral estimation with Gamma distributed speech priors[A].ICASSP'2002[C].2002.253-256.
  • 8MARTIN R,BREITHAUPT C.Speech enhancement in the DFT domain using Laplacian speech priors[A].IWAENC'2003[C].2003.87-90.
  • 9BREITHAUPT C,MARTIN R.MMSE estimation of magnitudesquared DFT coefficients with supergaussian priors[A].ICASSP'2003[C].2003.896-899.
  • 10CHEN B,LOIZOU P C.Speech enhancement using a MMSE short time spectral amplitude estimator with Laplacian speech modeling[A].ICASSP'2005[C].2005.1097-1100.

共引文献53

同被引文献22

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部