
Monaural speech enhancement combining deep neural network and convex optimization

Abstract: The accuracy of noise estimation directly affects the quality of speech enhancement algorithms. To improve the noise suppression of current speech enhancement algorithms when noise is estimated, and to solve the unconstrained optimization problem effectively, a time-frequency mask algorithm based on a deep neural network (DNN) combined with convex optimization is proposed for monaural speech enhancement. First, the power spectrum of the noisy speech is extracted as the input of the DNN. Second, the inter-channel correlation factor between the noise and the noisy speech is taken as the training target of the DNN. Next, the objective function of the convex optimization is constructed from the correlation factor obtained from the DNN model. Finally, a new hybrid conjugate gradient method, combined with convex optimization, iteratively refines an initial mask, and the final mask is used to obtain the enhanced speech. Simulation results show that, compared with conventional methods, under different background noises at low SNR the resulting ratio mask gives the enhanced speech better LSD, PESQ, STOI and segSNR scores, improves overall speech quality, and effectively suppresses noise.
Source: Chinese Journal of Acoustics (声学学报(英文版)), CSCD, 2021, No. 3, pp. 460-476 (17 pages)
Funding: Supported by the National Natural Science Foundation of China (61671095, 61702065, 61701067, 61771085), the Project of Key Laboratory of Signal and Information Processing of Chongqing (CSTC2009CA2003), the Chongqing Graduate Research and Innovation Project (CYS19248), and the Research Project of Chongqing Educational Commission (KJ1600427, KJ1600429).
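
As a rough illustration of the time-frequency masking idea described in the abstract, the sketch below extracts power spectra and applies a ratio mask to a noisy signal. It is not the paper's method: the abstract does not specify the correlation factor, the convex objective, or the hybrid conjugate gradient update, so none of them is reproduced here. Instead, an oracle ratio mask computed from known speech and noise stands in for the mask that the DNN and the optimization would produce. All function names, the frame length, the hop size, and the toy signals are assumptions made for this example.

```python
import numpy as np

def stft(x, frame_len=256, hop=128):
    """Short-time Fourier transform with a Hann window (frames x bins)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def power_spectrum(x, frame_len=256, hop=128):
    """Power spectrum per frame: squared magnitude of the STFT."""
    return np.abs(stft(x, frame_len, hop)) ** 2

def oracle_ratio_mask(speech, noise, eps=1e-12):
    """Stand-in mask (assumption): speech power / (speech + noise power)."""
    ps = power_spectrum(speech)
    pn = power_spectrum(noise)
    return ps / (ps + pn + eps)

def apply_mask(noisy, mask):
    """Scale the noisy STFT by the mask, keeping the noisy phase."""
    return stft(noisy) * mask

# Toy signals (assumptions): a pure tone as "speech" plus white noise.
rng = np.random.default_rng(0)
t = np.arange(4000) / 8000.0
speech = np.sin(2 * np.pi * 440.0 * t)
noise = 0.5 * rng.standard_normal(t.size)
noisy = speech + noise

mask = oracle_ratio_mask(speech, noise)
enhanced = apply_mask(noisy, mask)

# Mean squared spectral error against the clean speech, before and after.
err_noisy = np.mean(np.abs(stft(noisy) - stft(speech)) ** 2)
err_enhanced = np.mean(np.abs(enhanced - stft(speech)) ** 2)
print(mask.shape, err_enhanced < err_noisy)
```

In a real system the clean speech and noise are unknown at test time, which is why the mask has to be estimated, for example by a trained network followed by iterative refinement as the abstract outlines.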
