期刊文献+

DNN与谱减法相结合的语音分离技术 被引量:2

Speech Separation Combined with DNN and Spectral Subtraction
下载PDF
导出
摘要 针对传统DNN语音分离中噪声干扰的问题,提出了一种在DNN语音分离后期处理中结合DNN和谱减法的语音分离方法。首先提取语音声级特征,通过DNN学习带噪特征到分离目标语音的映射,得到分离目标语音;然后对分离目标语音中每一时频单元进行噪声能量估计;最后,通过快速傅里叶逆变换得到谱减后的分离语音波形信号。通过对不同类型的噪声和不同输入信噪比混合后的语音信号进行试验,结果表明,加入谱减法后分离的语音信号与只经DNN网络输出的语音信号相比,前者分离的语音可懂度和信噪比得到了显著提高,并且分离语音的信号更接近于纯净语音的信号。 In view of the problem of noise interference in traditional DNN speech separation,a speech separation method based on DNN and spectral subtraction was proposed in the post processing of DNN speech separation.Firstly,the features of speech were extracted and the DNN was used to learn the mapping of the noisy features to the separated target speech.Then the noise energy is estimated for each time frequency unit in the separated target speech.Finally,the speech waveform was obtained by the inverse fast fourier transform.By testing the speech signal mixed by different types of noise and different input SNR,the experimental results show that compared with the speech signal output only by the DNN network,the speech signal separated after adding spectral subtraction is significantly improved in the speech intelligibility and signal to noise ratio of the proposed algorithm.The similarities between the separated speech signal and the original clean speech signal has also been greatly improved.
作者 冯利琪 江华 闫格 闵长伟 李玲香 FENG Li-qi;JIANG Hua;YAN Ge;MIN Chang-wei;LI Ling-xiang(Key Laboratory of Granular Computing and Application,Minnan Normal University;School of Computer Science, Minnan Normal University, Zhangzhou 363000,China;School of Electronics and Information Engineering,Hunan University of Science and Engineering,Yongzhou 425199,China)
出处 《软件导刊》 2018年第12期12-17,共6页 Software Guide
基金 国家自然科学基金项目(61472406) 福建省自然科学基金项目(2016J01304) 闽南师范大学人才引进项目
关键词 语音分离 神经网络 谱减法 目标语音 噪声能量估计 speech separation neural networks spectral subtraction target speech noise energy estimation
  • 相关文献

参考文献2

二级参考文献68

  • 1邹霞,陈亮,张雄伟.基于Gamma语音模型的语音增强算法[J].通信学报,2006,27(10):118-123. 被引量:11
  • 2Kim G, Lu Y, Hu Y, Loizou P C. An algorithm that im- proves speech intelligibility in noise for normal-hearing lis- teners. The Journal of the Acoustical Society of America, 2009, 126(3): 1486-1494.
  • 3Dillon H. Hearing Aids. New York: Thieme, 2001.
  • 4Allen J B. Articulation and intelligibility. Synthesis Lectures on Speech and Audio Processing, 2005, 1(1): 1-124.
  • 5Seltzer M L, Raj B, Stern R M. A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition. Speech Communication, 2004, 43(4): 379-393.
  • 6Weninger F, Erdogan H, Watanabe S, Vincent E, Le Roux J, Hershey J R, Schuller B. Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR. In: Proceedings of the 12th International Conference on Latent Variable Analysis and Signal Separation. Liberec, Czech Republic: Springer International Publishing, 2015.91 -99.
  • 7Weng C, Yu D, Seltzer M L, Droppo J. Deep neural networks for single-channel multi-talker speech recognition. IEEE/ ACM Transactions on Audio, Speech, and Language Pro- cessing, 2015, 23(10): 1670-1679.
  • 8Boll S F. Suppression of acoustic noise in speech using spec- tral subtraction. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1979, 27(2): 113-120.
  • 9Chen J D, Benesty J, Huang Y T, Doclo S. New insights into the noise reduction wiener filter. IEEE Transactions on Audio, Speech, and Language Processing, 2006, 14(4): 1218 -1234.
  • 10Loizou P C. Speech Enhancement: Theory and Practice. New York: CRC Press, 2007.

共引文献74

同被引文献4

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部