期刊文献+

基于生成对抗网络合成噪声的语音增强方法研究 被引量:5

Research on speech enhancement method based on generating noise using GAN
下载PDF
导出
摘要 在语音增强领域,深度神经网络通过对大量含有不同噪声的语音以监督学习方式进行训练建模,从而提升网络的语音增强能力。然而不同类型噪声的获取成本较大,噪声类型难以全面采集,影响了模型的泛化能力。针对这个问题,提出一种基于生成对抗网络(Generative Adversarial Networks,GAN)的噪声数据样本增强方法,该方法对真实噪声数据进行学习,根据数据特征合成虚拟噪声,以此扩充训练集中噪声数据的数量和类型。通过实验验证,所采用的噪声合成方法能够有效扩展训练集中噪声来源,增强模型的泛化能力,有效提高语音信号去噪处理后的信噪比和可理解性。 In the field of speech enhancement,deep neural network can improve the enhancement ability of the model by training and modeling a large number of data with different noises in the supervised learning way.However,the acquisition cost of different types of noise is large and the noise types are difficult to be comprehensive,which affects the generalization ability of the model.Aiming at this problem,this paper proposes a noise data augmentation method based on generative adversarial network(GAN),which learns from the real noise data and synthesizes virtual noises according to the data features,so as to expand the number and type of the noise data in the training set.Experimental results show that the method of noise synthesis adopted in this article can effectively expand the source of noise in the training set,enhance the generalization ability of the model,and effectively improve the signal-to-noise ratio and intelligibility of speech signal after denoising.
作者 夏鼎 徐文涛 Xia Ding;Xu Wentao(School of Science,Nanjing University of Aeronautics and Astronautics,Nanjing 211106,China)
出处 《电子技术应用》 2020年第11期56-59,64,共5页 Application of Electronic Technique
关键词 语音增强 生成对抗网络 数据增强 speech enhancement generative adversarial network data augmentation
  • 相关文献

参考文献2

二级参考文献29

  • 1胡广书.数字信号处理-理论、算法与实现[M].北京:清华大学出版社,2001..
  • 2Gabrea M. Adaptive Kalman filtering-based speech enhancement algorithm. Canadian Conference on Electrical and Computer Engineering, Toronto, 2001.
  • 3Ogata S, Shimamum T. Reinforced spectral subtraction method to enhance speech signal. Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology, Singapore, 2001.
  • 4Pei Ding, Zhigang Cao.Combining MMSE enhancement with LA model adaptation for robust automatic speech recognition[J]. Electronics Letters, 2001,37(8):539 - 540.
  • 5Martin R. Speech enhancement using MMSE short time spectral estimation with gamma distributed speech priors. IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, 2002.
  • 6ChangHuai You, SooNgee Koh, Rahardja S. Adaptive /spl beta/-order MMSE estimation for speech enhancement. IEEE International Conference on Acoustics, Speech, and Signal Processing, Singapore, 2003.
  • 7Dai Qijun, Chen Yanpu, Bian Zhengzhong. Optimizing speech enhancement based on noise masked probability.2002 6th International Conference on SignalProcessing,, Xi'an, 2002.
  • 8Yi Hu, Loizou P C. Speech enhancement based on wavelet thresholding the multitaper pectrum[J]. IEEE Transactions on Speech and Audio Processing, 2004, 12(1):59- 67.
  • 9Rezayee A, Gazor S. An adaptive KLT approach for speech enhancement[J]. IEEE Transactions on Speech and Audio Processing, 2001, 9(2): 87 - 95.
  • 10Hasan M K, Zilany M S A, Khan M R. DCT speech enhancement with hard and soft thresholding criteria[J].Electronics Letters, 2002,38(13):669 - 670.

共引文献25

同被引文献54

引证文献5

二级引证文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部