Abstract
To further improve the performance of speech enhancement methods based on deep neural networks, this paper proposes a speech enhancement method based on a convolutional gated recurrent neural network. It addresses the difficulty of modeling long-term dependencies in noisy speech with a convolutional neural network alone. The method first uses a convolutional neural network to extract local features from the noisy speech, and then uses a gated recurrent neural network to relate the local features from different time segments. By combining the different characteristics of the two networks, it makes better use of the contextual information in noisy speech during enhancement. Experimental results show that the method effectively improves speech enhancement performance under unknown noise conditions, and that the enhanced speech has better quality and intelligibility.
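The two-stage idea described in the abstract (convolution for local features, then a gated recurrent layer to link them across time) can be sketched in plain Python. This is only an illustrative sketch under stated assumptions, not the authors' network: the function names (`conv_over_time`, `gru`, `enhance`, `init_params`), the layer sizes, the kernel width of 3 frames, the ReLU activation, the omission of bias terms, and the random untrained weights are all hypothetical choices made for the example.

```python
import math
import random

def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def rand_matrix(rows, cols, rng, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def conv_over_time(frames, W, k):
    """Local features: each output frame sees k neighbouring input frames.

    Assumes k is odd; the sequence is zero-padded so the length is preserved.
    """
    zero = [0.0] * len(frames[0])
    pad = k // 2
    padded = [zero] * pad + frames + [zero] * pad
    out = []
    for t in range(len(frames)):
        window = [v for frame in padded[t:t + k] for v in frame]  # flatten k frames
        out.append([max(0.0, a) for a in matvec(W, window)])      # ReLU (assumed)
    return out

def gru(seq, p, hidden):
    """Gated recurrent layer (biases omitted): carries context across all frames."""
    h = [0.0] * hidden
    states = []
    for x in seq:
        xh = x + h                                         # concatenate input and state
        z = [sigmoid(a) for a in matvec(p["Wz"], xh)]      # update gate
        r = [sigmoid(a) for a in matvec(p["Wr"], xh)]      # reset gate
        xrh = x + [ri * hi for ri, hi in zip(r, h)]
        cand = [math.tanh(a) for a in matvec(p["Wh"], xrh)]  # candidate state
        h = [(1 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]
        states.append(h)
    return states

def init_params(n_feat, channels=8, k=3, hidden=16, seed=0):
    """Random, untrained weights; all sizes are illustrative assumptions."""
    rng = random.Random(seed)
    return {
        "k": k,
        "hidden": hidden,
        "conv": rand_matrix(channels, k * n_feat, rng),
        "gru": {name: rand_matrix(hidden, channels + hidden, rng)
                for name in ("Wz", "Wr", "Wh")},
        "out": rand_matrix(n_feat, hidden, rng),
    }

def enhance(frames, params):
    """Noisy frames -> conv local features -> GRU context -> per-frame estimate."""
    local = conv_over_time(frames, params["conv"], params["k"])
    states = gru(local, params["gru"], params["hidden"])
    return [matvec(params["out"], h) for h in states]

# Toy input: 10 frames of 5 spectral features each (random stand-in for noisy speech).
rng = random.Random(1)
noisy = [[rng.random() for _ in range(5)] for _ in range(10)]
params = init_params(n_feat=5)
enhanced = enhance(noisy, params)
print(len(enhanced), len(enhanced[0]))
```

The convolution only sees a fixed window of neighbouring frames, whereas the recurrent state `h` is passed from frame to frame, which is how the combined model can in principle relate features that lie far apart in time. In practice the weights would be trained on pairs of noisy and clean speech rather than drawn at random.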
Authors
袁文浩
娄迎曦
夏斌
孙文珠
YUAN Wenhao; LOU Yingxi; XIA Bin; SUN Wenzhu (College of Computer Science and Technology, Shandong University of Technology, Zibo 255000, Shandong, China)
Source
《华中科技大学学报(自然科学版)》
EI
CAS
CSCD
PKU Core (Peking University Core Journal list)
2019, No. 4, pp. 13-18 (6 pages)
Journal of Huazhong University of Science and Technology (Natural Science Edition)
Funding
National Natural Science Foundation of China, Young Scientists Fund (61701286, 11704229)
Natural Science Foundation of Shandong Province (ZR2015FL003, ZR2017MF047, ZR2017LA011, ZR2017LF004)
Keywords
speech enhancement
deep learning
convolutional neural network
recurrent neural network
local feature