Monaural speech enhancement combining deep neural network and convex optimization

Abstract: The accuracy of noise estimation directly affects the quality of a speech enhancement algorithm. To improve the noise suppression of current speech enhancement algorithms when the noise is estimated, and to solve the associated unconstrained optimization problem effectively, a time-frequency mask algorithm that combines a deep neural network (DNN) with convex optimization is proposed for monaural speech enhancement. First, the power spectra of the noisy speech are extracted as the input of the DNN. Second, the inter-channel correlation factor between the noise and the noisy speech is taken as the training target of the DNN. Next, the objective function of the convex optimization is constructed from the correlation factor obtained from the DNN model. Finally, a new hybrid conjugate gradient method, combined with convex optimization, is used to iteratively process an initial mask, and the final mask is used to obtain the enhanced speech. Simulation results show that, compared with conventional methods and under different background noises at low SNR, the obtained ratio mask gives the enhanced speech better LSD, PESQ, STOI, and segSNR scores, improves overall speech quality, and suppresses noise effectively.
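The last two steps of the abstract (building a convex objective from the DNN output, then refining an initial mask by conjugate gradient iterations) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the target `1 - rho`, the smoothness-regularized quadratic objective, the parameter `lam`, the function names, and the textbook PRP/FR hybrid rule for `beta`, which stands in for the paper's own "new hybrid conjugate gradient method". The correlation factors `rho` are made-up numbers standing in for DNN outputs for one frame.

```python
import numpy as np

def apply_A(m, lam):
    """Hessian-vector product of the toy objective: (I + lam * D^T D) m,
    where D takes first differences along the frequency axis."""
    out = m.copy()
    diff = np.diff(m)
    out[:-1] -= lam * diff
    out[1:] += lam * diff
    return out

def refine_mask(target, init_mask, lam=0.5, iters=50, tol=1e-10):
    """Minimize 0.5*||m - target||^2 + 0.5*lam*||D m||^2 (hypothetical
    convex objective) with a hybrid conjugate gradient iteration."""
    m = init_mask.astype(float).copy()
    g = apply_A(m, lam) - target          # gradient of the objective
    d = -g                                # initial search direction
    for _ in range(iters):
        if g @ g < tol:
            break
        Ad = apply_A(d, lam)
        alpha = -(g @ d) / (d @ Ad)       # exact line search for a quadratic
        m = m + alpha * d
        g_new = apply_A(m, lam) - target
        beta_fr = (g_new @ g_new) / (g @ g)
        beta_prp = (g_new @ (g_new - g)) / (g @ g)
        beta = max(0.0, min(beta_prp, beta_fr))   # generic PRP/FR hybrid
        d = -g_new + beta * d
        g = g_new
    return np.clip(m, 0.0, 1.0)           # keep the result a valid ratio mask

# Made-up correlation factors (noise vs. noisy speech) for one frame.
rho = np.array([0.9, 0.8, 0.3, 0.2, 0.1, 0.4, 0.7, 0.9])
target = 1.0 - rho                        # assumed mask-like target
mask = refine_mask(target, init_mask=np.full(rho.shape, 0.5))

noisy_power = np.array([4.0, 3.5, 9.0, 12.0, 15.0, 8.0, 3.0, 2.5])
enhanced_power = mask * noisy_power       # apply the refined mask per bin
```

In this toy setup the refined mask is a smoothed version of `1 - rho`, so bins where the noise correlates strongly with the noisy speech are attenuated most. The actual objective and update rule in the paper may differ substantially.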

Authors: ZHANG Xiaoyan, ZHANG Tianqi, GE Wanying, BAI Yangliu

Affiliation: School of Communication and Information Engineering / Chongqing Key Laboratory of Signal and Information Processing (CQKLS&IP)

Source: Chinese Journal of Acoustics (声学学报（英文版）), CSCD, 2021, No. 3, pp. 460-476 (17 pages)

Funding: Supported by the National Natural Science Foundation of China (61671095, 61702065, 61701067, 61771085), the Project of Key Laboratory of Signal and Information Processing of Chongqing (CSTC2009CA2003), the Chongqing Graduate Research and Innovation Project (CYS19248), and the Research Project of Chongqing Educational Commission (KJ1600427, KJ1600429).

Keywords: NETWORK, OPTIMIZATION, noise

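Classification: TN912.35 [Electronics and Telecommunications: Communication and Information Systems]; TP18 [Automation and Computer Technology: Control Theory and Control Engineering]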

