Multi-channel Speech Enhancement Based on Joint Graph Learning (基于联合图学习的多通道语音增强方法)

Abstract: The spatial relationships among channels affect multi-channel noise reduction, and graph signal processing can capture these latent relationships. However, a graph built directly from the physical layout of the array cannot track their time-varying behavior in real time. This paper therefore proposes a multi-channel speech enhancement method based on joint graph learning. First, a joint time-space graph learning method is proposed that optimizes the array spatial graph and the intra-frame speech graph by minimizing the sum of four terms: the smoothness of the multi-channel noisy speech signal on the spatial graph, the smoothness of the reference-channel signal on the intra-frame graph, the sparsity of the spatial graph, and the sparsity of the intra-frame graph. The learned spatial and intra-frame graphs are then used to construct a time-space joint graph of the multi-channel speech signal. On this basis, the multi-channel speech graph signal is transformed with the joint graph Fourier transform and then enhanced with the fixed beamforming (FBF) method. Experimental results show that, compared with the conventional FBF method, the proposed joint graph learning based FBF (JGL-FBF) method significantly improves the signal-to-noise ratio (SNR) and the perceptual evaluation of speech quality (PESQ) score of the enhanced speech. The results also show that the enhancement performance of JGL-FBF depends on the accuracy of delay compensation.
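The smoothness terms and the joint graph Fourier transform mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that the time-space joint graph is a product graph whose Fourier basis is the Kronecker product of the eigenvector bases of the spatial and intra-frame Laplacians, and it uses fixed toy graphs in place of learned ones. The function names (`laplacian`, `smoothness`, `joint_gft`, `inverse_joint_gft`) and the toy sizes are hypothetical.

```python
import numpy as np


def laplacian(W):
    """Combinatorial Laplacian L = D - W of a symmetric weighted adjacency matrix."""
    return np.diag(W.sum(axis=1)) - W


def smoothness(X, L):
    """Graph smoothness tr(X^T L X); each column of X is one graph signal."""
    return float(np.trace(X.T @ L @ X))


def joint_gft(X, L_space, L_time):
    """Joint GFT of X (channels x samples), assuming a product-graph basis."""
    _, U_s = np.linalg.eigh(L_space)  # spatial graph Fourier basis
    _, U_t = np.linalg.eigh(L_time)   # intra-frame graph Fourier basis
    return U_s.T @ X @ U_t, U_s, U_t


def inverse_joint_gft(X_hat, U_s, U_t):
    """Inverse of joint_gft for orthonormal bases U_s and U_t."""
    return U_s @ X_hat @ U_t.T


# Toy setup (assumption): 4 channels on a complete graph, 8 samples on a path graph.
rng = np.random.default_rng(0)
M, N = 4, 8
W_space = np.ones((M, M)) - np.eye(M)
W_time = np.diag(np.ones(N - 1), 1) + np.diag(np.ones(N - 1), -1)
L_space, L_time = laplacian(W_space), laplacian(W_time)

X = rng.standard_normal((M, N))  # stand-in for one frame of multi-channel speech

# Smoothness of all channels on the spatial graph, and of the
# reference channel (row 0 here, an assumption) on the intra-frame graph.
s_space = smoothness(X, L_space)
s_time = smoothness(X[0][:, None], L_time)

X_hat, U_s, U_t = joint_gft(X, L_space, L_time)
X_rec = inverse_joint_gft(X_hat, U_s, U_t)
print("perfect reconstruction:", np.allclose(X, X_rec))
```

In a graph-learning setting, the two smoothness values would be part of the objective being minimized over the graph weights, together with sparsity penalties; here they are only evaluated for fixed graphs.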

Authors: ZHANG Pengcheng; GUO Haiyan; WANG Tingting; YANG Zhen

Affiliations: College of Communication and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China; National Local Joint Engineering Research Center for Communications and Network Technology, Nanjing University of Posts and Telecommunications, Nanjing 210003, China

Source: Journal of Data Acquisition and Processing (《数据采集与处理》), CSCD, Peking University Core Journal, 2023, No. 2, pp. 283-292 (10 pages)

Funding: National Natural Science Foundation of China (62071242).

Keywords: joint graph learning; speech enhancement; multi-channel; beamforming

Classification: TN911.7 [Electronics and Telecommunications: Communication and Information Systems]
