基于Kalman滤波与频率聚焦的单声源到达方向实时估计与跟踪方法

Real time estimation and tracking method for the direction of arrival of single sound source based on Kalman filtering and frequency focusing

原文传递

导出

摘要为改善在噪声、混响及声源移动情况下传统到达方向(direction of arrival,DOA)估计方法的性能,该文提出一种基于Kalman滤波与频率聚焦的单声源DOA实时估计与跟踪方法。该方法由去噪、去混响和DOA估计3个步骤构成。其中:去噪与去混响步骤的目标函数分别由最小化去噪信号误差和多通道线性预测系数误差建立,并分别通过Kalman滤波求解;DOA估计步骤通过基于频率聚焦的导向响应功率实现。该文所提方法建立在传播矩阵集成去混响与去噪步骤的基础上,通过波束形成获得的期望信号的先验估计,DOA估计步骤被进一步集成,从而促进3个步骤间的因果有序迭代。实验结果表明:与参考方法相比,该文所提方法的DOA估计与跟踪性能更优。 [Objective] Estimation of direction of arrival(DOA) is critical in spatial audio coding,speech enhancement,sound field synthesis,and sound source imaging.Commonly used signal model-based DOA estimation methods,such as the multiple signal classification method,can effectively estimate DOA information in noise-free and anechoic scenarios.However,real-world environments always have noise and reverberation,particularly in far-field speech communication scenarios characterized by low signal-to-noise ratios and strong reverberation.Furthermore,the sound source may be in motion.These factors considerably impair the performance of DOA estimation methods based on signal models.To address this issue,this paper introduces a real-time estimation and tracking method for the DOA of a single sound source,using Kalman filtering and frequency focusing.[Methods] The proposed method consists of three procedures:denoising,dereverberation,and DOA estimation.With regard to the denoising procedure,an objective optimization function to minimize the error of the denoised signal is established.This function is solved using a Kalman filter,which leads to obtaining the denoised signal through Kalman gain-based posterior estimation.For the dereverberation procedure,based on the autoregressive coefficients of the late reverberation components,an objective optimization function to minimize the error of the multichannel linear prediction(MCLP) coefficients is established.This function is also solved through another Kalman filter to obtain the MCLP coefficients.The DOA estimation procedure is implemented by using a frequency focusing based steered response power(FF-SRP) method,which can circumvent signal component diffusion within subspace decomposition.In particular,a structure that effectively intertwines these three procedures,enhancing the contribution of denoising and dereverberation results to DOA estimation.In this structure,a propagation matrix is utilized to integrate the denoising and dereverberation procedures,creating a causative iteration between them.Subsequently,a minimum variance distortionless response(MVDR) beamforming method is used to replace the multichannel Wiener filtering method.This is to obtain a prior estimation of the covariance matrix of the target signal.The MVDR beamforming method offers two advantages:it reduces the distortion of the target signal and integrates the DOA estimation procedure with the denoising procedure,thereby promoting a causal and orderly iteration among the three procedures.[Results] Experiments were conducted using a microphone array signal simulator and the TIMIT corpus.The mean absolute error(MAE) of the estimated DOA,along with the DOA track of the moving speaker,served as the evaluation measures.Experimental results revealed several key findings:(1) As RT_(60) increased,the MAE of all methods increased,clearly demonstrating that reverberation significantly affects DOA estimation performance.(2) Compared with the reference methods,the proposed method consistently delivered the lowest MAE values under different RT_(60)s and SNRs.This suggests that the proposed method has higher accuracy in DOA estimation.(3) In terms of DOA trajectory,the proposed method again outperformed the reference methods by producing the smallest error.This indicates that the proposed method has better performance in DOA tracking.[Conclusions] By integrating denoising,dereverberation,and DOA estimation through a causal and recursive iteration structure,the performance of DOA estimation and tracking can be significantly enhanced.The proposed method effectively mitigates the detrimental impact of noise and reverberation on DOA estimation and tracking accuracy in single sound source scenarios.

作者周静鲍长春段海威 ZHOU Jing;BAO Changchun;DUAN Haiwei(Institute of Speech and Audio Signal Processing,Faculty of Information Technology,Beijing University of Technology,Beijing 100124,China)

机构地区北京工业大学信息学部

出处《清华大学学报（自然科学版）》 EI CAS CSCD 北大核心 2024年第11期1902-1910,共9页 Journal of Tsinghua University(Science and Technology)

基金国家自然科学基金项目(61831019)。

关键词到达方向估计多通道线性预测 KALMAN滤波频率聚焦去混响 direction of arrival estimation multichannel linear prediction Kalman filtering frequency focusing dereverberation

分类号 TP912.3 [自动化与计算机技术]

引文网络
相关文献

参考文献3

1周静,鲍长春,张旭.基于聚焦信号子空间估计导向矢量的干扰声源抑制方法[J].电子学报,2023,51(1):76-85. 被引量：1
2Xue Yang,Changchun Bao,Zihao Cui.Weighting Function Modification Used for Phase Transform-Based Time Delay Estimation[J].China Communications,2022,19(11):241-256. 被引量：1
3厉剑,彭任华,郑成诗,李晓东.球谐域自适应混响抵消与声源定位算法[J].声学学报,2019,44(5):874-886. 被引量：5

二级参考文献7

1鄢社锋,侯朝焕,马晓川.从阵元域到模态域阵列信号处理[J].声学学报,2011,36(5):461-468. 被引量：18
2刘月婵,何元安,商德江,尚大晶,孙超.高精度球面阵聚焦声源定位方法研究[J].声学学报,2013,38(5):533-540. 被引量：9
3张揽月,丁丹丹,杨德森,时胜国,朱中锐.阵元随机均匀分布球面阵列联合噪声源定位方法[J].物理学报,2017,66(1):140-151. 被引量：10
4杨志伟,张攀,陈颖,许华健.导向矢量和协方差矩阵联合迭代估计的稳健波束形成算法[J].电子与信息学报,2018,40(12):2874-2880. 被引量：10
5何礼,周翊,刘宏清.利用相位时频掩蔽的麦克风阵列噪声消除方法[J].信号处理,2018,34(12):1490-1498. 被引量：4
6曹司磊,曾维贵,王磊.色噪声下基于差分聚焦的宽带DOA估计方法[J].哈尔滨工业大学学报,2021,53(2):140-145. 被引量：3
7贾思宇,路茗,丁华泽,陈明,赵鲁阳.一种改进的信号子空间聚焦宽带DOA估计算法[J].计算机工程,2022,48(1):175-181. 被引量：3

共引文献4

1柯雨璇,厉剑,彭任华,郑成诗,李晓东.用于自适应波束形成语音增强的球谐域掩蔽函数估计方法[J].声学学报,2021,46(1):67-80. 被引量：4
2韩哲,郑成诗,柯雨璇,李晓东.分布式无线声传感网加权预测误差语声去混响方法[J].应用声学,2022,41(1):21-31. 被引量：3
3张巧花,张纯.圆形阵列无线传感器的鸟鸣声检测方法[J].应用声学,2022,41(3):381-387. 被引量：1
4高伟霞,陈华伟.利用时频相关性的球谐波阶数感知鲁棒伪声强多声源定位[J].声学学报,2023,48(5):1045-1059.

1高春艳,赖光金,吕晓玲,白祎扬,张明路.基于卷积神经网络的移动机器人声源定位方法综述[J].科学技术与工程,2024,24(7):2617-2624. 被引量：1
2王汝桥,张谊,何玉鹏,周岱.基于自注意力机制的时间序列预测及异常检测研究[J].电工技术,2024(19):55-57.
3郭苏瑶,陈香瑶(指导).追[J].作文成功之路,2024(33):8-8.
4熊坤来,沙志超.基于数据驱动的高精度阵列测向方法[J].无线电通信技术,2024,50(5):1016-1023.
5戴计生,丁荣军,付勇,刘欣,于天剑.基于谱估计和中心峭度的电机轴承故障诊断[J].铁道科学与工程学报,2024,21(10):4344-4356.
6李贤琪,廖孟光,林东方.优化BP神经网络模型的边坡安全多因素滚动预报[J].工程勘察,2024,52(11):46-53.
7张宇桢,高洋.基于标识解析技术的新能源风电供应链协同与追溯场景新模式应用研究[J].IT经理世界,2024(7):11-14.
8王丹.黔东南苗族民歌的考察与研究[J].黄河之声,2024(17):34-37.
9吴跃华.香港“马礼逊学堂”音乐教育考释[J].南京艺术学院学报（音乐与表演版）,2024(5):14-19.
10张彪.基于频谱分析的煤矿螺旋滚筒洗煤机故障振动诊断[J].煤矿机械,2024,45(11):175-179.

清华大学学报（自然科学版）

2024年第11期

浏览历史

内容加载中请稍等...

基于Kalman滤波与频率聚焦的单声源到达方向实时估计与跟踪方法

参考文献3

二级参考文献7

共引文献4

相关作者

相关机构

相关主题

浏览历史