Adaptive Unscented Particle Filter Algorithm Based on Multi-Feature for Speaker Tracking in Noisy and Reverberant Environments
(Original Chinese title: 噪声混响下说话人跟踪的多特征自适应UPF算法)
Abstract: To improve the accuracy and robustness of speaker tracking in noisy and reverberant environments, a multi-feature adaptive unscented particle filter (MFAUPF) algorithm is proposed. The algorithm takes multiple features of the speech signal as observations. Multi-hypothesis and frequency-selection functions are used to build a time-delay selection mechanism and a beam-output-energy optimization mechanism, and the likelihood function is constructed by fusing the two mechanisms, which compensates for the inability of any single feature to remain robust to noise and reverberation at the same time. Because speaker motion is random, an adaptive constant-velocity (CV) model is established for source tracking; on this basis, the unscented Kalman filter (UKF) is combined with robust estimation theory to form the proposal distribution, which improves the adaptability of the model. Simulation and real-data experiments show that, under AUPF, the multi-feature algorithm reduces the average position RMSE by more than 18% relative to the SBFSRP algorithm, and that, under multi-feature observations, the AUPF algorithm reduces the average position RMSE by more than 14% relative to the CV algorithm. The proposed algorithm offers high tracking accuracy and strong numerical stability.
Authors: Liu Wangsheng (刘望生); Pan Haipeng (潘海鹏); Wang Minghuan (王明环) (School of Mechanical Engineering and Automation, Zhejiang Sci-Tech University, Hangzhou 310018, China; Key Laboratory of Special Purpose Equipment and Advanced Processing Technology, Zhejiang University of Technology, Ministry of Education, Hangzhou 310012, China)
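Source: Chinese Journal of Scientific Instrument (《仪器仪表学报》), 2022, No. 4, pp. 224-233 (10 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: Supported by the National Natural Science Foundation of China (51975532).
Keywords: speaker tracking; microphone array; room reverberation; multi-feature; AUPF algorithm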
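The abstract describes a particle filter that propagates speaker states with a constant-velocity (CV) model and weights them with a likelihood function. As a rough illustration of that predict, weight, and resample cycle, the following is a minimal sketch of a plain bootstrap particle filter, not the paper's method: it uses the CV prior as the proposal instead of the UKF-plus-robust-estimation proposal, and an isotropic Gaussian position likelihood as a stand-in for the fused multi-feature likelihood. All function names, the 2-D state layout `(x, y, vx, vy)`, and every numeric parameter are assumptions made for this example.

```python
import math
import random


def cv_step(state, dt, accel_std, rng):
    """Propagate (x, y, vx, vy) one step under a constant-velocity model
    driven by zero-mean Gaussian random acceleration."""
    x, y, vx, vy = state
    ax = rng.gauss(0.0, accel_std)
    ay = rng.gauss(0.0, accel_std)
    return (x + vx * dt + 0.5 * ax * dt * dt,
            y + vy * dt + 0.5 * ay * dt * dt,
            vx + ax * dt,
            vy + ay * dt)


def likelihood(state, z, meas_std):
    """Isotropic Gaussian likelihood of a 2-D position observation z.
    Assumption: stands in for the paper's fused multi-feature likelihood."""
    dx, dy = state[0] - z[0], state[1] - z[1]
    return math.exp(-0.5 * (dx * dx + dy * dy) / (meas_std * meas_std))


def systematic_resample(particles, weights, rng):
    """Draw len(particles) samples with probability proportional to weights."""
    n = len(particles)
    offset = rng.random() / n
    out, i, cum = [], 0, weights[0]
    for k in range(n):
        u = offset + k / n
        while u >= cum and i < n - 1:
            i += 1
            cum += weights[i]
        out.append(particles[i])
    return out


def pf_step(particles, z, dt, accel_std, meas_std, rng):
    """One bootstrap-filter cycle: predict, weight, estimate, resample."""
    predicted = [cv_step(p, dt, accel_std, rng) for p in particles]
    w = [likelihood(p, z, meas_std) for p in predicted]
    total = sum(w)
    n = len(predicted)
    # Fall back to uniform weights if every likelihood underflowed to zero.
    w = [wi / total for wi in w] if total > 0.0 else [1.0 / n] * n
    estimate = tuple(sum(wi * p[d] for wi, p in zip(w, predicted))
                     for d in range(4))
    return systematic_resample(predicted, w, rng), estimate


def rmse(estimates, truths):
    """Root-mean-square position error over a trajectory."""
    sq = [(e[0] - t[0]) ** 2 + (e[1] - t[1]) ** 2
          for e, t in zip(estimates, truths)]
    return math.sqrt(sum(sq) / len(sq))


def run_demo(seed=0, n_particles=500, n_steps=50, dt=0.1):
    """Track a synthetic source moving at constant velocity; return RMSE."""
    rng = random.Random(seed)
    particles = [(rng.gauss(0, 0.3), rng.gauss(0, 0.3),
                  rng.gauss(0, 1), rng.gauss(0, 1))
                 for _ in range(n_particles)]
    truth = (0.0, 0.0, 1.0, 0.5)
    estimates, truths = [], []
    for _ in range(n_steps):
        truth = cv_step(truth, dt, 0.0, rng)  # noise-free ground truth
        z = (truth[0] + rng.gauss(0, 0.05), truth[1] + rng.gauss(0, 0.05))
        particles, est = pf_step(particles, z, dt, 3.0, 0.1, rng)
        estimates.append(est)
        truths.append(truth)
    return rmse(estimates, truths)
```

Calling `run_demo()` returns the position RMSE of the synthetic track, which is the kind of figure the abstract compares across algorithms. Replacing `likelihood` with a function of time-delay and beam-energy features, and replacing the prior proposal in `pf_step` with a UKF-based one, would move the sketch toward the structure the abstract outlines.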

