期刊文献+

基于子带信噪比估计和软判决的鲁棒双耳声源定位算法

Robust binaural sound source localization based on sub-band SNR estimation and soft decision
下载PDF
导出
摘要 为了提高噪声和混响环境下的双耳声源定位算法性能,提出了一种基于子带信噪比估计和软判决的双耳互功率谱和耳间时间差估计算法.首先根据每帧中每个子带双耳声信号的自相关矩阵估计子带信噪比;其次,将子带信噪比映射为软判决值,并对双耳互功率谱进行加权;最后利用加权后的互功率谱估计耳间时间差,从而判断目标声源方位.仿真测试和实际环境测试均表明:与基于互相关函数、过零率的传统双耳声源定位算法相比,所提算法在噪声和混响的复杂声学环境下,显著提高了双耳声源定位性能. In order to improve the localization performance in noisy and reverberation environments, a robust binaural sound source localization (SSL) algorithm based on sub-band signal-to-noise ratio (SNR) estimation and soft decision is proposed. First, sub-band SNR is estimated based on the au- tocorrelation matrix of sub-band binaural sound signals in each frame. Then, the sub-band SNR is mapped to soft decision value, and the cross power spectrum density (PSD) of binaural sound signal is weighted by soft decision. Finally, inter-aural time difference (ITD) is computed by weighted cross PSD, and the azimuth of sound source is estimated. Simulation and real environment test re- suits show that, compared with the conventional binaural SSL algorithms based on cross correlation and zeros crossing, the localization performance of the proposed algorithm is significantly improved in complex acoustic environments.
出处 《东南大学学报(自然科学版)》 EI CAS CSCD 北大核心 2015年第4期619-624,共6页 Journal of Southeast University:Natural Science Edition
基金 国家自然科学基金资助项目(61201345) 中央高校基本科研业务费专项资金资助项目(2242013K30010)
关键词 双耳声源定位 子带信噪比估计 软判决 耳间时间差 binaural sound source localization sub-band signal-to-noise ratio estimation soft deci-sion inter-aural time difference
  • 相关文献

参考文献12

  • 1Rayleigh L. On our perception of sound direction [ J ]. Philosophical Magazine, 1907, 13 (74) :214 - 232.
  • 2Raspaud M, Viste H, Evangelista G. Binaural source localization by joint estimation of ILD and ITD [J]. IEEE Transactions on Audio, Speech and Language Processing, 2010, 18( 1 ) :68 -77.
  • 3Kim Y I, Kil R M. Estimation of interaural time differ- ences based on zero-crossings in noisy multisource envi- ronments [ J ]. IEEE Transactions on Audio, Speech and Language Processing, 2007, 15 ( 2 ) : 734 - 743,.
  • 4Chau D T, Li J, Akagi M. A DOA estimation algo- rithm based on equalization cancellation theory [ C ]// Proceedings of INTERSPEECH-2010. Maknhari, Chi- ha, Japan, 2010:2770-2773.
  • 5Parisi R, Camoes F, Scarpiniti M, et al. Cepstrum pre- filtering for binaural source localization in reverberant environments [J] 1EEE Signal Processing Letters, 2012, 19(2): 99-102.
  • 6May T, van de Par S, Kohlrausch A. A probabilistic model forrobust localization based on a binaural auditory front-end [ J ]. IEEE Transactions on Audio, Speech and Lan- guage Processing, 2011, 19( 1 ) : 1 - 13.
  • 7May T, van de Par S, Kohlrausch A. A binaural scene analyzer for joint localization and recognition of speakers in the presence of interfering noise sources and reverberation [ J]. IEEE Transactions on Audio, Speech and Language Processing, 2012, 20 ( 7 ) : 2016 -2030.
  • 8Roman N, Wang D L. Binaural tracking of multiple moving sources [ J ]. IEEE Transactions on Audio, Speech and Language Processing, 2008, 16 (4) : 728 - 739.
  • 9Karim Y, Sylvain A, Jean-Luc Z. A binaural sound source localization method using auditive cues and vision [ C ]//Proceedings of ICASSP-2012. Kyoto, Japan, 2012:217 - 220.
  • 10Kim C, Kumar K, Stern R M. Binaural sound source separation motivated by auditory processing [ C ]// Proceedings of ICASSP-2011. Prague, Czech, 2011 : 5072 - 5075.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部