Head-related transfer function–reserved time-frequency masking for robust binaural sound source localization (cited by: 2)

Abstract: Various time-frequency (T-F) masks are being applied to sound source localization tasks. Moreover, deep learning has dramatically advanced T-F mask estimation. However, existing masks are usually designed for speech separation tasks and are suitable only for single-channel signals. A novel complex-valued T-F mask is proposed that reserves the head-related transfer function (HRTF) and is customized for binaural sound source localization. In addition, because the convolutional neural network used to estimate the proposed mask takes binaural spectral information as both input and output, accurate binaural cues can be preserved. Compared with conventional T-F masks that emphasize single speech source–dominated T-F units, HRTF-reserved masks eliminate the speech component while keeping the direct propagation path. Thus, the estimated HRTF is capable of extracting more reliable localization features for the final direction-of-arrival estimation. Hence, binaural sound source localization guided by the proposed T-F mask is robust under noisy and reverberant acoustic environments. The experimental results demonstrate that the new T-F mask is superior to conventional T-F masks and leads to better sound source localization performance in adverse environments.
Source: CAAI Transactions on Intelligence Technology (SCIE, EI), 2022, Issue 1, pp. 26-33 (8 pages). Chinese title: 智能技术学报(英文)
Funding: National Natural Science Foundation of China, Grant/Award Numbers: 61673030, U1613209; National Natural Science Foundation of Shenzhen, Grant/Award Number: JCYJ20190808182209321.
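
The abstract's central idea, a complex-valued mask that removes the speech component and leaves only the propagation path, can be illustrated with a toy NumPy sketch. Everything here is an assumption made for illustration and not the paper's definition or method: the oracle mask is taken to be the per-channel ratio `HRTF / mixture`, the HRTFs are frequency-flat constants, there is no noise or reverberation, and the names `binaural_cues`, `mask_left`, and `mask_right` are hypothetical. In the paper the mask is estimated by a convolutional neural network, which this sketch does not attempt.

```python
import numpy as np


def binaural_cues(left, right, eps=1e-12):
    """Per T-F unit: interaural level difference (dB) and phase difference (rad)."""
    left = np.asarray(left, dtype=complex)
    right = np.asarray(right, dtype=complex)
    ild = 20.0 * np.log10((np.abs(left) + eps) / (np.abs(right) + eps))
    ipd = np.angle(left * np.conj(right))
    return ild, ipd


# Toy setup (all values are illustrative assumptions).
rng = np.random.default_rng(0)
n_frames, n_bins = 4, 8
speech = (rng.standard_normal((n_frames, n_bins))
          + 1j * rng.standard_normal((n_frames, n_bins)))

# Frequency-flat stand-ins for the left/right HRTFs of one direction.
hrtf_left = np.full(n_bins, 2.0 * np.exp(1j * 0.3))
hrtf_right = np.full(n_bins, 1.0 + 0.0j)

# Anechoic, noise-free binaural spectra: speech filtered by each HRTF.
mix_left = hrtf_left * speech
mix_right = hrtf_right * speech

# Hypothetical oracle masks: multiplying by them cancels the speech term.
mask_left = hrtf_left / mix_left
mask_right = hrtf_right / mix_right

est_left = mask_left * mix_left      # equals hrtf_left in every frame
est_right = mask_right * mix_right   # equals hrtf_right in every frame

ild, ipd = binaural_cues(est_left, est_right)
```

In this idealized setting the recovered cues no longer depend on the speech content: every T-F unit yields the same level and phase difference, which is the property that makes such estimates attractive as localization features.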