Head-related transfer function–reserved time-frequency masking for robust binaural sound source localization (cited by: 2)

Abstract: Various time-frequency (T-F) masks are being applied to sound source localization tasks. Moreover, deep learning has dramatically advanced T-F mask estimation. However, existing masks are usually designed for speech separation tasks and are suitable only for single-channel signals. This paper proposes a novel complex-valued T-F mask that reserves the head-related transfer function (HRTF) and is customized for binaural sound source localization. In addition, because the convolutional neural network used to estimate the proposed mask takes binaural spectral information as both input and output, accurate binaural cues can be preserved. Compared with conventional T-F masks that emphasize single speech source–dominated T-F units, HRTF-reserved masks eliminate the speech component while keeping the direct propagation path. Thus, the estimated HRTF is capable of yielding more reliable localization features for the final direction-of-arrival estimation. Hence, binaural sound source localization guided by the proposed T-F mask is robust under noisy and reverberant acoustic environments. The experimental results demonstrate that the new T-F mask is superior to conventional T-F masks and leads to better sound source localization performance in adverse environments.
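The idea of a mask that removes the speech component but keeps the direct propagation path can be illustrated for a single T-F unit. The sketch below is not the paper's formulation: the oracle mask definition (`hrtf / mixture`), the helper names `oracle_hrtf_mask` and `interaural_cues`, and all numeric values are assumptions chosen only to show why cues computed from a recovered HRTF pair can be cleaner than cues computed from the noisy mixture. In practice the mask would be estimated by a network rather than computed from a known HRTF.

```python
import cmath
import math


def interaural_cues(left, right):
    """Return (level difference in dB, phase difference in rad) for one T-F unit."""
    ratio = left / right
    return 20.0 * math.log10(abs(ratio)), cmath.phase(ratio)


def oracle_hrtf_mask(hrtf, mixture):
    """Hypothetical oracle mask: the complex ratio that maps the mixture onto the HRTF."""
    return hrtf / mixture


# Illustrative complex values for one T-F unit (assumed, not taken from the paper).
source = cmath.rect(2.0, 0.7)    # speech component
hrtf_l = cmath.rect(1.0, 0.3)    # left-ear direct-path transfer function
hrtf_r = cmath.rect(0.5, -0.2)   # right-ear direct-path transfer function
noise_l = cmath.rect(0.4, 2.0)   # interference at the left ear
noise_r = cmath.rect(0.6, -1.0)  # interference at the right ear

# Binaural mixture: direct-path speech plus interference at each ear.
mix_l = hrtf_l * source + noise_l
mix_r = hrtf_r * source + noise_r

# Applying the oracle mask removes the speech term and leaves the HRTF.
est_l = oracle_hrtf_mask(hrtf_l, mix_l) * mix_l
est_r = oracle_hrtf_mask(hrtf_r, mix_r) * mix_r

true_cues = interaural_cues(hrtf_l, hrtf_r)
masked_cues = interaural_cues(est_l, est_r)
mixture_cues = interaural_cues(mix_l, mix_r)

print("HRTF cues:    ", true_cues)
print("masked cues:  ", masked_cues)
print("mixture cues: ", mixture_cues)
```

With these assumed values, the cues from the masked output match those of the HRTF pair, while the cues taken directly from the mixture are biased by the interference.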

Authors: Hong Liu, Peipei Yuan, Bing Yang, Ge Yang, Yang Chen

Affiliations: Key Laboratory of Machine Perception; School of Artificial Intelligence; Yanka Kupala State University of Grodno

Source: CAAI Transactions on Intelligence Technology (智能技术学报（英文）), indexed in SCIE and EI, 2022, Issue 1, pp. 26-33 (8 pages)

Funding: National Natural Science Foundation of China, Grant/Award Numbers: 61673030, U1613209; National Natural Science Foundation of Shenzhen, Grant/Award Number: JCYJ20190808182209321.

Keywords: estimation; sound; function

Classification code: O42 [Science: Acoustics]
