Sound event localization and detection based on deep learning

Abstract: Acoustic source localization (ASL) and sound event detection (SED) are two widely pursued, independent research fields. In recent years, sound event localization and detection (SELD) has become a very active research topic because it gives a more complete spatial and temporal representation of the sound field. This paper presents a deep learning-based algorithm for localizing and detecting multiple overlapping sound events in three-dimensional space. The log-Mel spectrum and the generalized cross-correlation spectrum are concatenated along the channel dimension as input features. A neural network trained on these features performs classification and regression in parallel to obtain the sound recognition and localization results, respectively. A channel attention mechanism is also introduced in the network to selectively enhance the features containing essential information and suppress the useless ones. Finally, a thorough comparison confirms the efficiency and effectiveness of the proposed SELD algorithm. Field experiments show that the proposed algorithm is robust to reverberation and the environment, and achieves higher recognition and localization accuracy than the baseline method.
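To illustrate the input features named in the abstract, the following is a minimal NumPy sketch of joining log-Mel and generalized cross-correlation features along the channel dimension. It is not the authors' implementation. The function names `gcc_phat` and `stack_features`, the PHAT weighting, and the choice of one GCC lag per Mel band (so both feature maps have the same shape) are assumptions made for illustration; the log-Mel spectrogram is assumed to be computed elsewhere.

```python
import itertools
import numpy as np

def gcc_phat(x, y, n_lags):
    """GCC with PHAT weighting between two frames; returns n_lags lags, lag 0 at index n_lags // 2."""
    n = len(x) + len(y)                      # zero-pad to avoid circular wrap-around
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)
    cross = X * np.conj(Y)
    cross = cross / (np.abs(cross) + 1e-12)  # PHAT: keep phase, discard magnitude
    cc = np.fft.irfft(cross, n=n)
    half = n_lags // 2
    # Reorder so negative lags come first and lag 0 sits in the middle.
    return np.concatenate((cc[-half:], cc[:n_lags - half]))

def stack_features(logmel, frames):
    """Concatenate log-Mel and GCC features along the channel axis.

    logmel: (mics, T, F) precomputed log-Mel spectrogram per microphone (assumed input).
    frames: (mics, T, L) time-domain frames aligned with the T spectrogram frames.
    Returns an array of shape (mics + pairs, T, F).
    """
    n_mics, n_frames, n_bands = logmel.shape
    pairs = itertools.combinations(range(n_mics), 2)
    gcc = np.stack([
        np.stack([gcc_phat(frames[i, t], frames[j, t], n_bands)
                  for t in range(n_frames)])
        for i, j in pairs
    ])
    return np.concatenate((logmel, gcc), axis=0)
```

With a four-microphone array there are six microphone pairs, so this sketch would yield ten input channels (four log-Mel plus six GCC) per time-frequency map.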

Authors: ZHAO Dada, DING Kai, QI Xiaogang, CHEN Yu, FENG Hailin

Affiliations: School of Mathematics and Statistics; Science and Technology on Near-Surface Detection Laboratory

Source: Journal of Systems Engineering and Electronics (系统工程与电子技术（英文版）), SCIE, CSCD, 2024, Issue 2, pp. 294-301 (8 pages)

Funding: Supported by the National Natural Science Foundation of China (61877067), the Foundation of Science and Technology on Near-Surface Detection Laboratory (TCGZ2019A002, TCGZ2021C003, 6142414200511), and the Natural Science Basic Research Program of Shaanxi (2021JZ-19).

Keywords: sound event localization and detection (SELD); deep learning; convolutional recursive neural network (CRNN); channel attention mechanism

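Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]; TP18 [Automation and Computer Technology: Control Theory and Control Engineering]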

