9 articles found
Sound event localization and detection based on deep learning
1
Authors: ZHAO Dada, DING Kai, QI Xiaogang, CHEN Yu, FENG Hailin. Journal of Systems Engineering and Electronics (SCIE, CSCD), 2024, Issue 2, pp. 294-301, 8 pages.
Acoustic source localization (ASL) and sound event detection (SED) are two widely pursued independent research fields. In recent years, in order to achieve a more complete spatial and temporal representation of the sound field, sound event localization and detection (SELD) has become a very active research topic. This paper presents a deep learning-based multi-overlapping sound event localization and detection algorithm in three-dimensional space. The Log-Mel spectrum and the generalized cross-correlation spectrum are joined together in the channel dimension as input features. These features are classified and regressed in parallel after training by a neural network to obtain sound recognition and localization results, respectively. A channel attention mechanism is also introduced in the network to selectively enhance the features containing essential information and suppress the useless features. Finally, a thorough comparison confirms the efficiency and effectiveness of the proposed SELD algorithm. Field experiments show that the proposed algorithm is robust to reverberation and environment and can achieve higher recognition and localization accuracy compared with the baseline method.
Keywords: sound event localization and detection (SELD); deep learning; convolutional recursive neural network (CRNN); channel attention mechanism
Sound Source Localization Based on SRP-PHAT Spatial Spectrum and Deep Neural Network (cited by: 3)
2
Authors: Xiaoyan Zhao, Shuwen Chen, Lin Zhou, Ying Chen. Computers, Materials & Continua (SCIE, EI), 2020, Issue 7, pp. 253-271, 19 pages.
Microphone array-based sound source localization (SSL) is a challenging task in adverse acoustic scenarios. To address this, a novel SSL algorithm based on a deep neural network (DNN) using the steered response power-phase transform (SRP-PHAT) spatial spectrum as the input feature is presented in this paper. Since the SRP-PHAT spatial power spectrum contains spatial location information, it is adopted as the input feature for sound source localization. The DNN is exploited to extract the efficient location information from the SRP-PHAT spatial power spectrum due to its advantage in extracting high-level features. The SRP-PHAT at each steering position within a frame is arranged into a vector, which is treated as the DNN input. A DNN model which can map the SRP-PHAT spatial spectrum to the azimuth of the sound source is learned from the training signals. The azimuth of the sound source is estimated through the trained DNN model from the testing signals. Experimental results demonstrate that the proposed algorithm significantly improves localization performance whether or not the training and testing conditions are the same, and is more robust to noise and reverberation.
Keywords: sound source localization; microphone array; steered response power-phase transform (SRP-PHAT) spatial spectrum; deep neural network
A FAST SEARCH METHOD OF STEERED RESPONSE POWER WITH SMALL-APERTURE MICROPHONE ARRAY FOR SOUND SOURCE LOCALIZATION (cited by: 1)
3
Authors: Zhao Xiaoyan, Tang Jie, Zhou Lin, Wu Zhenyang. Journal of Electronics (China), 2013, Issue 5, pp. 483-490, 8 pages.
The Steered Response Power (SRP) method works well for sound source localization in noisy and reverberant environments. However, its high computational complexity limits its practical application. In this paper, a fast SRP search method is proposed to reduce the computational complexity using a small-aperture microphone array. The proposed method, inspired by the SRP spatial spectrum, includes two steps: first, it roughly estimates the azimuth of the sound source and determines whether the sound source is in the far field or the near field; then, different fine searching operations are performed according to whether the sound source is in the far field or the near field. Experiments in both simulated and real environments have been performed to compare the localization accuracy and computational complexity of the proposed method with those of the conventional SRP-PHAT algorithm. The results show that the proposed method has accuracy comparable to that of the conventional SRP algorithm, and achieves a reduction of 93.62% in computational complexity compared to the conventional SRP algorithm.
Keywords: sound source localization; Steered Response Power (SRP); Three-line method; Small-aperture microphone array
Microphone Array-Based Sound Source Localization Using Convolutional Residual Network (cited by: 1)
4
Authors: Ziyi Wang, Xiaoyan Zhao, Hongjun Rong, Ying Tong, Jingang Shi. Journal of New Media, 2022, Issue 3, pp. 145-153, 9 pages.
Microphone array-based sound source localization (SSL) is widely used in a variety of applications such as video conferencing, robotic hearing, speech enhancement, and speech recognition. Traditional SSL methods cannot achieve satisfactory performance in adverse noisy and reverberant environments. In order to improve localization performance, a novel SSL algorithm using a convolutional residual network (CRN) is proposed in this paper. The spatial features, including time differences of arrival (TDOAs) between microphone pairs and the steered response power-phase transform (SRP-PHAT) spatial spectrum, are extracted in each Gammatone sub-band. The spatial features of different sub-bands within a frame are combined into a feature matrix as the input of the CRN. The proposed algorithm employs the CRN to fuse the spatial features. Since the CRN introduces the residual structure on the basis of the convolutional network, it reduces the difficulty of the training procedure and accelerates the convergence of the model. A CRN model is learned from training data in various reverberation and noise environments to establish the mapping between the input feature and the sound azimuth. Simulations show that, compared with methods using a traditional deep neural network, the proposed algorithm achieves better localization performance in the SSL task and provides better generalization capacity to untrained noise and reverberation.
Keywords: Convolutional residual network; microphone array; spatial features; sound source localization
SOUND SOURCE LOCALIZATION OF DIGITAL HEARING AIDS USING WAVELET BASED MULTIVARIATE STATISTICAL METHOD
5
Authors: Liang Ruiyu, Zou Cairog, Wang Qingyu, Xi Ji. Journal of Electronics (China), 2010, Issue 4, pp. 571-576, 6 pages.
This letter proposes a sound source localization method for digital hearing aids using wavelet-based multivariate statistics with the Generalized Cross Correlation (GCC) algorithm. The Haar wavelet is used to decompose GCC sequences and extract four wavelet characteristics. Then, the Hotelling T2 statistical method is used to fuse the four wavelet characteristics. The statistical value is used to judge the number of sound sources and to obtain the corresponding time delay estimate, which is used to localize the sound source. The experimental results show that the proposed method is more robust in an environment with severe noise and reverberation. Meanwhile, the complexity of the algorithm is moderate, which makes it suitable for sound source localization in hearing aids.
Keywords: sound source localization; Wavelet decomposition; Hotelling T2 statistical model; Digital hearing aids
Accelerated steered response power method for sound source localization via clustering search (cited by: 5)
6
Authors: ZHAO XiaoYan, TANG Jie, ZHOU Lin, WU ZhenYang. Science China (Physics, Mechanics & Astronomy) (SCIE, EI, CAS), 2013, Issue 7, pp. 1329-1338, 10 pages.
The steered response power-phase transform (SRP-PHAT) sound source localization algorithm is robust in a real environment. However, its high computational complexity limits the practical application of SRP-PHAT. For a microphone array, each location corresponds to a set of time differences of arrival (TDOAs), and this paper collects them into a TDOA vector. Since the TDOA vectors in adjacent regions are similar, we present a fast algorithm based on clustering search to reduce the computational complexity of SRP-PHAT. In the training stage, the K-means or Iterative Self-Organizing Data Analysis Technique (ISODATA) clustering algorithm is used to find the centroid of each cluster of similar TDOA vectors. In the localization procedure, the optimal cluster is found by comparing the steered response powers (SRPs) of all centroids. The SRPs of all candidate locations in the optimal cluster are then compared to localize the sound source. Experiments in both simulated and real environments have been performed to compare the localization accuracy and computational load of the proposed method with those of the conventional SRP-PHAT algorithm. The results show that the proposed method is able to reduce the computational load drastically while maintaining almost the same localization accuracy and robustness as the conventional SRP-PHAT algorithm. The difference in localization performance brought by the different clustering algorithms used in the training stage is trivial.
Keywords: sound source localization; microphone array; steered response power; clustering search
Analyse and sound image localization experiment study on multi-channel planar surround sound system (cited by: 6)
7
Authors: XIE Bosun and XIE Xingfu (Applied Physics Dept., South China University of Technology, Guangzhou 510641). Chinese Journal of Acoustics, 1996, Issue 1, pp. 52-64, 13 pages.
In this paper the method of approximate expansion is used to analyse a perfect planar surround sound system, resulting in a series of new and upgradable systems. First, the reproduction signals of the perfect system and the characteristics of systems of different orders are analysed. The independent transmission signals and the decoding (reproduction) equation of the systems are given. The compatibility among systems of different orders and the problem of simplifying output channels are discussed. The problems of signal pickup, recording, and transmission, and the possibility of putting the systems into practical use, are studied. A sound image localization experiment for the systems is carried out in order to study image localization in relation to the numbers of transmission signals and output channels. The experimental result is consistent with the theoretical result. This work lays a foundation for practical use.
Keywords: Stereophonic; Surround sound; sound image localization
Distributed sound source localization algorithm with sound velocity calibration in windy environments (cited by: 3)
8
Authors: YAN Qingli, CHEN Jianfeng. Chinese Journal of Acoustics (CSCD), 2018, Issue 1, pp. 35-44, 10 pages.
A new sound source localization method with sound speed compensation is proposed to reduce the influence of wind on the performance of conventional TDOA (Time Difference of Arrival) algorithms. First, the sound speed is described as a set of functions of the unknown source location, to approximate the acoustic velocity field distribution in the wind field. Then, these functions are introduced into the TDOA algorithm to construct nonlinear equations. Finally, the particle swarm optimization algorithm is used to estimate the source location. The simulation results show that the proposed algorithm can significantly improve the localization accuracy for different wind velocities, source locations and test area sizes. The experimental results show that the proposed method can reduce localization errors to about 40% of the original error in a four-node localization system.
Keywords: Distributed sound source localization algorithm with sound velocity calibration in windy environments
Interchannel phase difference and stereo sound image localization
9
Author: XIE Bosun (Applied Physics Dept., South China University of Technology, Guangzhou 510641). Chinese Journal of Acoustics, 1998, Issue 1, pp. 85-93, 9 pages.
By considering a higher-order approximation to the interaural phase difference, a more general localization equation for the stereo sound image with interchannel phase difference is derived. At very low frequency or low interchannel phase difference, the equation can be simplified to the Makita theory. In general, image position is obviously affected by frequency. It is shown that image position varying with frequency is the main reason for image width broadening in stereo reproduction with interchannel phase difference, and that an extra interaural sound level difference caused by interchannel phase difference is the main reason for the degradation of image naturalness. In practice, it is necessary to reduce the interchannel phase difference to less than 60° at least.
Keywords: KHZ; Interchannel phase difference and stereo sound image localization