Similar to air reverberation chambers, non-anechoic water tanks are important acoustic measurement devices that can be used to measure the sound power radiated from complex underwater sound sources using diffusion fie...Similar to air reverberation chambers, non-anechoic water tanks are important acoustic measurement devices that can be used to measure the sound power radiated from complex underwater sound sources using diffusion field theory. However,the problem of the poor applicability of low-frequency measurements in these tanks has not yet been solved. Therefore,we propose a low-frequency acoustic measurement method based on sound-field correction(SFC) in an enclosed space that effectively solves the problem of measuring the sound power from complex sound sources below the Schroeder cutoff frequency in a non-anechoic tank. Using normal mode theory, the transfer relationship between the mean-square sound pressure in an underwater enclosed space and the free-field sound power of the sound source is established, and this is regarded as a correction term for the sound field between this enclosed space and the free field. This correction term can be obtained based on previous measurements of a known sound source. This term can then be used to correct the mean-square sound pressure excited by any sound source to be tested in this enclosed space and equivalently obtain its free-field sound power. Experiments were carried out in a non-anechoic water tank(9.0 m × 3.1 m × 1.7 m) to confirm the validity of the SFC method. Through measurements with a spherical sound source(whose free-field radiation characteristics are known),the correction term of the sound field between this water tank and the free field was obtained. On this basis, the sound power radiated from a cylindrical shell model under the action of mechanical excitation was measured. The measurement results were found to have a maximum deviation of 2.9 d B from the free-field results. These results show that the SFC method has good applicability in the frequency band above the first-order resonant frequency in a non-anechoic tank. This greatly expands the potential low-frequency applications of non-anechoic tanks.展开更多
The time-domain inverse technique based on the time-domain rotating equivalent source method has been proposed to localize and quantify rotating sound sources. However, this technique encounters two problems to be add...The time-domain inverse technique based on the time-domain rotating equivalent source method has been proposed to localize and quantify rotating sound sources. However, this technique encounters two problems to be addressed: one is the time-consuming process of solving the transcendental equation at each time step, and the other is the difculty of controlling the instability problem due to the time-varying transfer matrix. In view of that, an improved technique is proposed in this paper to resolve these two problems. In the improved technique, a de-Dopplerization method in the time-domain rotating reference frame is frst applied to eliminate the Doppler efect caused by the source rotation in the measured pressure signals, and then the restored pressure signals without the Doppler efect are used as the inputs of the time-domain stationary equivalent source method to locate and quantify sound sources. Compared with the original technique, the improved technique can avoid solving the transcendental equation at each time step, and facilitate the treatment of the instability problem because the transfer matrix does not change with time. Numerical simulation and experimental results show that the improved technique can eliminate the Doppler efect efectively, and then localize and quantify the rotating nonstationary or broadband sources accurately. The results also demonstrate that the improved technique can guarantee a more stable reconstruction and compute more efciently than the original one.展开更多
Acoustic signals from diesel engines contain useful information but also include considerable noise components To extract information for condition monitoring purposes, continuous wavelet transform (CWT) is used for t...Acoustic signals from diesel engines contain useful information but also include considerable noise components To extract information for condition monitoring purposes, continuous wavelet transform (CWT) is used for the characterization of engine acoustics. This paper first reviews CWT characteristics represented by short duration transient signals. Wavelet selection and CWT are then implemented and wavelet transform is used to analyze the major sources of the engine front's exterior radiation sound. The research provides a reliable basis for engineering practice to reduce vehicle sound level. Comparison of the identification results of the measured acoustic signals with the identification results of the measured surface vibration showed good agreement.展开更多
Elastic wave on seafloor caused by low frequency noise radiated from ship is called ship seismic wave which can be used to identify ship target. In order to analyze the wave components and the propagating properties o...Elastic wave on seafloor caused by low frequency noise radiated from ship is called ship seismic wave which can be used to identify ship target. In order to analyze the wave components and the propagating properties of ship seismic wave, the numerical calculation of synthetic seismograms on seafloor aroused by a low frequency point sound source is carried out using a wave number integration technique combined with inverse Fourier transform. According to the numerical example of hard seafloor, the time series of seismic wave on seafloor are mostly composed of interface waves and normal mode waves. Each normal mode wave has a well defined low cut-off frequency, while the interface wave doesn't have. The frequency dispersion of normal mode wave is obvious when frequency is lower than 100Hz, while the interface wave is dispersive only in the infra-sound frequency range. The time series of seismic wave is dominated by the interface wave when the source frequency is less than the minimal cut-off frequency of normal mode wave.展开更多
Microphone array-based sound source localization(SSL)is a challenging task in adverse acoustic scenarios.To address this,a novel SSL algorithm based on deep neural network(DNN)using steered response power-phase transf...Microphone array-based sound source localization(SSL)is a challenging task in adverse acoustic scenarios.To address this,a novel SSL algorithm based on deep neural network(DNN)using steered response power-phase transform(SRP-PHAT)spatial spectrum as input feature is presented in this paper.Since the SRP-PHAT spatial power spectrum contains spatial location information,it is adopted as the input feature for sound source localization.DNN is exploited to extract the efficient location information from SRP-PHAT spatial power spectrum due to its advantage on extracting high-level features.SRP-PHAT at each steering position within a frame is arranged into a vector,which is treated as DNN input.A DNN model which can map the SRP-PHAT spatial spectrum to the azimuth of sound source is learned from the training signals.The azimuth of sound source is estimated through trained DNN model from the testing signals.Experiment results demonstrate that the proposed algorithm significantly improves localization performance whether the training and testing condition setup are the same or not,and is more robust to noise and reverberation.展开更多
Tracking moving wideband sound sources is one of the most challenging issues in the acoustic array signal processing which is based on the direction of arrival(DOA) estimation. Compressive sensing(CS) is a recent theo...Tracking moving wideband sound sources is one of the most challenging issues in the acoustic array signal processing which is based on the direction of arrival(DOA) estimation. Compressive sensing(CS) is a recent theory exploring the signal sparsity representation, which has been proved to be superior for the DOA estimation. However, the spatial aliasing and the offset at endfire are the main obstacles for CS applied in the wideband DOA estimation. We propose a particle filter based compressive sensing method for tracking moving wideband sound sources. First, the initial DOA estimates are obtained by wideband CS algorithms. Then, the real sources are approximated by a set of particles with different weights assigned. The kernel density estimator is used as the likelihood function of particle filter. We present the results for both uniform and random linear array. Simulation results show that the spatial aliasing is disappeared and the offset at endfire is reduced. We show that the proposed method can achieve satisfactory tracking performance regardless of using uniform or random linear array.展开更多
The Steered Response Power(SRP)method works well for sound source localization in noisy and reverberant environment.However,the large computation complexity limits its practical application.In this paper,a fast SRP se...The Steered Response Power(SRP)method works well for sound source localization in noisy and reverberant environment.However,the large computation complexity limits its practical application.In this paper,a fast SRP search method is proposed to reduce the computational complexity using small-aperture microphone array.The proposed method inspired by the SRP spatial spectrum includes two steps:first,the proposed method estimates the azimuth of the sound source roughly and determines whether the sound source is in far field or near field;then,different fine searching operations are performed according to the sound source being in far field or near field.Experiments both in simulation environments and real environments have been performed to compare the localization accuracy and computation complexity of the proposed method with those of the conventional SRP-PHAT algorithm.The results show that,the proposed method has a comparative accuracy with the conventional SRP algorithm,and achieves a reduction of 93.62%in computation complexity compared to the conventional SRP algorithm.展开更多
Microphone array-based sound source localization(SSL)is widely used in a variety of occasions such as video conferencing,robotic hearing,speech enhancement,speech recognition and so on.The traditional SSL methods cann...Microphone array-based sound source localization(SSL)is widely used in a variety of occasions such as video conferencing,robotic hearing,speech enhancement,speech recognition and so on.The traditional SSL methods cannot achieve satisfactory performance in adverse noisy and reverberant environments.In order to improve localization performance,a novel SSL algorithm using convolutional residual network(CRN)is proposed in this paper.The spatial features including time difference of arrivals(TDOAs)between microphone pairs and steered response power-phase transform(SRPPHAT)spatial spectrum are extracted in each Gammatone sub-band.The spatial features of different sub-bands with a frame are combine into a feature matrix as the input of CRN.The proposed algorithm employ CRN to fuse the spatial features.Since the CRN introduces the residual structure on the basis of the convolutional network,it reduce the difficulty of training procedure and accelerate the convergence of the model.A CRN model is learned from the training data in various reverberation and noise environments to establish the mapping regularity between the input feature and the sound azimuth.Through simulation verification,compared with the methods using traditional deep neural network,the proposed algorithm can achieve a better localization performance in SSL task,and provide better generalization capacity to untrained noise and reverberation.展开更多
Focused underwater plasma sound sources are being applied in more and more fields. Focusing performance is one of the most important factors determining transmission distance and peak values of the pulsed sound waves....Focused underwater plasma sound sources are being applied in more and more fields. Focusing performance is one of the most important factors determining transmission distance and peak values of the pulsed sound waves. The sound source’s components and focusing mechanism were all analyzed. A model was built in 3D Max and wave strength was measured on the simulation platform. Error analysis was fully integrated into the model so that effects on sound focusing performance of processing-errors and installation-errors could be studied. Based on what was practical, ways to limit the errors were proposed. The results of the error analysis should guide the design, machining, placement, debugging and application of underwater plasma sound sources.展开更多
The letter proposed a sound source localization method of digital hearing aids using wavelet based multivariate statistics with the Generalized Cross Correlation (GCC) algorithm. Haar wavelet is used to decompose GCC ...The letter proposed a sound source localization method of digital hearing aids using wavelet based multivariate statistics with the Generalized Cross Correlation (GCC) algorithm. Haar wavelet is used to decompose GCC sequences and extract four wavelet characteristics. And then, Hotelling T2 statistical method is used to fuse the four wavelet characteristics. The statistical value is used to judge the number of sound sources and obtain corresponding time delay estimation which is used to localize the position of sound source. The experimental results show that the proposed method has better robustness in an environment with severe noise and reverberation. Meanwhile, the complexity of al-gorithm is moderate, which is available for sound source localization of hearing aids.展开更多
In this paper, the measurement of an aerodynamic sound source for a semi-circular cylinder in a uniform flow is described using Particle Image Velocimetry (PIV). This experimental technique is based on vortex sound th...In this paper, the measurement of an aerodynamic sound source for a semi-circular cylinder in a uniform flow is described using Particle Image Velocimetry (PIV). This experimental technique is based on vortex sound theory, where the time derivative of vorticity is evaluated with the aid of two sets of standard PIV systems. The experimental results indicate that the sound source for the semi-circular cylinder is located around the shear layer near the edge of the semi-circular cylinder. The sound source intensity and the area are reduced in the semi-circular cylinder compared with those of a circular cylinder. This result indicates that the aerodynamic sound of the semi- circular cylinder is smaller than that of the circular cylinder, which supports the microphone measurement result.展开更多
The purpose of this study is to develop a system that enables location finding of a small sound. The location finding of a small sound has some difficulties such as high computational costs or disturbances from the am...The purpose of this study is to develop a system that enables location finding of a small sound. The location finding of a small sound has some difficulties such as high computational costs or disturbances from the ambient noises and reflected waves. The proposed system is composed of a biologically-inspired system which uses a hearing mechanism based on the human ear and a mechanism for perceiving weak signals that uses stochastic resonance. The location finding mechanism in the proposed system is based on the time-lag detecting architecture. On the other hand, the stochastic resonance mechanism can pick up the small sound source in the ambient noises. Using this proposed system, we implemented the location finding of small sounds through numerical simulations and hardware experiments. Good results were obtained for the small sound source location finding.展开更多
It is reported that some types of insects have a remarkable ability to detect the direction of an incident sound even though its acoustic sensory organs are in very close proximity each other. Maybe the ears are joint...It is reported that some types of insects have a remarkable ability to detect the direction of an incident sound even though its acoustic sensory organs are in very close proximity each other. Maybe the ears are jointed by a cuticular structure with which the separated motions can be coupled mechanically and thus be magnified. In this paper, a detailed model is setup to describe the principle of this type of localization using a mechanical coupled structure. The transfer functions and the responses of the model in terms of time and frequency are analyzed to describe the mechanism of its ability of directional hearing. This analytical model provides a method to design the experimental model for the predetermined incident sound pressure, and the analysis of this model shows that this structure have the ability to determine the direction of the incident stimulus.展开更多
Purpose: To analyze the effect of right versus left long-term single-sided deafness(SSD) on sound source localization(SSL), discuss the necessity of intervention and treatment for SSD patients, and analyze the therape...Purpose: To analyze the effect of right versus left long-term single-sided deafness(SSD) on sound source localization(SSL), discuss the necessity of intervention and treatment for SSD patients, and analyze the therapeutic effect of long-term unilateral cochlear implantation(UCI) from the perspective of SSL.Methods: This study included 25 patients with SSD, 11 patients with UCI, and 30 participants with normal hearing(NH). Their SSL ability was tested by obtaining their average root mean square(RMS) error values of SSL test.Results: The results showed that the RMS error value of SSD, UCI and NH groups were 52.26 ± 20.25°, 69.84 ±12.14° and 4.27 ± 2.66°, respectively. The ability of SSL was better in the SSD-L group than that in the SSD-R group, and no significant difference existed in the SSD-R and the UCI group.Conclusion: When bilateral deafness patients select unilateral treatment, right-side cochlear implantation may be more beneficial in terms of SSL, which means that the central auditory cortex in long-term SSD patients is affected differently based on which side their deafness occurs.展开更多
The steered response power-phase transform (SRP-PHAT) sound source localization algorithm is robust in a real environment. However, the large computation complexity limits the practical application of SRP-PHAT. For a ...The steered response power-phase transform (SRP-PHAT) sound source localization algorithm is robust in a real environment. However, the large computation complexity limits the practical application of SRP-PHAT. For a microphone array, each location corresponds to a set of time differences of arrival (TDOAs), and this paper collects them into a TDOA vector. Since the TDOA vectors in the adjacent regions are similar, we present a fast algorithm based on clustering search to reduce the computation complexity of SRP-PHAT. In the training stage, the K-means or Iterative Self-Organizing Data Analysis Technique (ISODATA) clustering algorithm is used to find the centroid in each cluster with similar TDOA vectors. In the procedure of sound localization, the optimal cluster is found by comparing the steered response powers (SRPs) of all centroids. The SRPs of all candidate locations in the optimal cluster are compared to localize the sound source. Experiments both in simulation environments and real environments have been performed to compare the localization accuracy and computational load of the proposed method with those of the conventional SRP-PHAT algorithm. The results show that the proposed method is able to reduce the computational load drastically and maintains almost the same localization accuracy and robustness as those of the conventional SRP-PHAT algorithm. The difference in localization performance brought by different clustering algorithms used in the training stage is trivial.展开更多
Underwater target localization and parameters(azimuth and range) estimation by the method of utilizing explosions as underwater sound sources are described in this paper.The narrow beam reverberation model of the targ...Underwater target localization and parameters(azimuth and range) estimation by the method of utilizing explosions as underwater sound sources are described in this paper.The narrow beam reverberation model of the target echo signal is researched to estimate the target azimuth in reverberation background.Estimation errors of target azimuth and range are studied and proved to approximately meet Gauss distribution.Then the variance formula of target range error is deduced.Simulation experiments are applied to research the target range error and its standard deviation,and a series of measures to improve the estimation accuracy of target range are proposed.It is confirmed by the data processing results of simulations and lake experiments that the proposed method can accurately locate underwater target at a long distance on the condition of a certain underwater explosion range error.展开更多
This paper presents the method named acoustic holography which can be used to identify noise sources. A new formula of holography reconstruction is obtained, based on the Kirchhoff integral formula. Some simulating te...This paper presents the method named acoustic holography which can be used to identify noise sources. A new formula of holography reconstruction is obtained, based on the Kirchhoff integral formula. Some simulating tests are carried out using the new formula. The comparison with other reconstruction formulas proves that the new formula is more effective. By using acoustic holography method, some interesting results about the noise of a vehicle are shown. The results proves that acoustic holography is an effective method for the identification of the complex noise sources.展开更多
A new sound source localization method with sound speed compensation is proposed to reduce the wind influence on the performance of conventional TDOA (Time Difference of Arrival) algorithms. First, the sound speed i...A new sound source localization method with sound speed compensation is proposed to reduce the wind influence on the performance of conventional TDOA (Time Difference of Arrival) algorithms. First, the sound speed is described as a set of functions of the unknown source location, to approximate the acoustic velocity field distribution in the wind field. Then, they are introduced into the TDOA algorithm, to construct nonlinear equations. Finally, the particle swarm optimization algorithm is used to estimate the source location. The simulation results show that the proposed algorithm can significantly improve the localization accuracy for different wind velocities, source locations and test area sizes. The experimental results show that the proposed method can reduce localization errors to about 40% of the original error in a four nodes localization system.展开更多
A head-related transfer function (HRTF) model for fast and real-time synthesizing multiple virtual sound sources is proposed. A head-related impulse response (HRIR, time- domain version of HRTF) is first decompose...A head-related transfer function (HRTF) model for fast and real-time synthesizing multiple virtual sound sources is proposed. A head-related impulse response (HRIR, time- domain version of HRTF) is first decomposed by a two-level wavelet packet and then represented by a model composed of subband filters and reconstruction filters. The coefficients of the subband filters are the zero interpolation of the wavelet coefficients of the HRIR. The coefficients of the reconstruction filters can be calculated from the wavelet function. The model is simplified by applying a threshold method to reduce the wavelet coefficients. The calculated results indicate that for a model with 30 wavelet coefficients, the error of reconstructed HRIR is about 1%. And the result of a psychoacoustic test shows that a model with 35 wavelet coefficients is perceptually indistinguishable from the original HRIR. When multiple virtual sound sources are synthesized simultaneously, the computational cost of the proposed model is much less than the traditional HRTF filters.展开更多
Based on the problem that the generating method of random array structure is inefficient, a method is proposed to generate the random target arrays by using coaxial circu- lar array in the polar coordinates in the pre...Based on the problem that the generating method of random array structure is inefficient, a method is proposed to generate the random target arrays by using coaxial circu- lar array in the polar coordinates in the premise that the array angular resolution of source identification is guaranteed. According to the principle of moving sound source identification, this work deduces the basic non-equidistance coaxial circular rings' radius, and generates target random arrays which were suitable for moving sound source identification through array partitioning, condition filtering in the polar coordinates and simulation evaluation. Finally, numerical simulation and moving car sound source identification test have been done. The analytical results show that using this method to generate random array is effective. Compared with the traditional regular arrays, the target random array has more accurate moving sound source identification performance.展开更多
基金the National Natural Science Foundation of China (Grant No. 11874131)Open Fund Project of Key Laboratory of Underwater Acoustic Countermeasures Technology (Grant No. 2021-JCJQ-LB033-05)。
文摘Similar to air reverberation chambers, non-anechoic water tanks are important acoustic measurement devices that can be used to measure the sound power radiated from complex underwater sound sources using diffusion field theory. However,the problem of the poor applicability of low-frequency measurements in these tanks has not yet been solved. Therefore,we propose a low-frequency acoustic measurement method based on sound-field correction(SFC) in an enclosed space that effectively solves the problem of measuring the sound power from complex sound sources below the Schroeder cutoff frequency in a non-anechoic tank. Using normal mode theory, the transfer relationship between the mean-square sound pressure in an underwater enclosed space and the free-field sound power of the sound source is established, and this is regarded as a correction term for the sound field between this enclosed space and the free field. This correction term can be obtained based on previous measurements of a known sound source. This term can then be used to correct the mean-square sound pressure excited by any sound source to be tested in this enclosed space and equivalently obtain its free-field sound power. Experiments were carried out in a non-anechoic water tank(9.0 m × 3.1 m × 1.7 m) to confirm the validity of the SFC method. Through measurements with a spherical sound source(whose free-field radiation characteristics are known),the correction term of the sound field between this water tank and the free field was obtained. On this basis, the sound power radiated from a cylindrical shell model under the action of mechanical excitation was measured. The measurement results were found to have a maximum deviation of 2.9 d B from the free-field results. These results show that the SFC method has good applicability in the frequency band above the first-order resonant frequency in a non-anechoic tank. This greatly expands the potential low-frequency applications of non-anechoic tanks.
基金Supported by National Natural Science Foundation of China(Grant Nos.51875147,12174082,51675149)。
文摘The time-domain inverse technique based on the time-domain rotating equivalent source method has been proposed to localize and quantify rotating sound sources. However, this technique encounters two problems to be addressed: one is the time-consuming process of solving the transcendental equation at each time step, and the other is the difculty of controlling the instability problem due to the time-varying transfer matrix. In view of that, an improved technique is proposed in this paper to resolve these two problems. In the improved technique, a de-Dopplerization method in the time-domain rotating reference frame is frst applied to eliminate the Doppler efect caused by the source rotation in the measured pressure signals, and then the restored pressure signals without the Doppler efect are used as the inputs of the time-domain stationary equivalent source method to locate and quantify sound sources. Compared with the original technique, the improved technique can avoid solving the transcendental equation at each time step, and facilitate the treatment of the instability problem because the transfer matrix does not change with time. Numerical simulation and experimental results show that the improved technique can eliminate the Doppler efect efectively, and then localize and quantify the rotating nonstationary or broadband sources accurately. The results also demonstrate that the improved technique can guarantee a more stable reconstruction and compute more efciently than the original one.
基金Project (No. 50175078) supported by the National Natural Science Foundation of China
文摘Acoustic signals from diesel engines contain useful information but also include considerable noise components To extract information for condition monitoring purposes, continuous wavelet transform (CWT) is used for the characterization of engine acoustics. This paper first reviews CWT characteristics represented by short duration transient signals. Wavelet selection and CWT are then implemented and wavelet transform is used to analyze the major sources of the engine front's exterior radiation sound. The research provides a reliable basis for engineering practice to reduce vehicle sound level. Comparison of the identification results of the measured acoustic signals with the identification results of the measured surface vibration showed good agreement.
基金Sponsored by National Nature Science Foundation of China ( 51179195)National Defense Foundation of China ( 513030203-02)
文摘Elastic wave on seafloor caused by low frequency noise radiated from ship is called ship seismic wave which can be used to identify ship target. In order to analyze the wave components and the propagating properties of ship seismic wave, the numerical calculation of synthetic seismograms on seafloor aroused by a low frequency point sound source is carried out using a wave number integration technique combined with inverse Fourier transform. According to the numerical example of hard seafloor, the time series of seismic wave on seafloor are mostly composed of interface waves and normal mode waves. Each normal mode wave has a well defined low cut-off frequency, while the interface wave doesn't have. The frequency dispersion of normal mode wave is obvious when frequency is lower than 100Hz, while the interface wave is dispersive only in the infra-sound frequency range. The time series of seismic wave is dominated by the interface wave when the source frequency is less than the minimal cut-off frequency of normal mode wave.
基金This work is supported by the National Nature Science Foundation of China(NSFC)under Grant No.61571106Jiangsu Natural Science Foundation under Grant No.BK20170757the Natural Science Foundation of the Jiangsu Higher Education Institutions of China under grant No.17KJD510002.
文摘Microphone array-based sound source localization(SSL)is a challenging task in adverse acoustic scenarios.To address this,a novel SSL algorithm based on deep neural network(DNN)using steered response power-phase transform(SRP-PHAT)spatial spectrum as input feature is presented in this paper.Since the SRP-PHAT spatial power spectrum contains spatial location information,it is adopted as the input feature for sound source localization.DNN is exploited to extract the efficient location information from SRP-PHAT spatial power spectrum due to its advantage on extracting high-level features.SRP-PHAT at each steering position within a frame is arranged into a vector,which is treated as DNN input.A DNN model which can map the SRP-PHAT spatial spectrum to the azimuth of sound source is learned from the training signals.The azimuth of sound source is estimated through trained DNN model from the testing signals.Experiment results demonstrate that the proposed algorithm significantly improves localization performance whether the training and testing condition setup are the same or not,and is more robust to noise and reverberation.
基金supported by the NFSC Grants 51375385 and 51675425Natural Science Basic Research Plan in Shaanxi Province of China Grants 2016JZ013
文摘Tracking moving wideband sound sources is one of the most challenging issues in the acoustic array signal processing which is based on the direction of arrival(DOA) estimation. Compressive sensing(CS) is a recent theory exploring the signal sparsity representation, which has been proved to be superior for the DOA estimation. However, the spatial aliasing and the offset at endfire are the main obstacles for CS applied in the wideband DOA estimation. We propose a particle filter based compressive sensing method for tracking moving wideband sound sources. First, the initial DOA estimates are obtained by wideband CS algorithms. Then, the real sources are approximated by a set of particles with different weights assigned. The kernel density estimator is used as the likelihood function of particle filter. We present the results for both uniform and random linear array. Simulation results show that the spatial aliasing is disappeared and the offset at endfire is reduced. We show that the proposed method can achieve satisfactory tracking performance regardless of using uniform or random linear array.
基金Supported by the National Natural Science Foundation of China(No.61201345)the Beijing Key Laboratory of Advanced Information Science and Network Technology(No.XDXX1308)
文摘The Steered Response Power(SRP)method works well for sound source localization in noisy and reverberant environment.However,the large computation complexity limits its practical application.In this paper,a fast SRP search method is proposed to reduce the computational complexity using small-aperture microphone array.The proposed method inspired by the SRP spatial spectrum includes two steps:first,the proposed method estimates the azimuth of the sound source roughly and determines whether the sound source is in far field or near field;then,different fine searching operations are performed according to the sound source being in far field or near field.Experiments both in simulation environments and real environments have been performed to compare the localization accuracy and computation complexity of the proposed method with those of the conventional SRP-PHAT algorithm.The results show that,the proposed method has a comparative accuracy with the conventional SRP algorithm,and achieves a reduction of 93.62%in computation complexity compared to the conventional SRP algorithm.
基金supported by Nature Science Research Project of Higher Education Institutions in Jiangsu Province under Grant No.21KJB510018National Nature Science Foundation of China (NSFC)under Grant No.62001215.
文摘Microphone array-based sound source localization(SSL)is widely used in a variety of occasions such as video conferencing,robotic hearing,speech enhancement,speech recognition and so on.The traditional SSL methods cannot achieve satisfactory performance in adverse noisy and reverberant environments.In order to improve localization performance,a novel SSL algorithm using convolutional residual network(CRN)is proposed in this paper.The spatial features including time difference of arrivals(TDOAs)between microphone pairs and steered response power-phase transform(SRPPHAT)spatial spectrum are extracted in each Gammatone sub-band.The spatial features of different sub-bands with a frame are combine into a feature matrix as the input of CRN.The proposed algorithm employ CRN to fuse the spatial features.Since the CRN introduces the residual structure on the basis of the convolutional network,it reduce the difficulty of training procedure and accelerate the convergence of the model.A CRN model is learned from the training data in various reverberation and noise environments to establish the mapping regularity between the input feature and the sound azimuth.Through simulation verification,compared with the methods using traditional deep neural network,the proposed algorithm can achieve a better localization performance in SSL task,and provide better generalization capacity to untrained noise and reverberation.
基金Supported by the National Natural Science Foundation under Grant No.60572098
文摘Focused underwater plasma sound sources are being applied in more and more fields. Focusing performance is one of the most important factors determining transmission distance and peak values of the pulsed sound waves. The sound source’s components and focusing mechanism were all analyzed. A model was built in 3D Max and wave strength was measured on the simulation platform. Error analysis was fully integrated into the model so that effects on sound focusing performance of processing-errors and installation-errors could be studied. Based on what was practical, ways to limit the errors were proposed. The results of the error analysis should guide the design, machining, placement, debugging and application of underwater plasma sound sources.
基金Supported by the National Natural Science Foundation of China (No. 60472058, No. 60975017)Jiangsu Provincial Natural Science Foundation (No. BK2008291)
文摘The letter proposed a sound source localization method of digital hearing aids using wavelet based multivariate statistics with the Generalized Cross Correlation (GCC) algorithm. Haar wavelet is used to decompose GCC sequences and extract four wavelet characteristics. And then, Hotelling T2 statistical method is used to fuse the four wavelet characteristics. The statistical value is used to judge the number of sound sources and obtain corresponding time delay estimation which is used to localize the position of sound source. The experimental results show that the proposed method has better robustness in an environment with severe noise and reverberation. Meanwhile, the complexity of al-gorithm is moderate, which is available for sound source localization of hearing aids.
文摘In this paper, the measurement of an aerodynamic sound source for a semi-circular cylinder in a uniform flow is described using Particle Image Velocimetry (PIV). This experimental technique is based on vortex sound theory, where the time derivative of vorticity is evaluated with the aid of two sets of standard PIV systems. The experimental results indicate that the sound source for the semi-circular cylinder is located around the shear layer near the edge of the semi-circular cylinder. The sound source intensity and the area are reduced in the semi-circular cylinder compared with those of a circular cylinder. This result indicates that the aerodynamic sound of the semi- circular cylinder is smaller than that of the circular cylinder, which supports the microphone measurement result.
文摘The purpose of this study is to develop a system that enables location finding of a small sound. The location finding of a small sound has some difficulties such as high computational costs or disturbances from the ambient noises and reflected waves. The proposed system is composed of a biologically-inspired system which uses a hearing mechanism based on the human ear and a mechanism for perceiving weak signals that uses stochastic resonance. The location finding mechanism in the proposed system is based on the time-lag detecting architecture. On the other hand, the stochastic resonance mechanism can pick up the small sound source in the ambient noises. Using this proposed system, we implemented the location finding of small sounds through numerical simulations and hardware experiments. Good results were obtained for the small sound source location finding.
基金National Natural Science Foundation of China and Science and Technology Foundation of Shanghai Jiao Tong UniversityGrant number:50375094 /E05010705
文摘It is reported that some types of insects have a remarkable ability to detect the direction of an incident sound even though its acoustic sensory organs are in very close proximity each other. Maybe the ears are jointed by a cuticular structure with which the separated motions can be coupled mechanically and thus be magnified. In this paper, a detailed model is setup to describe the principle of this type of localization using a mechanical coupled structure. The transfer functions and the responses of the model in terms of time and frequency are analyzed to describe the mechanism of its ability of directional hearing. This analytical model provides a method to design the experimental model for the predetermined incident sound pressure, and the analysis of this model shows that this structure have the ability to determine the direction of the incident stimulus.
基金supported by the National Key Research and Development Project of China(2020YFC20052003 to S.M.Yang)Key International(Regional)Joint Research Program of National Natural Science Foundation of China(NSFC#81820108009 to S.M.Yang)National Natural Science Foundation of China(NSFC#82000976 to J.N.Li).
文摘Purpose: To analyze the effect of right versus left long-term single-sided deafness(SSD) on sound source localization(SSL), discuss the necessity of intervention and treatment for SSD patients, and analyze the therapeutic effect of long-term unilateral cochlear implantation(UCI) from the perspective of SSL.Methods: This study included 25 patients with SSD, 11 patients with UCI, and 30 participants with normal hearing(NH). Their SSL ability was tested by obtaining their average root mean square(RMS) error values of SSL test.Results: The results showed that the RMS error value of SSD, UCI and NH groups were 52.26 ± 20.25°, 69.84 ±12.14° and 4.27 ± 2.66°, respectively. The ability of SSL was better in the SSD-L group than that in the SSD-R group, and no significant difference existed in the SSD-R and the UCI group.Conclusion: When bilateral deafness patients select unilateral treatment, right-side cochlear implantation may be more beneficial in terms of SSL, which means that the central auditory cortex in long-term SSD patients is affected differently based on which side their deafness occurs.
基金supported by the National Natural Science Foundation of China(Grant Nos. 60971098 and 61201345)the Beijing Key Laboratory of Advanced Information Science and Network Technology(Grant No.XDXX1308)
文摘The steered response power-phase transform (SRP-PHAT) sound source localization algorithm is robust in a real environment. However, the large computation complexity limits the practical application of SRP-PHAT. For a microphone array, each location corresponds to a set of time differences of arrival (TDOAs), and this paper collects them into a TDOA vector. Since the TDOA vectors in the adjacent regions are similar, we present a fast algorithm based on clustering search to reduce the computation complexity of SRP-PHAT. In the training stage, the K-means or Iterative Self-Organizing Data Analysis Technique (ISODATA) clustering algorithm is used to find the centroid in each cluster with similar TDOA vectors. In the procedure of sound localization, the optimal cluster is found by comparing the steered response powers (SRPs) of all centroids. The SRPs of all candidate locations in the optimal cluster are compared to localize the sound source. Experiments both in simulation environments and real environments have been performed to compare the localization accuracy and computational load of the proposed method with those of the conventional SRP-PHAT algorithm. The results show that the proposed method is able to reduce the computational load drastically and maintains almost the same localization accuracy and robustness as those of the conventional SRP-PHAT algorithm. The difference in localization performance brought by different clustering algorithms used in the training stage is trivial.
基金supported by the National Natural Science Foundation of China(61431020,61571434)
文摘Underwater target localization and parameters(azimuth and range) estimation by the method of utilizing explosions as underwater sound sources are described in this paper.The narrow beam reverberation model of the target echo signal is researched to estimate the target azimuth in reverberation background.Estimation errors of target azimuth and range are studied and proved to approximately meet Gauss distribution.Then the variance formula of target range error is deduced.Simulation experiments are applied to research the target range error and its standard deviation,and a series of measures to improve the estimation accuracy of target range are proposed.It is confirmed by the data processing results of simulations and lake experiments that the proposed method can accurately locate underwater target at a long distance on the condition of a certain underwater explosion range error.
基金This work wassupportedby the Natural Science Foundation of China(No.59775020).
文摘This paper presents the method named acoustic holography which can be used to identify noise sources. A new formula of holography reconstruction is obtained, based on the Kirchhoff integral formula. Some simulating tests are carried out using the new formula. The comparison with other reconstruction formulas proves that the new formula is more effective. By using acoustic holography method, some interesting results about the noise of a vehicle are shown. The results proves that acoustic holography is an effective method for the identification of the complex noise sources.
基金supported by the National Natural Science Fundation of China(61501374)Underwater Information and Control Key Laboratory Fundation(9140C230310150C23102)
文摘A new sound source localization method with sound speed compensation is proposed to reduce the wind influence on the performance of conventional TDOA (Time Difference of Arrival) algorithms. First, the sound speed is described as a set of functions of the unknown source location, to approximate the acoustic velocity field distribution in the wind field. Then, they are introduced into the TDOA algorithm, to construct nonlinear equations. Finally, the particle swarm optimization algorithm is used to estimate the source location. The simulation results show that the proposed algorithm can significantly improve the localization accuracy for different wind velocities, source locations and test area sizes. The experimental results show that the proposed method can reduce localization errors to about 40% of the original error in a four nodes localization system.
基金supported by the National Nature Science Fund of China(50938003,10774049)State Key Lab of Subtropical Building Science,South China University of Technology
文摘A head-related transfer function (HRTF) model for fast and real-time synthesizing multiple virtual sound sources is proposed. A head-related impulse response (HRIR, time- domain version of HRTF) is first decomposed by a two-level wavelet packet and then represented by a model composed of subband filters and reconstruction filters. The coefficients of the subband filters are the zero interpolation of the wavelet coefficients of the HRIR. The coefficients of the reconstruction filters can be calculated from the wavelet function. The model is simplified by applying a threshold method to reduce the wavelet coefficients. The calculated results indicate that for a model with 30 wavelet coefficients, the error of reconstructed HRIR is about 1%. And the result of a psychoacoustic test shows that a model with 35 wavelet coefficients is perceptually indistinguishable from the original HRIR. When multiple virtual sound sources are synthesized simultaneously, the computational cost of the proposed model is much less than the traditional HRTF filters.
基金supported by the National Natural Science Foundation of China(61271387)the Natural Science Foundation of Shandong Province(ZR2012FZ001)
文摘Based on the problem that the generating method of random array structure is inefficient, a method is proposed to generate the random target arrays by using coaxial circu- lar array in the polar coordinates in the premise that the array angular resolution of source identification is guaranteed. According to the principle of moving sound source identification, this work deduces the basic non-equidistance coaxial circular rings' radius, and generates target random arrays which were suitable for moving sound source identification through array partitioning, condition filtering in the polar coordinates and simulation evaluation. Finally, numerical simulation and moving car sound source identification test have been done. The analytical results show that using this method to generate random array is effective. Compared with the traditional regular arrays, the target random array has more accurate moving sound source identification performance.