Introduction: Located in the central-western part of Côte d’Ivoire, the subsoil of the Gagnoa region is made up of sedimentary volcano formations and granitoids with developed fracturing. This complex Precambria...Introduction: Located in the central-western part of Côte d’Ivoire, the subsoil of the Gagnoa region is made up of sedimentary volcano formations and granitoids with developed fracturing. This complex Precambrian basement contains most of the region’s water resources. This is at the origin of the high failure rate during the various hydrogeological prospecting campaigns. Methodology: The database consists of resistivities from 42 holes and 51 trails drilled as part of the implementation of high-throughput drilling in the study area. The objective of this study is to deepen the knowledge of the fissured basement by interpreting profile curves and electrical soundings. It will be a question of classifying the different types of anomalies obtained on the profiles and their shapes. The orientation of the lineaments observed on the profiles was determined. Results: The interpretation of the geophysical data revealed various anomalies, the main ones being of the CC (Conductor Compartment) and CEDP (Contact between two bearings) types. These types of anomalies are mainly expressed in various forms: the “V”, “W” and “U” shapes. From these anomalies and the appearance of the electrical profiles, lineaments and their orientations were identified with N90-100, N130-140, N170-180 as major orientations. Conclusion: These results could contribute to a better understanding of the fractured environment of the Gagnoa region.展开更多
High-vertical-resolution radiosonde wind data are highly valuable for describing the dynamics of the meso-and microscale atmosphere. However, the current algorithm used in China's L-band radar sounding system for ...High-vertical-resolution radiosonde wind data are highly valuable for describing the dynamics of the meso-and microscale atmosphere. However, the current algorithm used in China's L-band radar sounding system for calculating highvertical-resolution wind vectors excessively smooths the data, resulting in significant underestimation of the calculated kinetic energy of gravity waves compared to similar products from other countries, which greatly limits the effective utilization of the data. To address this issue, this study proposes a novel method to calculate high-vertical-resolution wind vectors that utilizes the elevation angle, azimuth angle, and slant range from L-band radar. In order to obtain wind data with a stable quality, a two-step automatic quality control procedure, including the RMSE-F(root-mean-square error F) test and elemental consistency test are first applied to the slant range data, to eliminate continuous erroneous data caused by unstable signals or radar malfunctions. Then, a wind calculation scheme based on a sliding second-order polynomial fitting is utilized to derive the high-vertical-resolution radiosonde wind vectors. The evaluation results demonstrate that the wind data obtained through the proposed method show a high level of consistency with the high-resolution wind data observed using the Vaisala Global Positioning System and the data observed by the new Beidou Navigation Sounding System. The calculation of the kinetic energy of gravity waves in the recalculated wind data also reaches a level comparable to the Vaisala observations.展开更多
The scarcity of in-situ ocean observations poses a challenge for real-time information acquisition in the ocean.Among the crucial hydroacoustic environmental parameters,ocean sound velocity exhibits significant spatia...The scarcity of in-situ ocean observations poses a challenge for real-time information acquisition in the ocean.Among the crucial hydroacoustic environmental parameters,ocean sound velocity exhibits significant spatial and temporal variability and it is highly relevant to oceanic research.In this study,we propose a new data-driven approach,leveraging deep learning techniques,for the prediction of sound velocity fields(SVFs).Our novel spatiotemporal prediction model,STLSTM-SA,combines Spatiotemporal Long Short-Term Memory(ST-LSTM) with a self-attention mechanism to enable accurate and real-time prediction of SVFs.To circumvent the limited amount of observational data,we employ transfer learning by first training the model using reanalysis datasets,followed by fine-tuning it using in-situ analysis data to obtain the final prediction model.By utilizing the historical 12-month SVFs as input,our model predicts the SVFs for the subsequent three months.We compare the performance of five models:Artificial Neural Networks(ANN),Long ShortTerm Memory(LSTM),Convolutional LSTM(ConvLSTM),ST-LSTM,and our proposed ST-LSTM-SA model in a test experiment spanning 2019 to 2022.Our results demonstrate that the ST-LSTM-SA model significantly improves the prediction accuracy and stability of sound velocity in both temporal and spatial dimensions.The ST-LSTM-SA model not only accurately predicts the ocean sound velocity field(SVF),but also provides valuable insights for spatiotemporal prediction of other oceanic environmental variables.展开更多
Continuous thunderstorm occurring at Qingdao Airport in China from August 7 to 13,2022 was analyzed based on sounding data.The weather was divided into thunderstorm gale,thunderstorm and heavy precipitation,and some p...Continuous thunderstorm occurring at Qingdao Airport in China from August 7 to 13,2022 was analyzed based on sounding data.The weather was divided into thunderstorm gale,thunderstorm and heavy precipitation,and some physical quantities and time variables which can effectively identify severe convective weather types were preliminarily obtained.The results show that CAPE was sensitive to different types of weather,but the uncertainty was relatively large.Convective temperature T_(CON),temperature difference between 500 and 850 hPa,and vertical wind shear can distinguish thunderstorm gale,thunderstorm and heavy precipitation weather obviously.Besides,K index,Showalter index,θ_(se) difference between 500 and 850 hPa were also important basis to distinguish thunderstorm and heavy precipitation weather.Thunderstorm gale can be distinguished by the 24-hour variations of K index,and the difference of dew point between 500 and 850 hPa.The 24-hour variations of(T-T_(d))_(500) and vertical wind shear can be used to distinguish between heavy precipitation and thunderstorm weather;the 24-hour variation of stratification stability Δθ_(se) can distinguish the three kinds of weather well.For the wind field,the existence of vertical wind shear was required for strong convective weather,and the 24-hour increment of thunderstorm gale and thunderstorm was larger than that of heavy precipitation.展开更多
This research aims to address the pressing issue of failed and abandoned wells, causing water scarcity in Lapan Gwari Community, through an improved groundwater exploration approach integrating remote sensing and elec...This research aims to address the pressing issue of failed and abandoned wells, causing water scarcity in Lapan Gwari Community, through an improved groundwater exploration approach integrating remote sensing and electrical resistivity soundings. The study area, located within the Zungeru Sheet 163 SE, spans Latitudes 9°30'00"N to 9°32'00"N and Longitudes 6°28'00" to 6°30'00". The surface geologic, structural, and hydrogeological mapping provided essential insights into the hydrogeological framework. Leveraging SRTM DEM data, thematic maps were created for geomorphology, slope, land use, lineament density, and drainage density. These datasets were then integrated using ArcGIS to develop a preliminary groundwater potential zones map. Further investigations were conducted using Vertical Electrical Sounding (VES) and Electrical Resistivity Imaging (2D VES) surveys at targeted locations identified by the preliminary map. Results show that the study area predominantly consists of crystalline rocks of the Nigerian Basement Complex, primarily comprising schist and granite with minor occurrences of quartz vein intrusions. Surface joint directions indicated a dominant NE-SW trend. The VES data revealed three to four geoelectric layers, encompassing the topsoil (1 to 5 m depth, resistivity: 100 Ωm to 300 Ωm), the weathered layer (in the 3-layer system) or fractured layer (in the 4-layer system), and the fresh basement rock characterized by infinite resistivity. The shallow weathered layers (3 to 30 m thickness) are believed to hold aquiferous potential. Hydrogeological interpretation, facilitated by 2D resistivity models, delineated water horizons trapped within clayey sand and weathered/fractured formations. Notably, the aquifer resistivity range was found to be between 3 - 35 m and 100 - 300 Ωm, signifying a promising aquifer positioned at depths of 40 to 88 m. This aligns with corroborative static water level measurements. Given this, we recommend drilling depths of a minimum of 80 m to ensure the acquisition of sufficient and sustainable water supplies. The final groundwater potential zones map derived from this study is expected to serve as an invaluable guide for prospective groundwater developers and relevant authorities in formulating effective water resource management plans. By effectively tackling water scarcity challenges in Lapan Gwari Community, this integrated approach demonstrates its potential for application in similar regions facing comparable hydrogeological concerns.展开更多
Auscultation is crucial for the diagnosis of respiratory system diseases.However,traditional stethoscopes have inherent limitations,such as inter-listener variability and subjectivity,and they cannot record respirator...Auscultation is crucial for the diagnosis of respiratory system diseases.However,traditional stethoscopes have inherent limitations,such as inter-listener variability and subjectivity,and they cannot record respiratory sounds for offline/retrospective diagnosis or remote prescriptions in telemedicine.The emergence of digital stethoscopes has overcome these limitations by allowing physicians to store and share respiratory sounds for consultation and education.On this basis,machine learning,particularly deep learning,enables the fully-automatic analysis of lung sounds that may pave the way for intelligent stethoscopes.This review thus aims to provide a comprehensive overview of deep learning algorithms used for lung sound analysis to emphasize the significance of artificial intelligence(AI)in this field.We focus on each component of deep learning-based lung sound analysis systems,including the task categories,public datasets,denoising methods,and,most importantly,existing deep learning methods,i.e.,the state-of-the-art approaches to convert lung sounds into two-dimensional(2D)spectrograms and use convolutional neural networks for the end-to-end recognition of respiratory diseases or abnormal lung sounds.Additionally,this review highlights current challenges in this field,including the variety of devices,noise sensitivity,and poor interpretability of deep models.To address the poor reproducibility and variety of deep learning in this field,this review also provides a scalable and flexible open-source framework that aims to standardize the algorithmic workflow and provide a solid basis for replication and future extension:https://github.com/contactless-healthcare/Deep-Learning-for-Lung-Sound-Analysis.展开更多
Chiral metamaterials have been proven to possess many appealing mechanical phenomena,such as negative Poisson's ratio,high-impact resistance,and energy absorption.This work extends the applications of chiral metam...Chiral metamaterials have been proven to possess many appealing mechanical phenomena,such as negative Poisson's ratio,high-impact resistance,and energy absorption.This work extends the applications of chiral metamaterials to underwater sound insulation.Various chiral metamaterials with low acoustic impedance and proper stiffness are inversely designed using the topology optimization scheme.Low acoustic impedance enables the metamaterials to have a high and broadband sound transmission loss(STL),while proper stiffness guarantees its robust acoustic performance under a hydrostatic pressure.As proof-of-concept demonstrations,two specimens are fabricated and tested in a water-filled impedance tube.Experimental results show that,on average,over 95%incident sound energy can be isolated by the specimens in a broad frequency range from 1 k Hz to 5 k Hz,while the sound insulation performance keeps stable under a certain hydrostatic pressure.This work may provide new insights for chiral metamaterials into the underwater applications with sound insulation.展开更多
As a crucial component of terrestrial ecosystems,urban forests play a pivotal role in protecting urban biodiversity by providing suitable habitats for acoustic spaces.Previous studies note that vegetation structure is...As a crucial component of terrestrial ecosystems,urban forests play a pivotal role in protecting urban biodiversity by providing suitable habitats for acoustic spaces.Previous studies note that vegetation structure is a key factor influencing bird sounds in urban forests;hence,adjusting the frequency composition may be a strategy for birds to avoid anthropogenic noise to mask their songs.However,it is unknown whether the response mechanisms of bird vocalizations to vegetation structure remain consistent despite being impacted by anthropogenic noise.It was hypothesized that anthropogenic noise in urban forests occupies the low-frequency space of bird songs,leading to a possible reshaping of the acoustic niches of forests,and the vegetation structure of urban forests is the critical factor that shapes the acoustic space for bird vocalization.Passive acoustic monitoring in various urban forests was used to monitor natural and anthropogenic noises,and sounds were classified into three acoustic scenes(bird sounds,human sounds,and bird-human sounds)to determine interconnections between bird sounds,anthropogenic noise,and vegetation structure.Anthropogenic noise altered the acoustic niche of urban forests by intruding into the low-frequency space used by birds,and vegetation structures related to volume(trunk volume and branch volume)and density(number of branches and leaf area index)significantly impact the diversity of bird sounds.Our findings indicate that the response to low and high frequency signals to vegetation structure is distinct.By clarifying this relationship,our results contribute to understanding of how vegetation structure influences bird sounds in urban forests impacted by anthropogenic noise.展开更多
A sandwich plate with a corrugation and auxetic honeycomb hybrid core is constructed,and its sound insulation and optimization are investigated.First,the motion governing equation of the sandwich plate is established ...A sandwich plate with a corrugation and auxetic honeycomb hybrid core is constructed,and its sound insulation and optimization are investigated.First,the motion governing equation of the sandwich plate is established by the third-order shear deformation theory(TSDT),and then combined with the fluid-structure coupling conditions,and the sound insulation is solved.The theoretical results are validated by COMSOL simulation results,and the effects of the structural parameter on the sound insulation are analyzed.Finally,the standard genetic algorithm is adopted to optimize the sound insulation of the sandwich plate.展开更多
In order to overcome the limitations of traditional microperforated plate with narrow sound absorption bandwidth and a single structure,two multi-cavity composite sound-absorbing materials were designed based on the s...In order to overcome the limitations of traditional microperforated plate with narrow sound absorption bandwidth and a single structure,two multi-cavity composite sound-absorbing materials were designed based on the shape of monoclinic crystals:uniaxial oblique structure(UOS)and biaxial oblique structure(BOS).Through finite element simulation and experimental research,the theoretical models of UOS and BOS were verified,and their sound absorption mechanisms were revealed.At the same time,the influence of multi-cavity composites on sound absorption performance was analyzed based on the theoretical model,and the influence of structural parameters on sound absorption performance was discussed.The research results show that,in the range of 100-2000 Hz,UOS has three sound absorption peaks and BOS has five sound absorption peaks.The frequency range of the half-absorption bandwidth(α>0.5)of UOS and BOS increases by 242% and 229%,respectively.Compared with traditional microperforated sound-absorbing structures,the series and parallel hybrid methods significantly increase the sound-absorbing bandwidth of the sound-absorbing structure.This research has guiding significance for noise control and has broad application prospects in the fields of transportation,construction,and mechanical design.展开更多
As an important material for manufacturing resonant components of musical instruments,Paulownia has an important influence on the sound quality of Ruan.In this paper,a model for evaluating the sound quality of Ruan ba...As an important material for manufacturing resonant components of musical instruments,Paulownia has an important influence on the sound quality of Ruan.In this paper,a model for evaluating the sound quality of Ruan based on the vibration characteristics of wood is developed using machine learning methods.Generally,the selection of materials for Ruan manufacturing relies primarily on manually weighing,observing,striking,and listening by the instrument technician.Deficiencies in scientific theory have hindered the quality of the finished Ruan.In this study,nine Ruans were manufactured,and a prediction model of Ruan sound quality was proposed based on the raw material information of Ruans.Out of a total of 180 data sets,145 and 45 sets were chosen for training and validation,respec-tively.In this paper,typical correlation analysis was used to determine the correlation between two single indicators in two adjacent pairwise combinations of the measured objects in each stage of the production process in Ruan.The vibra-tion characteristics of the wood were tested,and a model for predicting the evaluation of Ruan’s acoustic qualities was developed by measuring the vibration characteristics of the resonating plate material.The acoustic quality of the Ruan sound board wood was evaluated and predicted using machine learning model generalized regression neural net-work.The results show that the prediction of Ruan sound quality can be achieved using Matlab simulation based on the vibration characteristics of the soundboard wood.When the model-predicted values were compared with the tradi-tional predicted results,it was found that the generalized regression neural network had good performance,achieving an accuracy of 93.8%which was highly consistent with the experimental results.It was concluded that the model can accurately predict the acoustic quality of the Ruan based on the vibration performance of the soundboards.展开更多
翻译偏离通常源于语言的文化差异、译者的识解能力和方式的局限。兰盖克(Langacker)2019年最新提出的识解理论五维度,为认识The Sound and the Fury四个汉译本中的认知偏离现象提供了理论支撑。这些汉译本中的偏离现象虽遵循认知规律,...翻译偏离通常源于语言的文化差异、译者的识解能力和方式的局限。兰盖克(Langacker)2019年最新提出的识解理论五维度,为认识The Sound and the Fury四个汉译本中的认知偏离现象提供了理论支撑。这些汉译本中的偏离现象虽遵循认知规律,但深层原因主要涉及译者的视角差异、场景选择、信息突显、动态性表达及想象性再现等多个层面。在语言认知加工过程中,译者的认知框架和识解方式,以及他们与源语文本、作者和读者之间的认知互动对意义的动态构建会产生显著的影响和制约。展开更多
Natural creatures and ancient cultures are full of potential sources to provide inspiration for applied sciences.Inspired by the fractal geometry in nature and the fretwork frame in ancient culture,here we design the ...Natural creatures and ancient cultures are full of potential sources to provide inspiration for applied sciences.Inspired by the fractal geometry in nature and the fretwork frame in ancient culture,here we design the acoustic metasurface to realize sound anomalous modulation,which manifests itself as an incident-dependent propagation behavior:sound wave propagating in the forward direction is allowed to transmit with high efficiency while in the backward direction is obviously suppressed.We quantitatively investigate the dependences of asymmetric transmission on the propagation direction,incident angle and operating frequency by calculating sound transmittance and energy contrast.This compact fractal fretwork metasurface for acoustic anomalous modulation would promote the development of integrated acoustic devices and expand versatile applications in acoustic communication and information encryption.展开更多
Objective To assess the diagnostic accuracy of bowel sound analysis for irritable bowel syndrome(IBS)with a systematic review and meta-analysis.Methods We searched MEDLINE,Embase,the Cochrane Library,Web of Science,an...Objective To assess the diagnostic accuracy of bowel sound analysis for irritable bowel syndrome(IBS)with a systematic review and meta-analysis.Methods We searched MEDLINE,Embase,the Cochrane Library,Web of Science,and IEEE Xplore databases until September 2023.Cross-sectional and case-control studies on diagnostic accuracy of bowel sound analysis for IBS were identified.We estimated the pooled sensitivity,specificity,positive likelihood ratio,negative likeli-hood ratio,and diagnostic odds ratio with a 95% confidence interval(CI),and plotted a summary receiver operat-ing characteristic curve and evaluated the area under the curve.Results Four studies were included.The pooled diagnostic sensitivity,specificity,positive likelihood ratio,nega-tive likelihood ratio,and diagnostic odds ratio were 0.94(95%CI,0.87‒0.97),0.89(95%CI,0.81‒0.94),8.43(95%CI,4.81‒14.78),0.07(95%CI,0.03‒0.15),and 118.86(95%CI,44.18‒319.75),respectively,with an area under the curve of 0.97(95%CI,0.95‒0.98).Conclusions Computerized bowel sound analysis is a promising tool for IBS.However,limited high-quality data make the results'validity and applicability questionable.There is a need for more diagnostic test accuracy studies and better wearable devices for monitoring and analysis of IBS.展开更多
Investigations into the aerodynamic properties of vertical sound barriers exposed to high-speed operations employ computational fluid dynamics.The primary focus of this research is to evaluate the influence of train s...Investigations into the aerodynamic properties of vertical sound barriers exposed to high-speed operations employ computational fluid dynamics.The primary focus of this research is to evaluate the influence of train speed and the distance(D)from the track centerline under various operating conditions.The findings elucidate a marked elevation in the aerodynamic effect amplitude on sound barriers as train speeds increase.In single-train passages,the aerodynamic effect amplitude manifests a direct relationship with the square of the train speed.When two trains pass each other,the aerodynamic amplitude intensifies due to an additional aerodynamic increment on the sound barrier.This increment exhibits an approximate quadratic correlation with the retrograde train speed.Notably,the impact of high-speed trains on sound barrier aerodynamics surpasses that of low-speed trains,and this discrepancy amplifies with larger speed differentials between trains.Moreover,the train-induced aerodynamic effect diminishes significantly with greater distance(D),with occurrences of pressure coefficient(CP)exceeding the standard thresholds during dual-train passages.This study culminates in the formulation of universal equations for quantifying the influence of train speed and distance(D)on sound barrier aerodynamic characteristics across various operational scenarios.展开更多
We investigate the following inverse problem:starting from the acoustic wave equation,reconstruct a piecewise constant passive acoustic source from a single boundary temporal measurement without knowing the speed of s...We investigate the following inverse problem:starting from the acoustic wave equation,reconstruct a piecewise constant passive acoustic source from a single boundary temporal measurement without knowing the speed of sound.When the amplitudes of the source are known a priori,we prove a unique determination result of the shape and propose a level set algorithm to reconstruct the singularities.When the singularities of the source are known a priori,we show unique determination of the source amplitudes and propose a least-squares fitting algorithm to recover the source amplitudes.The analysis bridges the low-frequency source inversion problem and the inverse problem of gravimetry.The proposed algorithms are validated and quantitatively evaluated with numerical experiments in 2D and 3D.展开更多
Segregated incompressible large eddy simulation and acoustic perturbation equations were used to obtain the flow field and sound field of 1:25 scale trains with three,six and eight coaches in a long tunnel,and the aer...Segregated incompressible large eddy simulation and acoustic perturbation equations were used to obtain the flow field and sound field of 1:25 scale trains with three,six and eight coaches in a long tunnel,and the aerodynamic results were verified by wind tunnel test with the same scale two-coach train model.Time-averaged drag coefficients of the head coach of three trains are similar,but at the tail coach of the multi-group trains it is much larger than that of the three-coach train.The eight-coach train presents the largest increment from the head coach to the tail coach in the standard deviation(STD)of aerodynamic force coefficients:0.0110 for drag coefficient(Cd),0.0198 for lift coefficient(Cl)and 0.0371 for side coef-ficient(Cs).Total sound pressure level at the bottom of multi-group trains presents a significant streamwise increase,which is different from the three-coach train.Tunnel walls affect the acoustic distribution at the bottom,only after the coach number reaches a certain value,and the streamwise increase in the sound pressure fluctuation of multi-group trains is strengthened by coach number.Fourier transform of the turbulent and sound pressures presents that coach number has little influence on the peak frequencies,but increases the sound pressure level values at the tail bogie cavities.Furthermore,different from the turbulent pressure,the first two sound pressure proper orthogonal decomposition(POD)modes in the bogie cavities contain 90%of the total energy,and the spatial distributions indicate that the acoustic distributions in the head and tail bogies are not related to coach number.展开更多
Purpose – The vibration of the rails is a significant source of railway rolling noise, often forming the dominantcomponent of noise in the important frequency region between 400 and 2000 Hz. The purpose of the paper ...Purpose – The vibration of the rails is a significant source of railway rolling noise, often forming the dominantcomponent of noise in the important frequency region between 400 and 2000 Hz. The purpose of the paper is toinvestigate the influence of the ground profile and the presence of the train body on the sound radiation fromthe rail.Design/methodology/approach – Two-dimensional boundary element calculations are used, in which therail vibration is the source. The ground profile and various different shapes of train body are introduced in themodel, and results are observed in terms of sound power and sound pressure. Comparisons are also made withvibro-acoustic measurements performed with and without a train present.Findings – The sound radiated by the rail in the absence of the train body is strongly attenuated by shieldingdue to the ballast shoulder. When the train body is present, the sound from the vertical rail motion is reflectedback down toward the track where it is partly absorbed by the ballast. Nevertheless, the sound pressure at thetrackside is increased by typically 0–5 dB. For the lateral vibration of the rail, the effects are much smaller. Oncethe sound power is known, the sound pressure with the train present can be approximated reasonably well withsimple line source directivities.Originality/value – Numerical models used to predict the sound radiation from railway rails have generallyneglected the influence of the ground profile and reflections from the underside of the train body on the soundpower and directivity of the rail. These effects are studied in a systematic way including comparisons with measurements.展开更多
With the development of ultra-wide coverage technology,multibeam echo-sounder(MBES)system has put forward higher requirements for localization accuracy and computational efficiency of ray tracing method.The classical ...With the development of ultra-wide coverage technology,multibeam echo-sounder(MBES)system has put forward higher requirements for localization accuracy and computational efficiency of ray tracing method.The classical equivalent sound speed profile(ESSP)method replaces the measured sound velocity profile(SVP)with a simple constant gradient SVP,reducing the computational workload of beam positioning.However,in deep-sea environment,the depth measurement error of this method rapidly increases from the central beam to the edge beam.By analyzing the positioning error of the ESSP method at edge beam,it is discovered that the positioning error increases monotonically with the incident angle,and the relationship between them could be expressed by polynomial function.Therefore,an error correction algorithm based on polynomial fitting is obtained.The simulation experiment conducted on an inclined seafloor shows that the proposed algorithm exhibits comparable efficiency to the original ESSP method,while significantly improving bathymetry accuracy by nearly eight times in the edge beam.展开更多
Acoustic source localization(ASL)and sound event detection(SED)are two widely pursued independent research fields.In recent years,in order to achieve a more complete spatial and temporal representation of sound field,...Acoustic source localization(ASL)and sound event detection(SED)are two widely pursued independent research fields.In recent years,in order to achieve a more complete spatial and temporal representation of sound field,sound event localization and detection(SELD)has become a very active research topic.This paper presents a deep learning-based multioverlapping sound event localization and detection algorithm in three-dimensional space.Log-Mel spectrum and generalized cross-correlation spectrum are joined together in channel dimension as input features.These features are classified and regressed in parallel after training by a neural network to obtain sound recognition and localization results respectively.The channel attention mechanism is also introduced in the network to selectively enhance the features containing essential information and suppress the useless features.Finally,a thourough comparison confirms the efficiency and effectiveness of the proposed SELD algorithm.Field experiments show that the proposed algorithm is robust to reverberation and environment and can achieve higher recognition and localization accuracy compared with the baseline method.展开更多
文摘Introduction: Located in the central-western part of Côte d’Ivoire, the subsoil of the Gagnoa region is made up of sedimentary volcano formations and granitoids with developed fracturing. This complex Precambrian basement contains most of the region’s water resources. This is at the origin of the high failure rate during the various hydrogeological prospecting campaigns. Methodology: The database consists of resistivities from 42 holes and 51 trails drilled as part of the implementation of high-throughput drilling in the study area. The objective of this study is to deepen the knowledge of the fissured basement by interpreting profile curves and electrical soundings. It will be a question of classifying the different types of anomalies obtained on the profiles and their shapes. The orientation of the lineaments observed on the profiles was determined. Results: The interpretation of the geophysical data revealed various anomalies, the main ones being of the CC (Conductor Compartment) and CEDP (Contact between two bearings) types. These types of anomalies are mainly expressed in various forms: the “V”, “W” and “U” shapes. From these anomalies and the appearance of the electrical profiles, lineaments and their orientations were identified with N90-100, N130-140, N170-180 as major orientations. Conclusion: These results could contribute to a better understanding of the fractured environment of the Gagnoa region.
基金funded by an NSFC Major Project (Grant No. 42090033)the China Meteorological Administration Youth Innovation Team “High-Value Climate Change Data Product Development and Application Services”(Grant No. CMA2023QN08)the National Meteorological Information Centre Surplus Funds Program (Grant NMICJY202310)。
文摘High-vertical-resolution radiosonde wind data are highly valuable for describing the dynamics of the meso-and microscale atmosphere. However, the current algorithm used in China's L-band radar sounding system for calculating highvertical-resolution wind vectors excessively smooths the data, resulting in significant underestimation of the calculated kinetic energy of gravity waves compared to similar products from other countries, which greatly limits the effective utilization of the data. To address this issue, this study proposes a novel method to calculate high-vertical-resolution wind vectors that utilizes the elevation angle, azimuth angle, and slant range from L-band radar. In order to obtain wind data with a stable quality, a two-step automatic quality control procedure, including the RMSE-F(root-mean-square error F) test and elemental consistency test are first applied to the slant range data, to eliminate continuous erroneous data caused by unstable signals or radar malfunctions. Then, a wind calculation scheme based on a sliding second-order polynomial fitting is utilized to derive the high-vertical-resolution radiosonde wind vectors. The evaluation results demonstrate that the wind data obtained through the proposed method show a high level of consistency with the high-resolution wind data observed using the Vaisala Global Positioning System and the data observed by the new Beidou Navigation Sounding System. The calculation of the kinetic energy of gravity waves in the recalculated wind data also reaches a level comparable to the Vaisala observations.
基金supported by the National Natural Science Foundation of China(Grant No.42004030)Basic Scientific Fund for National Public Research Institutes of China(Grant No.2022S03)+1 种基金Science and Technology Innovation Project(LSKJ202205102)funded by Laoshan Laboratory,and the National Key Research and Development Program of China(2020YFB0505805).
文摘The scarcity of in-situ ocean observations poses a challenge for real-time information acquisition in the ocean.Among the crucial hydroacoustic environmental parameters,ocean sound velocity exhibits significant spatial and temporal variability and it is highly relevant to oceanic research.In this study,we propose a new data-driven approach,leveraging deep learning techniques,for the prediction of sound velocity fields(SVFs).Our novel spatiotemporal prediction model,STLSTM-SA,combines Spatiotemporal Long Short-Term Memory(ST-LSTM) with a self-attention mechanism to enable accurate and real-time prediction of SVFs.To circumvent the limited amount of observational data,we employ transfer learning by first training the model using reanalysis datasets,followed by fine-tuning it using in-situ analysis data to obtain the final prediction model.By utilizing the historical 12-month SVFs as input,our model predicts the SVFs for the subsequent three months.We compare the performance of five models:Artificial Neural Networks(ANN),Long ShortTerm Memory(LSTM),Convolutional LSTM(ConvLSTM),ST-LSTM,and our proposed ST-LSTM-SA model in a test experiment spanning 2019 to 2022.Our results demonstrate that the ST-LSTM-SA model significantly improves the prediction accuracy and stability of sound velocity in both temporal and spatial dimensions.The ST-LSTM-SA model not only accurately predicts the ocean sound velocity field(SVF),but also provides valuable insights for spatiotemporal prediction of other oceanic environmental variables.
文摘Continuous thunderstorm occurring at Qingdao Airport in China from August 7 to 13,2022 was analyzed based on sounding data.The weather was divided into thunderstorm gale,thunderstorm and heavy precipitation,and some physical quantities and time variables which can effectively identify severe convective weather types were preliminarily obtained.The results show that CAPE was sensitive to different types of weather,but the uncertainty was relatively large.Convective temperature T_(CON),temperature difference between 500 and 850 hPa,and vertical wind shear can distinguish thunderstorm gale,thunderstorm and heavy precipitation weather obviously.Besides,K index,Showalter index,θ_(se) difference between 500 and 850 hPa were also important basis to distinguish thunderstorm and heavy precipitation weather.Thunderstorm gale can be distinguished by the 24-hour variations of K index,and the difference of dew point between 500 and 850 hPa.The 24-hour variations of(T-T_(d))_(500) and vertical wind shear can be used to distinguish between heavy precipitation and thunderstorm weather;the 24-hour variation of stratification stability Δθ_(se) can distinguish the three kinds of weather well.For the wind field,the existence of vertical wind shear was required for strong convective weather,and the 24-hour increment of thunderstorm gale and thunderstorm was larger than that of heavy precipitation.
文摘This research aims to address the pressing issue of failed and abandoned wells, causing water scarcity in Lapan Gwari Community, through an improved groundwater exploration approach integrating remote sensing and electrical resistivity soundings. The study area, located within the Zungeru Sheet 163 SE, spans Latitudes 9°30'00"N to 9°32'00"N and Longitudes 6°28'00" to 6°30'00". The surface geologic, structural, and hydrogeological mapping provided essential insights into the hydrogeological framework. Leveraging SRTM DEM data, thematic maps were created for geomorphology, slope, land use, lineament density, and drainage density. These datasets were then integrated using ArcGIS to develop a preliminary groundwater potential zones map. Further investigations were conducted using Vertical Electrical Sounding (VES) and Electrical Resistivity Imaging (2D VES) surveys at targeted locations identified by the preliminary map. Results show that the study area predominantly consists of crystalline rocks of the Nigerian Basement Complex, primarily comprising schist and granite with minor occurrences of quartz vein intrusions. Surface joint directions indicated a dominant NE-SW trend. The VES data revealed three to four geoelectric layers, encompassing the topsoil (1 to 5 m depth, resistivity: 100 Ωm to 300 Ωm), the weathered layer (in the 3-layer system) or fractured layer (in the 4-layer system), and the fresh basement rock characterized by infinite resistivity. The shallow weathered layers (3 to 30 m thickness) are believed to hold aquiferous potential. Hydrogeological interpretation, facilitated by 2D resistivity models, delineated water horizons trapped within clayey sand and weathered/fractured formations. Notably, the aquifer resistivity range was found to be between 3 - 35 m and 100 - 300 Ωm, signifying a promising aquifer positioned at depths of 40 to 88 m. This aligns with corroborative static water level measurements. Given this, we recommend drilling depths of a minimum of 80 m to ensure the acquisition of sufficient and sustainable water supplies. The final groundwater potential zones map derived from this study is expected to serve as an invaluable guide for prospective groundwater developers and relevant authorities in formulating effective water resource management plans. By effectively tackling water scarcity challenges in Lapan Gwari Community, this integrated approach demonstrates its potential for application in similar regions facing comparable hydrogeological concerns.
基金This work is supported by the National Key Research and Development Program of China(2022YFC2407800)the General Program of National Natural Science Foundation of China(62271241)+1 种基金the Guangdong Basic and Applied Basic Research Foundation(2023A1515012983)the Shenzhen Fundamental Research Program(JCYJ20220530112601003).
文摘Auscultation is crucial for the diagnosis of respiratory system diseases.However,traditional stethoscopes have inherent limitations,such as inter-listener variability and subjectivity,and they cannot record respiratory sounds for offline/retrospective diagnosis or remote prescriptions in telemedicine.The emergence of digital stethoscopes has overcome these limitations by allowing physicians to store and share respiratory sounds for consultation and education.On this basis,machine learning,particularly deep learning,enables the fully-automatic analysis of lung sounds that may pave the way for intelligent stethoscopes.This review thus aims to provide a comprehensive overview of deep learning algorithms used for lung sound analysis to emphasize the significance of artificial intelligence(AI)in this field.We focus on each component of deep learning-based lung sound analysis systems,including the task categories,public datasets,denoising methods,and,most importantly,existing deep learning methods,i.e.,the state-of-the-art approaches to convert lung sounds into two-dimensional(2D)spectrograms and use convolutional neural networks for the end-to-end recognition of respiratory diseases or abnormal lung sounds.Additionally,this review highlights current challenges in this field,including the variety of devices,noise sensitivity,and poor interpretability of deep models.To address the poor reproducibility and variety of deep learning in this field,this review also provides a scalable and flexible open-source framework that aims to standardize the algorithmic workflow and provide a solid basis for replication and future extension:https://github.com/contactless-healthcare/Deep-Learning-for-Lung-Sound-Analysis.
基金supported by the National Natural Science Foundation of China(Nos.52171327,11991032,52201386,and 51805537)。
文摘Chiral metamaterials have been proven to possess many appealing mechanical phenomena,such as negative Poisson's ratio,high-impact resistance,and energy absorption.This work extends the applications of chiral metamaterials to underwater sound insulation.Various chiral metamaterials with low acoustic impedance and proper stiffness are inversely designed using the topology optimization scheme.Low acoustic impedance enables the metamaterials to have a high and broadband sound transmission loss(STL),while proper stiffness guarantees its robust acoustic performance under a hydrostatic pressure.As proof-of-concept demonstrations,two specimens are fabricated and tested in a water-filled impedance tube.Experimental results show that,on average,over 95%incident sound energy can be isolated by the specimens in a broad frequency range from 1 k Hz to 5 k Hz,while the sound insulation performance keeps stable under a certain hydrostatic pressure.This work may provide new insights for chiral metamaterials into the underwater applications with sound insulation.
基金the National Natural Science Foundation of China(32201338)Science Technology Program from the Forestry Administration of Guangdong Province(2021KJCX017)+1 种基金Guangzhou Municipal Science and Technology Bureau Program(2023A04J0086)Shenzhen Key Laboratory of Southern Subtropical Plant Diversity。
文摘As a crucial component of terrestrial ecosystems,urban forests play a pivotal role in protecting urban biodiversity by providing suitable habitats for acoustic spaces.Previous studies note that vegetation structure is a key factor influencing bird sounds in urban forests;hence,adjusting the frequency composition may be a strategy for birds to avoid anthropogenic noise to mask their songs.However,it is unknown whether the response mechanisms of bird vocalizations to vegetation structure remain consistent despite being impacted by anthropogenic noise.It was hypothesized that anthropogenic noise in urban forests occupies the low-frequency space of bird songs,leading to a possible reshaping of the acoustic niches of forests,and the vegetation structure of urban forests is the critical factor that shapes the acoustic space for bird vocalization.Passive acoustic monitoring in various urban forests was used to monitor natural and anthropogenic noises,and sounds were classified into three acoustic scenes(bird sounds,human sounds,and bird-human sounds)to determine interconnections between bird sounds,anthropogenic noise,and vegetation structure.Anthropogenic noise altered the acoustic niche of urban forests by intruding into the low-frequency space used by birds,and vegetation structures related to volume(trunk volume and branch volume)and density(number of branches and leaf area index)significantly impact the diversity of bird sounds.Our findings indicate that the response to low and high frequency signals to vegetation structure is distinct.By clarifying this relationship,our results contribute to understanding of how vegetation structure influences bird sounds in urban forests impacted by anthropogenic noise.
基金Project supported by the National Natural Science Foundation of China (Nos. 12172339 and 11732005)the Beijing Natural Science Foundation of China (No. 1222006)。
文摘A sandwich plate with a corrugation and auxetic honeycomb hybrid core is constructed,and its sound insulation and optimization are investigated.First,the motion governing equation of the sandwich plate is established by the third-order shear deformation theory(TSDT),and then combined with the fluid-structure coupling conditions,and the sound insulation is solved.The theoretical results are validated by COMSOL simulation results,and the effects of the structural parameter on the sound insulation are analyzed.Finally,the standard genetic algorithm is adopted to optimize the sound insulation of the sandwich plate.
基金Project(52202455)supported by the National Natural Science Foundation of ChinaProject(23A0017)supported by the Key Project of Scientific Research Project of Hunan Provincial Department of Education,China。
文摘In order to overcome the limitations of traditional microperforated plate with narrow sound absorption bandwidth and a single structure,two multi-cavity composite sound-absorbing materials were designed based on the shape of monoclinic crystals:uniaxial oblique structure(UOS)and biaxial oblique structure(BOS).Through finite element simulation and experimental research,the theoretical models of UOS and BOS were verified,and their sound absorption mechanisms were revealed.At the same time,the influence of multi-cavity composites on sound absorption performance was analyzed based on the theoretical model,and the influence of structural parameters on sound absorption performance was discussed.The research results show that,in the range of 100-2000 Hz,UOS has three sound absorption peaks and BOS has five sound absorption peaks.The frequency range of the half-absorption bandwidth(α>0.5)of UOS and BOS increases by 242% and 229%,respectively.Compared with traditional microperforated sound-absorbing structures,the series and parallel hybrid methods significantly increase the sound-absorbing bandwidth of the sound-absorbing structure.This research has guiding significance for noise control and has broad application prospects in the fields of transportation,construction,and mechanical design.
基金supported by China Postdoctoral Science Foundation(2019M651240)National Natural Science Foundation of China(31670559).
文摘As an important material for manufacturing resonant components of musical instruments,Paulownia has an important influence on the sound quality of Ruan.In this paper,a model for evaluating the sound quality of Ruan based on the vibration characteristics of wood is developed using machine learning methods.Generally,the selection of materials for Ruan manufacturing relies primarily on manually weighing,observing,striking,and listening by the instrument technician.Deficiencies in scientific theory have hindered the quality of the finished Ruan.In this study,nine Ruans were manufactured,and a prediction model of Ruan sound quality was proposed based on the raw material information of Ruans.Out of a total of 180 data sets,145 and 45 sets were chosen for training and validation,respec-tively.In this paper,typical correlation analysis was used to determine the correlation between two single indicators in two adjacent pairwise combinations of the measured objects in each stage of the production process in Ruan.The vibra-tion characteristics of the wood were tested,and a model for predicting the evaluation of Ruan’s acoustic qualities was developed by measuring the vibration characteristics of the resonating plate material.The acoustic quality of the Ruan sound board wood was evaluated and predicted using machine learning model generalized regression neural net-work.The results show that the prediction of Ruan sound quality can be achieved using Matlab simulation based on the vibration characteristics of the soundboard wood.When the model-predicted values were compared with the tradi-tional predicted results,it was found that the generalized regression neural network had good performance,achieving an accuracy of 93.8%which was highly consistent with the experimental results.It was concluded that the model can accurately predict the acoustic quality of the Ruan based on the vibration performance of the soundboards.
文摘翻译偏离通常源于语言的文化差异、译者的识解能力和方式的局限。兰盖克(Langacker)2019年最新提出的识解理论五维度,为认识The Sound and the Fury四个汉译本中的认知偏离现象提供了理论支撑。这些汉译本中的偏离现象虽遵循认知规律,但深层原因主要涉及译者的视角差异、场景选择、信息突显、动态性表达及想象性再现等多个层面。在语言认知加工过程中,译者的认知框架和识解方式,以及他们与源语文本、作者和读者之间的认知互动对意义的动态构建会产生显著的影响和制约。
基金Project supported by the National Key Research and Development Program of China(Grant No.2022YFA1404500)the National Natural Science Foundation of China(Grant Nos.T2222024 and 12034005)the STCSM Science and Technology Innovation Plan of Shanghai Science and Technology Commission(Grant Nos.20ZR1404200 and 21JC1400300)。
文摘Natural creatures and ancient cultures are full of potential sources to provide inspiration for applied sciences.Inspired by the fractal geometry in nature and the fretwork frame in ancient culture,here we design the acoustic metasurface to realize sound anomalous modulation,which manifests itself as an incident-dependent propagation behavior:sound wave propagating in the forward direction is allowed to transmit with high efficiency while in the backward direction is obviously suppressed.We quantitatively investigate the dependences of asymmetric transmission on the propagation direction,incident angle and operating frequency by calculating sound transmittance and energy contrast.This compact fractal fretwork metasurface for acoustic anomalous modulation would promote the development of integrated acoustic devices and expand versatile applications in acoustic communication and information encryption.
基金funded by the National Natural Science Foundation of China(No.32170788)National High Level Hospital Clinical Research Funding(No.2022-PUMCH-B-023)Beijing Natural Science Foundation(No.7232123).
文摘Objective To assess the diagnostic accuracy of bowel sound analysis for irritable bowel syndrome(IBS)with a systematic review and meta-analysis.Methods We searched MEDLINE,Embase,the Cochrane Library,Web of Science,and IEEE Xplore databases until September 2023.Cross-sectional and case-control studies on diagnostic accuracy of bowel sound analysis for IBS were identified.We estimated the pooled sensitivity,specificity,positive likelihood ratio,negative likeli-hood ratio,and diagnostic odds ratio with a 95% confidence interval(CI),and plotted a summary receiver operat-ing characteristic curve and evaluated the area under the curve.Results Four studies were included.The pooled diagnostic sensitivity,specificity,positive likelihood ratio,nega-tive likelihood ratio,and diagnostic odds ratio were 0.94(95%CI,0.87‒0.97),0.89(95%CI,0.81‒0.94),8.43(95%CI,4.81‒14.78),0.07(95%CI,0.03‒0.15),and 118.86(95%CI,44.18‒319.75),respectively,with an area under the curve of 0.97(95%CI,0.95‒0.98).Conclusions Computerized bowel sound analysis is a promising tool for IBS.However,limited high-quality data make the results'validity and applicability questionable.There is a need for more diagnostic test accuracy studies and better wearable devices for monitoring and analysis of IBS.
基金This study was supported in part by the National Natural Science Foundation of China under Grant Nos.52278463,52208505,and 52202422.
文摘Investigations into the aerodynamic properties of vertical sound barriers exposed to high-speed operations employ computational fluid dynamics.The primary focus of this research is to evaluate the influence of train speed and the distance(D)from the track centerline under various operating conditions.The findings elucidate a marked elevation in the aerodynamic effect amplitude on sound barriers as train speeds increase.In single-train passages,the aerodynamic effect amplitude manifests a direct relationship with the square of the train speed.When two trains pass each other,the aerodynamic amplitude intensifies due to an additional aerodynamic increment on the sound barrier.This increment exhibits an approximate quadratic correlation with the retrograde train speed.Notably,the impact of high-speed trains on sound barrier aerodynamics surpasses that of low-speed trains,and this discrepancy amplifies with larger speed differentials between trains.Moreover,the train-induced aerodynamic effect diminishes significantly with greater distance(D),with occurrences of pressure coefficient(CP)exceeding the standard thresholds during dual-train passages.This study culminates in the formulation of universal equations for quantifying the influence of train speed and distance(D)on sound barrier aerodynamic characteristics across various operational scenarios.
基金partially supported by the NSF(Grant Nos.2012046,2152011,and 2309534)partially supported by the NSF(Grant Nos.DMS-1715178,DMS-2006881,and DMS-2237534)+1 种基金NIH(Grant No.R03-EB033521)startup fund from Michigan State University.
文摘We investigate the following inverse problem:starting from the acoustic wave equation,reconstruct a piecewise constant passive acoustic source from a single boundary temporal measurement without knowing the speed of sound.When the amplitudes of the source are known a priori,we prove a unique determination result of the shape and propose a level set algorithm to reconstruct the singularities.When the singularities of the source are known a priori,we show unique determination of the source amplitudes and propose a least-squares fitting algorithm to recover the source amplitudes.The analysis bridges the low-frequency source inversion problem and the inverse problem of gravimetry.The proposed algorithms are validated and quantitatively evaluated with numerical experiments in 2D and 3D.
基金supported by the National Natural Science Foundation of China (Grant No. 52072267)Shanghai Key Lab of Vehicle Aerodynamics and Vehicle Thermal Management Systems (Grant No. 23DZ2229029)
文摘Segregated incompressible large eddy simulation and acoustic perturbation equations were used to obtain the flow field and sound field of 1:25 scale trains with three,six and eight coaches in a long tunnel,and the aerodynamic results were verified by wind tunnel test with the same scale two-coach train model.Time-averaged drag coefficients of the head coach of three trains are similar,but at the tail coach of the multi-group trains it is much larger than that of the three-coach train.The eight-coach train presents the largest increment from the head coach to the tail coach in the standard deviation(STD)of aerodynamic force coefficients:0.0110 for drag coefficient(Cd),0.0198 for lift coefficient(Cl)and 0.0371 for side coef-ficient(Cs).Total sound pressure level at the bottom of multi-group trains presents a significant streamwise increase,which is different from the three-coach train.Tunnel walls affect the acoustic distribution at the bottom,only after the coach number reaches a certain value,and the streamwise increase in the sound pressure fluctuation of multi-group trains is strengthened by coach number.Fourier transform of the turbulent and sound pressures presents that coach number has little influence on the peak frequencies,but increases the sound pressure level values at the tail bogie cavities.Furthermore,different from the turbulent pressure,the first two sound pressure proper orthogonal decomposition(POD)modes in the bogie cavities contain 90%of the total energy,and the spatial distributions indicate that the acoustic distributions in the head and tail bogies are not related to coach number.
基金supported by the TRANSIT project(funded by EU Horizon 2020 and the Europe’s Rail Joint Undertaking under Grant Agreement 881771).
文摘Purpose – The vibration of the rails is a significant source of railway rolling noise, often forming the dominantcomponent of noise in the important frequency region between 400 and 2000 Hz. The purpose of the paper is toinvestigate the influence of the ground profile and the presence of the train body on the sound radiation fromthe rail.Design/methodology/approach – Two-dimensional boundary element calculations are used, in which therail vibration is the source. The ground profile and various different shapes of train body are introduced in themodel, and results are observed in terms of sound power and sound pressure. Comparisons are also made withvibro-acoustic measurements performed with and without a train present.Findings – The sound radiated by the rail in the absence of the train body is strongly attenuated by shieldingdue to the ballast shoulder. When the train body is present, the sound from the vertical rail motion is reflectedback down toward the track where it is partly absorbed by the ballast. Nevertheless, the sound pressure at thetrackside is increased by typically 0–5 dB. For the lateral vibration of the rail, the effects are much smaller. Oncethe sound power is known, the sound pressure with the train present can be approximated reasonably well withsimple line source directivities.Originality/value – Numerical models used to predict the sound radiation from railway rails have generallyneglected the influence of the ground profile and reflections from the underside of the train body on the soundpower and directivity of the rail. These effects are studied in a systematic way including comparisons with measurements.
基金The Natural Science Foundation of Shandong Province of China under contract Nos ZR2022MA051 and ZR2020MA090the National Natural Science Foundation of China under contract No.U22A2012+2 种基金China Postdoctoral Science Foundation under contract No.2020M670891the SDUST Research Fund under contract No.2019TDJH103the Talent Introduction Plan for Youth Innovation Team in universities of Shandong Province(innovation team of satellite positioning and navigation)。
文摘With the development of ultra-wide coverage technology,multibeam echo-sounder(MBES)system has put forward higher requirements for localization accuracy and computational efficiency of ray tracing method.The classical equivalent sound speed profile(ESSP)method replaces the measured sound velocity profile(SVP)with a simple constant gradient SVP,reducing the computational workload of beam positioning.However,in deep-sea environment,the depth measurement error of this method rapidly increases from the central beam to the edge beam.By analyzing the positioning error of the ESSP method at edge beam,it is discovered that the positioning error increases monotonically with the incident angle,and the relationship between them could be expressed by polynomial function.Therefore,an error correction algorithm based on polynomial fitting is obtained.The simulation experiment conducted on an inclined seafloor shows that the proposed algorithm exhibits comparable efficiency to the original ESSP method,while significantly improving bathymetry accuracy by nearly eight times in the edge beam.
基金supported by the National Natural Science Foundation of China(61877067)the Foundation of Science and Technology on Near-Surface Detection Laboratory(TCGZ2019A002,TCGZ2021C003,6142414200511)the Natural Science Basic Research Program of Shaanxi(2021JZ-19)。
文摘Acoustic source localization(ASL)and sound event detection(SED)are two widely pursued independent research fields.In recent years,in order to achieve a more complete spatial and temporal representation of sound field,sound event localization and detection(SELD)has become a very active research topic.This paper presents a deep learning-based multioverlapping sound event localization and detection algorithm in three-dimensional space.Log-Mel spectrum and generalized cross-correlation spectrum are joined together in channel dimension as input features.These features are classified and regressed in parallel after training by a neural network to obtain sound recognition and localization results respectively.The channel attention mechanism is also introduced in the network to selectively enhance the features containing essential information and suppress the useless features.Finally,a thourough comparison confirms the efficiency and effectiveness of the proposed SELD algorithm.Field experiments show that the proposed algorithm is robust to reverberation and environment and can achieve higher recognition and localization accuracy compared with the baseline method.