Environmental sound classification(ESC)involves the process of distinguishing an audio stream associated with numerous environmental sounds.Some common aspects such as the framework difference,overlapping of different...Environmental sound classification(ESC)involves the process of distinguishing an audio stream associated with numerous environmental sounds.Some common aspects such as the framework difference,overlapping of different sound events,and the presence of various sound sources during recording make the ESC task much more complicated and complex.This research is to propose a deep learning model to improve the recognition rate of environmental sounds and reduce the model training time under limited computation resources.In this research,the performance of transformer and convolutional neural networks(CNN)are investigated.Seven audio features,chromagram,Mel-spectrogram,tonnetz,Mel-Frequency Cepstral Coefficients(MFCCs),delta MFCCs,delta-delta MFCCs and spectral contrast,are extracted fromtheUrbanSound8K,ESC-50,and ESC-10,databases.Moreover,this research also employed three data enhancement methods,namely,white noise,pitch tuning,and time stretch to reduce the risk of overfitting issue due to the limited audio clips.The evaluation of various experiments demonstrates that the best performance was achieved by the proposed transformer model using seven audio features on enhanced database.For UrbanSound8K,ESC-50,and ESC-10,the highest attained accuracies are 0.98,0.94,and 0.97 respectively.The experimental results reveal that the proposed technique can achieve the best performance for ESC problems.展开更多
Environmental sound classification (ESC) has gained increasing attention in recent years. This study focuses on the evaluation of the popular public dataset Urbansound8k (Us8k) at different sampling rates using hand c...Environmental sound classification (ESC) has gained increasing attention in recent years. This study focuses on the evaluation of the popular public dataset Urbansound8k (Us8k) at different sampling rates using hand crafted features. The Us8k dataset contains environment sounds recorded at various sampling rates, and previous ESC works have uniformly resampled the dataset. Some previous work converted this data to different sampling rates for various reasons. Some of them chose to convert the rest of the dataset to 44,100, as the majority of the Us8k files were already at that sampling rate. On the other hand, some researchers down sampled the dataset to 8000, as it reduced computational complexity, while others resampled it to 16,000, aiming to achieve a balance between higher classification accuracy and lower computational complexity. In this research, we assessed the performance of ESC tasks using sampling rates of 8000 Hz, 16,000 Hz, and 44,100 Hz by extracting the hand crafted features Mel frequency cepstral coefficient (MFCC), gamma tone cepstral coefficients (GTCC), and Mel Spectrogram (MelSpec). The results indicated that there was no significant difference in the classification accuracy among the three tested sampling rates.展开更多
Noise pollution is one of the common physical harmful factors in many work environments.The current study aimed to assess personal and environmental sound pressure level and project the sound-Isosonic map in one of th...Noise pollution is one of the common physical harmful factors in many work environments.The current study aimed to assess personal and environmental sound pressure level and project the sound-Isosonic map in one of the Razavi Khorasan Paste manufacture using Surfer V.14 and Noise at work V.5.0.This cross-sectional,descrip-tive study is analytical that was conducted in 2018 in the Paste factory that contains Canister,production and Brewing unit.Following ISO 9612:2009,Casella Cel-320 was used to measure personal sound pressure level,while CEL-450 sound level meter(manufactured by Casella-Cel,the UK)was employed to assess environmental sound pressure level.Statistical analyzes was done using SPSS V.18 and Linear Regression test.The sound-isosonic maps were projected using Surfer V.14 and Noise at work V.5.0.The results of assessing personal sound pressure level indicated that the highest received dose(172.21%)and personal equivalent sound level(87.36 dBA)were recorded for workers in the Canister unit.According to results of measuring of the environmental sound pressure level,out of 16 measurement stations in this unit,overall 87.5%were regarded as danger and caution areas.The lowest and highest sound pressure levels in this units were 61 dBA and 92 dBA that belong to Brewing and Canister units respectively.Results indicate Over 75%of the Canister and production units had a sound pressure level greater than 85 dBA and these two units were regarded as the most dangerous area in terms of noise pollution.It is there-fore necessary to implement noise control measures,apply hearing protection program and auditory tests among workers in these units.展开更多
In this study,the crowd has been investigated and analyzed in waiting areas in large general hospitals in China in order to find the rules the variations of sound environment with the change of crowd. The field invest...In this study,the crowd has been investigated and analyzed in waiting areas in large general hospitals in China in order to find the rules the variations of sound environment with the change of crowd. The field investigation,questionnaire,field-testing and computer simulation have been adopted. The results show that: the social /demographic characteristics of staff and patients are not significantly related to the satisfaction evaluation of sound environment; there is a significant correlation between the population density and LAeq of the background noise in waiting areas; when population density is 0,the LAeq of background noise is not 0 in waiting areas; the loudspeaker should be set in the waiting areas. Loudspeaker arrangements should be integrated into the ceiling lamp or construct facilities along the depth direction of the layout,and the two adjacent speakers recommended distance should be controlled at about 4 m. If the population density is controlled in the reasonable range,and sound absorption,noise reduction processing and electronic queuing system are adopted,sound environment of waiting areas will be built with noise interference relatively small in different population densities.展开更多
By using the expressions for the maximum absorption per wavelength (αλ),and the relaxation frequency fr of the boric acid relaxation derived previously by the author and employing the related oceanographic literatur...By using the expressions for the maximum absorption per wavelength (αλ),and the relaxation frequency fr of the boric acid relaxation derived previously by the author and employing the related oceanographic literatures, the effects of pressure, temperature, pH and salinity on (αλ)r and ∫r of the boric acid relaxation in sea water have been estimated. Results show that ( αλ), not only increases with pH but also increases approximately linearly with pressure and temperature, and is nearly proportional to the 1. 35 power of salinity. However, pressure, pH and salinity have negligible effect on ∫r; therefore, ∫r, can be approximately expressed as a function of temperature only. Comparisons of the predicted with the measured ( αλ)r and ∫r in different ocean areas are given.展开更多
Monitoring techniques are a key technology for examining the conditions in various scenarios, e.g., structural conditions, weather conditions, and disasters. In order to understand such scenarios, the appropriate extr...Monitoring techniques are a key technology for examining the conditions in various scenarios, e.g., structural conditions, weather conditions, and disasters. In order to understand such scenarios, the appropriate extraction of their features from observation data is important. This paper proposes a monitoring method that allows sound environments to be expressed as a sound pattern. To this end, the concept of synesthesia is exploited. That is, the keys, tones, and pitches of the monitored sound are expressed using the three elements of color, that is, the hue, saturation, and brightness, respectively. In this paper, it is assumed that the hue, saturation, and brightness can be detected from the chromagram, sonogram, and sound spectrogram, respectively, based on a previous synesthesia experiment. Then, the sound pattern can be drawn using color, yielding a “painted sound map.” The usefulness of the proposed monitoring technique is verified using environmental sound data observed at a galleria.展开更多
The performance of classic Mel-frequency cepstral coefficients (MFCC) is unsatisfactory in noisy environment with different sound sources from nature. In this paper, a classification approach of the ecological environ...The performance of classic Mel-frequency cepstral coefficients (MFCC) is unsatisfactory in noisy environment with different sound sources from nature. In this paper, a classification approach of the ecological environmental sounds using the double-level energy detection (DED) was presented. The DED was used to detect the existence of the sound signals under noise conditions. In addition, MFCC features from the frames which were detected the presence of the sound signals by DED were extracted. Experimental results show that the proposed technology has better noise immunity than classic MFCC, and also outperforms time-domain energy detection (TED) and frequency-domain energy detection (FED) respectively.展开更多
Since its initiation in 2008, the Beijing Forum on Human Rights has been bent on promoting our mutual understanding and expanding our common grounds. It has carried out very fruitful exchanges and researches around ma...Since its initiation in 2008, the Beijing Forum on Human Rights has been bent on promoting our mutual understanding and expanding our common grounds. It has carried out very fruitful exchanges and researches around many important questions concerning human rights development, and hence attained great achievements. It has played avery constructive role in promoting exchanges about and cooperation in human rights between China and other countries, becoming an important and influential platform for international dialogs over human rights.展开更多
基金the Taif University Researchers Supporting Project number(TURSP-2020/36),Taif University,Taif,Saudi Arabia.
文摘Environmental sound classification(ESC)involves the process of distinguishing an audio stream associated with numerous environmental sounds.Some common aspects such as the framework difference,overlapping of different sound events,and the presence of various sound sources during recording make the ESC task much more complicated and complex.This research is to propose a deep learning model to improve the recognition rate of environmental sounds and reduce the model training time under limited computation resources.In this research,the performance of transformer and convolutional neural networks(CNN)are investigated.Seven audio features,chromagram,Mel-spectrogram,tonnetz,Mel-Frequency Cepstral Coefficients(MFCCs),delta MFCCs,delta-delta MFCCs and spectral contrast,are extracted fromtheUrbanSound8K,ESC-50,and ESC-10,databases.Moreover,this research also employed three data enhancement methods,namely,white noise,pitch tuning,and time stretch to reduce the risk of overfitting issue due to the limited audio clips.The evaluation of various experiments demonstrates that the best performance was achieved by the proposed transformer model using seven audio features on enhanced database.For UrbanSound8K,ESC-50,and ESC-10,the highest attained accuracies are 0.98,0.94,and 0.97 respectively.The experimental results reveal that the proposed technique can achieve the best performance for ESC problems.
文摘Environmental sound classification (ESC) has gained increasing attention in recent years. This study focuses on the evaluation of the popular public dataset Urbansound8k (Us8k) at different sampling rates using hand crafted features. The Us8k dataset contains environment sounds recorded at various sampling rates, and previous ESC works have uniformly resampled the dataset. Some previous work converted this data to different sampling rates for various reasons. Some of them chose to convert the rest of the dataset to 44,100, as the majority of the Us8k files were already at that sampling rate. On the other hand, some researchers down sampled the dataset to 8000, as it reduced computational complexity, while others resampled it to 16,000, aiming to achieve a balance between higher classification accuracy and lower computational complexity. In this research, we assessed the performance of ESC tasks using sampling rates of 8000 Hz, 16,000 Hz, and 44,100 Hz by extracting the hand crafted features Mel frequency cepstral coefficient (MFCC), gamma tone cepstral coefficients (GTCC), and Mel Spectrogram (MelSpec). The results indicated that there was no significant difference in the classification accuracy among the three tested sampling rates.
文摘Noise pollution is one of the common physical harmful factors in many work environments.The current study aimed to assess personal and environmental sound pressure level and project the sound-Isosonic map in one of the Razavi Khorasan Paste manufacture using Surfer V.14 and Noise at work V.5.0.This cross-sectional,descrip-tive study is analytical that was conducted in 2018 in the Paste factory that contains Canister,production and Brewing unit.Following ISO 9612:2009,Casella Cel-320 was used to measure personal sound pressure level,while CEL-450 sound level meter(manufactured by Casella-Cel,the UK)was employed to assess environmental sound pressure level.Statistical analyzes was done using SPSS V.18 and Linear Regression test.The sound-isosonic maps were projected using Surfer V.14 and Noise at work V.5.0.The results of assessing personal sound pressure level indicated that the highest received dose(172.21%)and personal equivalent sound level(87.36 dBA)were recorded for workers in the Canister unit.According to results of measuring of the environmental sound pressure level,out of 16 measurement stations in this unit,overall 87.5%were regarded as danger and caution areas.The lowest and highest sound pressure levels in this units were 61 dBA and 92 dBA that belong to Brewing and Canister units respectively.Results indicate Over 75%of the Canister and production units had a sound pressure level greater than 85 dBA and these two units were regarded as the most dangerous area in terms of noise pollution.It is there-fore necessary to implement noise control measures,apply hearing protection program and auditory tests among workers in these units.
基金Sponsored by the People's Republic of China Ministry of Education Foundation for PhD Bases (Grant No.20112302110045)the Training Program of Young Talent of Heilongjiang University of Science and Techonology
文摘In this study,the crowd has been investigated and analyzed in waiting areas in large general hospitals in China in order to find the rules the variations of sound environment with the change of crowd. The field investigation,questionnaire,field-testing and computer simulation have been adopted. The results show that: the social /demographic characteristics of staff and patients are not significantly related to the satisfaction evaluation of sound environment; there is a significant correlation between the population density and LAeq of the background noise in waiting areas; when population density is 0,the LAeq of background noise is not 0 in waiting areas; the loudspeaker should be set in the waiting areas. Loudspeaker arrangements should be integrated into the ceiling lamp or construct facilities along the depth direction of the layout,and the two adjacent speakers recommended distance should be controlled at about 4 m. If the population density is controlled in the reasonable range,and sound absorption,noise reduction processing and electronic queuing system are adopted,sound environment of waiting areas will be built with noise interference relatively small in different population densities.
基金This work is supported by National Natural Science Foundation of China
文摘By using the expressions for the maximum absorption per wavelength (αλ),and the relaxation frequency fr of the boric acid relaxation derived previously by the author and employing the related oceanographic literatures, the effects of pressure, temperature, pH and salinity on (αλ)r and ∫r of the boric acid relaxation in sea water have been estimated. Results show that ( αλ), not only increases with pH but also increases approximately linearly with pressure and temperature, and is nearly proportional to the 1. 35 power of salinity. However, pressure, pH and salinity have negligible effect on ∫r; therefore, ∫r, can be approximately expressed as a function of temperature only. Comparisons of the predicted with the measured ( αλ)r and ∫r in different ocean areas are given.
文摘Monitoring techniques are a key technology for examining the conditions in various scenarios, e.g., structural conditions, weather conditions, and disasters. In order to understand such scenarios, the appropriate extraction of their features from observation data is important. This paper proposes a monitoring method that allows sound environments to be expressed as a sound pattern. To this end, the concept of synesthesia is exploited. That is, the keys, tones, and pitches of the monitored sound are expressed using the three elements of color, that is, the hue, saturation, and brightness, respectively. In this paper, it is assumed that the hue, saturation, and brightness can be detected from the chromagram, sonogram, and sound spectrogram, respectively, based on a previous synesthesia experiment. Then, the sound pattern can be drawn using color, yielding a “painted sound map.” The usefulness of the proposed monitoring technique is verified using environmental sound data observed at a galleria.
文摘The performance of classic Mel-frequency cepstral coefficients (MFCC) is unsatisfactory in noisy environment with different sound sources from nature. In this paper, a classification approach of the ecological environmental sounds using the double-level energy detection (DED) was presented. The DED was used to detect the existence of the sound signals under noise conditions. In addition, MFCC features from the frames which were detected the presence of the sound signals by DED were extracted. Experimental results show that the proposed technology has better noise immunity than classic MFCC, and also outperforms time-domain energy detection (TED) and frequency-domain energy detection (FED) respectively.
文摘Since its initiation in 2008, the Beijing Forum on Human Rights has been bent on promoting our mutual understanding and expanding our common grounds. It has carried out very fruitful exchanges and researches around many important questions concerning human rights development, and hence attained great achievements. It has played avery constructive role in promoting exchanges about and cooperation in human rights between China and other countries, becoming an important and influential platform for international dialogs over human rights.