A three mass model of vocal cords as well as mathematical expression of the model are discussed. Different kinds of typical hoarse speech due to laryngeal diseases are simulated on microcomputer and the effects of di...A three mass model of vocal cords as well as mathematical expression of the model are discussed. Different kinds of typical hoarse speech due to laryngeal diseases are simulated on microcomputer and the effects of different pathological factors of vocal cords on model parameters are studied. Some typical spectrum distribution of the simulated speech signals are given. Moreover, hoarse speech signals of some typical cases are analyzed by the methods of digital signal processing, including FFT, LPC, Cepstrum technique, Pseudocolor encoding, etc. The experiment results show that the three mass model analysis of vocal cords is an efficient method for analysis of hoarse speech signals.展开更多
Tracking moving wideband sound sources is one of the most challenging issues in the acoustic array signal processing which is based on the direction of arrival(DOA) estimation. Compressive sensing(CS) is a recent theo...Tracking moving wideband sound sources is one of the most challenging issues in the acoustic array signal processing which is based on the direction of arrival(DOA) estimation. Compressive sensing(CS) is a recent theory exploring the signal sparsity representation, which has been proved to be superior for the DOA estimation. However, the spatial aliasing and the offset at endfire are the main obstacles for CS applied in the wideband DOA estimation. We propose a particle filter based compressive sensing method for tracking moving wideband sound sources. First, the initial DOA estimates are obtained by wideband CS algorithms. Then, the real sources are approximated by a set of particles with different weights assigned. The kernel density estimator is used as the likelihood function of particle filter. We present the results for both uniform and random linear array. Simulation results show that the spatial aliasing is disappeared and the offset at endfire is reduced. We show that the proposed method can achieve satisfactory tracking performance regardless of using uniform or random linear array.展开更多
In this work, a novel voice activity detection (VAD) algorithm that uses speech absence probability (SAP) based on Teager energy (TE) was proposed for speech enhancement. The proposed method employs local SAP (...In this work, a novel voice activity detection (VAD) algorithm that uses speech absence probability (SAP) based on Teager energy (TE) was proposed for speech enhancement. The proposed method employs local SAP (LSAP) based on the TE of noisy speech as a feature parameter for voice activity detection (VAD) in each frequency subband, rather than conventional LSAP. Results show that the TE operator can enhance the abiTity to discriminate speech and noise and further suppress noise components. Therefore, TE-based LSAP provides a better representation of LSAP, resulting in improved VAD for estimating noise power in a speech enhancement algorithm. In addition, the presented method utilizes TE-based global SAP (GSAP) derived in each frame as the weighting parameter for modifying the adopted TE operator and improving its performance. The proposed algorithm was evaluated by objective and subjective quality tests under various environments, and was shown to produce better results than the conventional method.展开更多
In this paper, we conduct research on the developmental trend of the computer music technology under the background of multichannelstereo recording. Surround sound is not only used in the fi lm, home theater, also pen...In this paper, we conduct research on the developmental trend of the computer music technology under the background of multichannelstereo recording. Surround sound is not only used in the fi lm, home theater, also penetrated the digital TV and digital broadcasting.But because of the limitation of storage media capacity and transmission bandwidth must use all kinds of that audio compression technologyto transmit audio data. As a more general and more practical signifi cance, as masking and by masking sound are not limited to their respectiveresend by a single speaker, but in different proportion to their respective ‘channel sound level is poor. To deal with this issue, this paperintegrates the signal denoising method as the optimization and achieves better performance.展开更多
In this paper, a viable way to fabricate Mg alloy sound ribbons with ultra-fine-grained microstructure was presented. The hot-rolled and annealed Mg-0.4Zn (at%) alloy exhibited excellent rollability to form sound ri...In this paper, a viable way to fabricate Mg alloy sound ribbons with ultra-fine-grained microstructure was presented. The hot-rolled and annealed Mg-0.4Zn (at%) alloy exhibited excellent rollability to form sound ribbons with submicrometer grains when subjected to one-pass cold rolling process. The more balanced multi-mode dislocation slips originated from the significant decrease of critical resolved shear stress for non-basal slip with the addition of solute Zn and the favorable crystallographic orientation were suggested to be responsible for the excellent cold rollability. The formation of ultra-fine-grained microstructure was attributed to low-temperature dynamic recrystallization occurring during the cold rolling process with large strain.展开更多
Vocal individuality is widespread in social animals. Individual variation in vocalizations is a prereq- uisite for discriminating among conspecifics and may have facilitated the evolution of large complex societies. R...Vocal individuality is widespread in social animals. Individual variation in vocalizations is a prereq- uisite for discriminating among conspecifics and may have facilitated the evolution of large complex societies. Ring-tailed lemurs Lemur catta live in relatively large social groups, have con- spicuous vocal repertoires, and their species-specific utterances can be interpreted in light of source-filter theory of vocal production. Indeed, their utterances allow individual discrimination and even recognition thanks to the resonance frequencies of the vocal tract. The purpose of this study is to determine which distinctive vocal features can be derived from the morphology of the upper vocal tract. To accomplish this, we built computational models derived from anatomical measurements collected on lemur cadavers and compared the results with the spectrographic out- put of vocalizations recorded from ex situ live individuals. Our results demonstrate that the mor- phological variation of the ring-tailed lemur vocal tract explains individual distinctiveness of their species-specific utterances. We also provide further evidence that vocal tract modeling is a power- ful tool for studying the vocal output of non-human primates.展开更多
Detecting/sensing targets underwater has very important applications in environmental study, civil engineering and national security. In this paper, an organic-film based triboelectric nanogenerator (TENG) has been ...Detecting/sensing targets underwater has very important applications in environmental study, civil engineering and national security. In this paper, an organic-film based triboelectric nanogenerator (TENG) has been successfully demonstrated for the first time as a self-powered and high sensitivity acoustic sensor to detect underwater targets at low frequencies around 100 Hz. This innovative, cost-effective, simple-design TENG consists of a thin-film-based Cu electrode and a polytetrafluoroethylene (PTFE) film with nanostructures on its surfaces. On the basis of the coupling effect between triboelectrification and electrostatic induction, the sensor generates electrical output signals in response to incident sound waves. Operating at a resonance frequency of 110 Hz, under an acoustic pressure of 144.2 dBspc, the maximum open-circuit voltage and short-circuit current of the generator can respectively reach 65 V and 32 ~A underwater. The directional dependence pattern has a bi-directional shape with a total response angle of 60~. Its sensitivity is higher than -185 dB in the frequency range from 30 Hz to 200 Hz. The highest sensitivity is -146 dB at resonance frequency. The three-dimensional coordinates of an acoustic source were identified by four TENGs, self-powered active sensors, and the location of the acoustic source was determined with an error about 0.2 m. This study not only expands the application fields of TENGs from the atmosphere to water, but also shows the TENG is a promising acoustic source locator in underwater environments.展开更多
文摘A three mass model of vocal cords as well as mathematical expression of the model are discussed. Different kinds of typical hoarse speech due to laryngeal diseases are simulated on microcomputer and the effects of different pathological factors of vocal cords on model parameters are studied. Some typical spectrum distribution of the simulated speech signals are given. Moreover, hoarse speech signals of some typical cases are analyzed by the methods of digital signal processing, including FFT, LPC, Cepstrum technique, Pseudocolor encoding, etc. The experiment results show that the three mass model analysis of vocal cords is an efficient method for analysis of hoarse speech signals.
基金supported by the NFSC Grants 51375385 and 51675425Natural Science Basic Research Plan in Shaanxi Province of China Grants 2016JZ013
文摘Tracking moving wideband sound sources is one of the most challenging issues in the acoustic array signal processing which is based on the direction of arrival(DOA) estimation. Compressive sensing(CS) is a recent theory exploring the signal sparsity representation, which has been proved to be superior for the DOA estimation. However, the spatial aliasing and the offset at endfire are the main obstacles for CS applied in the wideband DOA estimation. We propose a particle filter based compressive sensing method for tracking moving wideband sound sources. First, the initial DOA estimates are obtained by wideband CS algorithms. Then, the real sources are approximated by a set of particles with different weights assigned. The kernel density estimator is used as the likelihood function of particle filter. We present the results for both uniform and random linear array. Simulation results show that the spatial aliasing is disappeared and the offset at endfire is reduced. We show that the proposed method can achieve satisfactory tracking performance regardless of using uniform or random linear array.
基金Project supported by Inha University Research GrantProject(10031764) supported by the Strategic Technology Development Program of Ministry of Knowledge Economy, Korea
文摘In this work, a novel voice activity detection (VAD) algorithm that uses speech absence probability (SAP) based on Teager energy (TE) was proposed for speech enhancement. The proposed method employs local SAP (LSAP) based on the TE of noisy speech as a feature parameter for voice activity detection (VAD) in each frequency subband, rather than conventional LSAP. Results show that the TE operator can enhance the abiTity to discriminate speech and noise and further suppress noise components. Therefore, TE-based LSAP provides a better representation of LSAP, resulting in improved VAD for estimating noise power in a speech enhancement algorithm. In addition, the presented method utilizes TE-based global SAP (GSAP) derived in each frame as the weighting parameter for modifying the adopted TE operator and improving its performance. The proposed algorithm was evaluated by objective and subjective quality tests under various environments, and was shown to produce better results than the conventional method.
文摘In this paper, we conduct research on the developmental trend of the computer music technology under the background of multichannelstereo recording. Surround sound is not only used in the fi lm, home theater, also penetrated the digital TV and digital broadcasting.But because of the limitation of storage media capacity and transmission bandwidth must use all kinds of that audio compression technologyto transmit audio data. As a more general and more practical signifi cance, as masking and by masking sound are not limited to their respectiveresend by a single speaker, but in different proportion to their respective ‘channel sound level is poor. To deal with this issue, this paperintegrates the signal denoising method as the optimization and achieves better performance.
基金supported by the National Natural Science Foundation of China (51171120)
文摘In this paper, a viable way to fabricate Mg alloy sound ribbons with ultra-fine-grained microstructure was presented. The hot-rolled and annealed Mg-0.4Zn (at%) alloy exhibited excellent rollability to form sound ribbons with submicrometer grains when subjected to one-pass cold rolling process. The more balanced multi-mode dislocation slips originated from the significant decrease of critical resolved shear stress for non-basal slip with the addition of solute Zn and the favorable crystallographic orientation were suggested to be responsible for the excellent cold rollability. The formation of ultra-fine-grained microstructure was attributed to low-temperature dynamic recrystallization occurring during the cold rolling process with large strain.
文摘Vocal individuality is widespread in social animals. Individual variation in vocalizations is a prereq- uisite for discriminating among conspecifics and may have facilitated the evolution of large complex societies. Ring-tailed lemurs Lemur catta live in relatively large social groups, have con- spicuous vocal repertoires, and their species-specific utterances can be interpreted in light of source-filter theory of vocal production. Indeed, their utterances allow individual discrimination and even recognition thanks to the resonance frequencies of the vocal tract. The purpose of this study is to determine which distinctive vocal features can be derived from the morphology of the upper vocal tract. To accomplish this, we built computational models derived from anatomical measurements collected on lemur cadavers and compared the results with the spectrographic out- put of vocalizations recorded from ex situ live individuals. Our results demonstrate that the mor- phological variation of the ring-tailed lemur vocal tract explains individual distinctiveness of their species-specific utterances. We also provide further evidence that vocal tract modeling is a power- ful tool for studying the vocal output of non-human primates.
文摘Detecting/sensing targets underwater has very important applications in environmental study, civil engineering and national security. In this paper, an organic-film based triboelectric nanogenerator (TENG) has been successfully demonstrated for the first time as a self-powered and high sensitivity acoustic sensor to detect underwater targets at low frequencies around 100 Hz. This innovative, cost-effective, simple-design TENG consists of a thin-film-based Cu electrode and a polytetrafluoroethylene (PTFE) film with nanostructures on its surfaces. On the basis of the coupling effect between triboelectrification and electrostatic induction, the sensor generates electrical output signals in response to incident sound waves. Operating at a resonance frequency of 110 Hz, under an acoustic pressure of 144.2 dBspc, the maximum open-circuit voltage and short-circuit current of the generator can respectively reach 65 V and 32 ~A underwater. The directional dependence pattern has a bi-directional shape with a total response angle of 60~. Its sensitivity is higher than -185 dB in the frequency range from 30 Hz to 200 Hz. The highest sensitivity is -146 dB at resonance frequency. The three-dimensional coordinates of an acoustic source were identified by four TENGs, self-powered active sensors, and the location of the acoustic source was determined with an error about 0.2 m. This study not only expands the application fields of TENGs from the atmosphere to water, but also shows the TENG is a promising acoustic source locator in underwater environments.