4 articles found
Speech Enhancement Algorithm Based on MMSE Short Time Spectral Amplitude in Whispered Speech (cited by: 1)
1
Authors: Zhi-Heng Lu, Huai-Zong Shao, Tai-Liang Ju. Journal of Electronic Science and Technology of China, 2009, Issue 2, pp. 115-118 (4 pages)
An improved method based on minimum mean square error short-time spectral amplitude (MMSE-STSA) estimation is proposed to cancel background noise in whispered speech. Using the acoustic characteristics of whispered speech, the algorithm can effectively track changes in non-stationary background noise. Compared with the original MMSE-STSA algorithm and the method used in the Selectable Mode Vocoder (SMV), the improved algorithm further suppresses residual noise at low signal-to-noise ratio (SNR) while avoiding excessive suppression. Simulations show that in non-stationary noise environments the proposed algorithm not only achieves better enhancement performance but also reduces speech distortion.
Keywords: minimum mean square error short-time spectral amplitude (MMSE-STSA); speech enhancement; whispered speech
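For orientation, the baseline MMSE-STSA estimator that the abstract above builds on is commonly attributed to Ephraim and Malah. The sketch below shows only that textbook spectral gain, not the authors' improved whispered-speech variant, whose noise-tracking rule the abstract does not describe. The function names and the series-based Bessel helper are illustrative assumptions.

```python
import math

def bessel_i(order, x, terms=80):
    """Modified Bessel function of the first kind (order 0 or 1) via its power series.

    Assumption: 80 terms is enough for the moderate arguments used here
    (x up to about 50); larger arguments would need a scaled implementation.
    """
    total = 0.0
    term = (x / 2.0) ** order / math.factorial(order)  # k = 0 term of the series
    for k in range(terms):
        total += term
        # Ratio of consecutive series terms: (x^2 / 4) / ((k + 1) (k + 1 + order))
        term *= (x * x / 4.0) / ((k + 1) * (k + 1 + order))
    return total

def mmse_stsa_gain(xi, gamma):
    """Textbook MMSE-STSA gain for a priori SNR xi and a posteriori SNR gamma.

    Both SNRs are linear (not dB) and must be positive.
    """
    v = xi / (1.0 + xi) * gamma
    bracket = (1.0 + v) * bessel_i(0, v / 2.0) + v * bessel_i(1, v / 2.0)
    return (math.sqrt(math.pi) / 2.0) * (math.sqrt(v) / gamma) * math.exp(-v / 2.0) * bracket

# Hypothetical usage: scale one noisy spectral magnitude by the gain.
noisy_magnitude = 0.8
enhanced_magnitude = mmse_stsa_gain(xi=2.0, gamma=3.0) * noisy_magnitude
```

At high SNR this gain approaches the Wiener gain xi / (1 + xi), which is a convenient sanity check for any implementation.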
Conversion from whispered speech to normal speech using the extended bilinear transformation method (cited by: 1)
2
Authors: TAO Zhi, ZHAO Heming, TAN Xuedan, GU Jihua, ZHANG Xiaojun, WU Di. Chinese Journal of Acoustics, 2013, Issue 4, pp. 425-438 (14 pages)
A method for converting whispered speech to normal speech using an extended bilinear transformation is proposed. Because the formants of whispered speech deviate by different amounts in different frequency bands, the spectrum of the whispered speech is processed in separate partitions. On the basis of this partitioned spectrum, a conversion function is established that converts whispered speech to normal speech. Because the offset of whispered speech relative to normal speech is non-linear, an expansion factor is introduced into the bilinear transform function so that it corresponds more closely to the actual demands of whisper-to-normal conversion. This factor accounts for both the non-linear shift of the spectrum and the compression of the formant bandwidth, thus effectively reducing the spectral distortion distance in the conversion. The experimental results show that the proposed conversion effectively improves both the sound quality and the intelligibility of whispered speech.
Keywords: LSP; conversion from whispered speech to normal speech using the extended bilinear transformation method
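The abstract above does not give the extended transform or its expansion factor, so neither is reproduced here. As background only, the following is a minimal sketch of the standard first-order all-pass (bilinear) frequency warping that such methods start from. The parameter name `alpha`, the helper `warp_hz`, and the sample values are assumptions for illustration.

```python
import math

def bilinear_warp(omega, alpha):
    """Standard bilinear (first-order all-pass) frequency warping.

    omega: normalized angular frequency in [0, pi].
    alpha: warping parameter with |alpha| < 1; alpha = 0 leaves omega unchanged.
    This is the plain transform, NOT the paper's extended version.
    """
    return omega + 2.0 * math.atan2(alpha * math.sin(omega),
                                    1.0 - alpha * math.cos(omega))

def warp_hz(freq_hz, sample_rate, alpha):
    """Convenience wrapper (hypothetical) that warps a frequency given in Hz."""
    omega = 2.0 * math.pi * freq_hz / sample_rate
    return bilinear_warp(omega, alpha) * sample_rate / (2.0 * math.pi)

# Hypothetical usage: see how a few formant-like frequencies move at 16 kHz.
for f in (500.0, 1500.0, 2500.0):
    print(f, warp_hz(f, 16000.0, 0.1))
```

The warp fixes the endpoints 0 and pi and is inverted by applying it again with `-alpha`, so a single parameter shifts frequencies smoothly without leaving the band.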
Research of whispered speech vocal tract system conversion based on universal background model and effective Gaussian components (cited by: 1)
3
Authors: CHEN Xueqin, ZHAO Heming. Chinese Journal of Acoustics, 2013, Issue 4, pp. 400-410 (11 pages)
To address the weaknesses of existing fixed-value mapping methods (method_F), a vocal tract system conversion method based on the universal background model (UBM) is proposed to improve the performance of a system that converts Chinese whispered speech to normal speech. Because the UBM has numerous components, the errors produced by the acoustic probability density statistical model cannot be ignored. An effective Gaussian mixture component selection method, based on the posterior probability summation of the minimum spectral distortion, is therefore developed to optimize system performance. The proposed method (method_U) is analyzed and compared using a performance index (PI) based on the Itakura-Saito spectral distortion measure. Experiments show that method_U is more stable than method_F across different speakers and different phonemes, and that its average PI is better. Selecting effective Gaussian mixture components further improves the PI of method_U by 5.11%. Subjective listening tests also show that the proposed method improves the clarity and intelligibility of the converted speech.
Keywords: research of whispered speech vocal tract system conversion based on universal background model and effective Gaussian components; UBM
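The abstract above evaluates conversion quality with a performance index built on the Itakura-Saito spectral distortion measure, but it does not define the index itself. The sketch below therefore shows only the standard Itakura-Saito distortion between two power spectra. The function name and the sample spectra are illustrative assumptions.

```python
import math

def itakura_saito(p_ref, p_test):
    """Mean Itakura-Saito distortion between two power spectra.

    p_ref, p_test: equal-length sequences of strictly positive power values.
    The measure is zero for identical spectra and is not symmetric.
    """
    if len(p_ref) != len(p_test) or not p_ref:
        raise ValueError("spectra must be non-empty and of equal length")
    total = 0.0
    for p, q in zip(p_ref, p_test):
        ratio = p / q
        total += ratio - math.log(ratio) - 1.0
    return total / len(p_ref)

# Hypothetical usage: compare a target spectrum with a converted one.
target = [1.0, 4.0, 2.0, 0.5]
converted = [1.2, 3.5, 2.1, 0.4]
distortion = itakura_saito(target, converted)
```

Because the measure is asymmetric, the order of the arguments matters; which spectrum the paper treats as the reference is not stated in the abstract.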
A method of whispered speech enhancement based on speech absence probability and modified mel-domain masking model
4
Authors: TAO Zhi (1, 2), ZHAO Heming (2), WU Di (1), CHEN Daqing (1), ZHANG Xiaojun (1). Affiliations: (1) School of Physical Science and Technology, Soochow University, Suzhou 215006; (2) School of Electronics and Information Engineering, Soochow University, Suzhou 215006. Chinese Journal of Acoustics, 2011, Issue 3, pp. 345-357 (13 pages)
A whispered speech enhancement method using an auditory masking model in a modified Mel domain together with Speech Absence Probability (SAP) is proposed. In light of the phonation characteristics of whisper, the Mel-frequency scaling model is modified, and the whispered speech is filtered by the modified model. Meanwhile, the masking threshold for each frequency band is determined dynamically from the speech absence probability. Whispered speech enhancement is then carried out by adaptively adjusting the spectral subtraction coefficients according to the different masking threshold values. Objective and subjective tests on the enhanced whispered signal show that, compared with other methods, the proposed method yields better subjective auditory quality and less distortion by reducing the musical noise and background noise to below the masking threshold.
Keywords: a method of whispered speech enhancement based on speech absence probability and modified mel-domain masking model; Mel
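The abstract above describes spectral subtraction whose coefficients adapt to a per-band masking threshold. It does not give the mapping from threshold to coefficient, the modified Mel filter bank, or the SAP estimator, so the sketch below takes the thresholds as given inputs and assumes a simple linear mapping: bands with a low threshold are subtracted more aggressively. All names and default values (`alpha_min`, `alpha_max`, `beta`) are hypothetical.

```python
def adaptive_alpha(threshold, t_min, t_max, alpha_min=1.0, alpha_max=6.0):
    """Map a masking threshold to an over-subtraction factor (assumed linear rule).

    A high threshold means noise is already masked, so less is subtracted.
    """
    if t_max <= t_min:
        return alpha_max  # Degenerate case: all thresholds equal.
    frac = (threshold - t_min) / (t_max - t_min)
    frac = min(1.0, max(0.0, frac))
    return alpha_max - frac * (alpha_max - alpha_min)

def spectral_subtract(noisy_power, noise_power, thresholds, beta=0.01):
    """Power spectral subtraction with a per-band factor and a spectral floor.

    noisy_power, noise_power, thresholds: equal-length per-band sequences.
    beta: floor factor that keeps each output at or above beta * noise power.
    """
    t_min, t_max = min(thresholds), max(thresholds)
    enhanced = []
    for y, n, t in zip(noisy_power, noise_power, thresholds):
        alpha = adaptive_alpha(t, t_min, t_max)
        enhanced.append(max(y - alpha * n, beta * n))
    return enhanced

# Hypothetical usage with three bands.
print(spectral_subtract([10.0, 8.0, 5.0], [1.0, 1.0, 1.0], [0.2, 0.6, 1.0]))
```

The floor term is what limits the isolated spectral peaks heard as musical noise; in the paper this role is tied to the masking threshold, which this simplified sketch does not model.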