期刊文献+

基于多窗谱相关加权语音增强 被引量:7

Correlation Weighting Enhancement of Speech Based on Multitaper Spectrum
下载PDF
导出
摘要 与传统的周期谱图相比,多窗谱分析方法是一种低方差、高分辨率的谱分析方法,尤其适合非线性系统中强噪声背景下弱信号、时频演变信号的分析。基于多窗谱估计这种优点,提出多窗谱相关加权语音增强方法,先对噪声与含噪信号比(NNSR)进行估计,用基于NNSR的幅度谱减实现预语音增强,再用相关加权规则获得最终的增强语音。通过客观和主观测试表明,在相同的实验条件下,多窗谱相关加权算法能更好地抑制背景噪声和音乐噪声,同时也较好地保持了语音的可懂度和自然度。 Compared with the traditional periodogram,the Multitaper method of spectral analysis provides a means for spectral estimation with low variance and high resolution.It is particularly well-suitd for the diagnosis analysis of weak signals with a time-depended amplitude and frequency against a high-noise background.Because of the low variance feature of the multitaper spectrum,an algorithm of correlation weighting enhancement of speech based on multitaper spectrum is presented.The noise spectrum and the Noise to Noisy Signal Ratio(NNSR) are estimated based on the multitaper spectrum of the noisy signal,and the pre-enhanced speech is obtained by the spectral amplitude subtraction method,whose gain is a function of NNSR.The final enhanced speech is obtained by correlation weighting rule,and subjective and objective tests are made on this algorithm.The results show that this algorithm is very effective to reduce the background noise and music noise,moreover,ensures the intelligibility and naturalness of speech.
出处 《计算机仿真》 CSCD 北大核心 2011年第3期142-145,共4页 Computer Simulation
关键词 语音增强 多窗谱 相关加权 音乐噪声 Speech enhancement Multitaper spectrum Correlation weighting Music noise
  • 相关文献

参考文献11

  • 1D J Thomson. Spectrum estimation and harmonic analysis Proc [J]. IEEE, 1982,70(9):1055-1096.
  • 2J Park. Envelope estimation for quasiperiodic geophy-sical signals in noise[ M]. In:A Multitaper Approach in Statistics in the Environmental and Earth Sciences London, Edward A mold Press, 1992. 189-219.
  • 3M E Mann, J Park. Spatial correlations of vari-ation in global surface temperatures [ M ]. Geophys, ResLet, 1993,20 : 1055-1058.
  • 4Y Hu, P C Loizou. Incorporating a psychoacoustical model in frequency domain speech enhancement [ J ]. IEEE Signal Processing letters,2004,11 (2) :270-273.
  • 5O Cappe. Elimination of the Musical Noise Phenomenon with the Ephraim and Malah Noise Suppressor[J]. IEEE Trans. on Speech and Audio Processing, 1994,2(2) : 345-349.
  • 6M Berouti, R Schwartz, J Markhoul. Enhancement of Speech Corrupted by Acoustic Noise [ J]. IEEE Trans , onAcoustics Speech, and Signal processing, 1979,4 : 208-211.
  • 7武鹏鹏,赵刚,邹明.基于多窗谱估计的改进谱减法[J].现代电子技术,2008,31(12):150-152. 被引量:20
  • 8D Slepian. Some Comments on Fourier Analysis[ M]. Uncertainty, and Modeling Review, 1983,25:379-393.
  • 9D Slepian, H O Pollak. Prolate Spheroidal Wave Functions Fourier Analysis and Uncertainty I[ J]. Bell System Tech, 1961,40: 43 -64.
  • 10D G Manolakis, V K Lngle, S M Kogon. Statistical and adaptive signal processing[ M ]. 北京:清华大学出版社, 2003. 246 - 255.

二级参考文献25

  • 1卜凡亮,王为民,戴启军,陈砚圃.基于噪声被掩蔽概率的优化语音增强方法[J].电子与信息学报,2005,27(5):753-756. 被引量:16
  • 2陶智,赵鹤鸣,龚呈卉.基于听觉掩蔽效应和Bark子波变换的语音增强[J].声学学报,2005,30(4):367-372. 被引量:39
  • 3吴红卫,吴镇扬,赵力.基于多窗谱的心理声学语音增强[J].声学学报,2007,32(3):275-281. 被引量:12
  • 4Thomson D J. Spectrum estimation and harmonic analysis. Proc. IEEE, 1982; 70(9): 1055--1096
  • 5Hu Y, Loizou P C. Incorporating a psychoacoustical model in frequency domain speech enhancement. IEEE Signal Processing letters, 2004; 11(2): 270--273
  • 6Cappe O. Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor. IEEE Trans. on Speech and Audio Processing, 1994; 2(2): 345-- 349
  • 7Virag N. Single channel speech enhancement based on masking properties of the human auditory system. IEEE Trans. Speech and Audio Processing, 1999; 7(2): 126--137
  • 8Gustafsson S, Jax P, Vary P. A novel psychoacoustically motivated audio enhancement algorithm preserving background noise characteristics. In: Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing, 1998:397--400
  • 9Johnston J D. Transform coding of audio signal using perceptual noise criteria. IEEE J. Select. Areas Commun., 1988; 6(2): 314--323
  • 10Manolakis D G, Lngle V K, Kogon S M. Statistical and adaptive signal processing. 北京:清华大学出版社, 2003: 246-255

共引文献30

同被引文献65

  • 1华强,夏哲雷,祝剑英.一种改进的变步长LMS自适应滤波算法及其仿真[J].中国计量学院学报,2012,23(3):304-308. 被引量:8
  • 2肖述才,王作英.端点检测中的一种新的对数能量特征[J].电声技术,2004,28(6):37-41. 被引量:12
  • 3闫润强,朱贻盛.基于信号递归度分析的语音端点检测方法[J].通信学报,2007,28(1):35-39. 被引量:8
  • 4武薇,范影乐,庞全.基于广义维数距离的语音端点检测方法[J].电子与信息学报,2007,29(2):465-468. 被引量:11
  • 5Loizou P C, Ma J. Extending the articulation index to account for non-linear distortions introduced by noise suppression algorithms[J]. The Journal of the Acous-tical Society of America, 2011,130(2) 986-995.
  • 6Upadhyay N, Karmakar A. A perceptually motivated multi-band spectral subtraction algorithm for enhance- ment of degraded speech[C]//Computer and Commu- nication Technology (ICCCT), 2012 Third Interna- tional Conference on. India, IEEE, 2012 : 340-345.
  • 7Ping W. An improved spectral subtraction algorithm based on auditory masking in voice human-computer interaction[C]//Mechatronics and Automation (IC- MA), 2010 International Conference on. China..Xi'an, IEEE, 2010: 1938-1941.
  • 8Cao L, Zhang T, Gao H, et al. Multi-band spectral subtraction method combined with auditory masking properties for speech enhancement [C]//Image and Signal Processing (CISP), 2012 5th International Con- gress on. India: Guwahati, IEEE, 2012 : 72-76.
  • 9Lu Yang, Loizou P C. A geometric approach to spec- tral subtraction[J]. Speech communication, 2008, 50 (6) : 453-466.
  • 10Martin R. Noise power spectral density estimation based on optimal smoothing and minimum statistics[J]. Speech and Audio Processing, IEEE Transactions on, 2001, 9 (5) : 504-512.

引证文献7

二级引证文献32

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部