期刊文献+

基于ERB尺度划分的多子带语声信号抗噪谱减算法 被引量:1

A multi-band anti-noise spectral subtraction algorithm using ERB scale
下载PDF
导出
摘要 为了研究心理声学在语声增强方面的应用,本文提出了一种基于等效矩阵带宽(ERB)尺度划分的多子带语声信号抗噪谱减算法。此算法根据ERB尺度将带噪信号的频谱划分成多个子带,然后再根据每个子带的分段信噪比以及心理声学掩蔽原则分别计算每个子带的谱减参数,最后在每个子带中分别进行谱减算法处理。实验结果表明,应用新算法所获得的语声增强结果在信噪比、IS失真以及PESQ方面均优于之前提出的多子带语声信号抗噪谱减算法。 This paper addresses a multi-band spectral subtraction algorithm based on equivalent rectangular bandwidth(ERB)scale for applying psychoacoustics to speech enhancement.In the proposed algorithm,the whole spectrum of noisy speech is divided into multiple bands based on ERB scale.The subtraction parameters are then calculated according to the segment SNR of each band and psychoacoustics criteria.Finally,spectral subtraction with different subtraction parameters is executed in each band.The measurements of SNR improvement,IS distortion and PESQ show that the proposed algorithm outperforms the previous speech enhancement algorithms.
作者 周挺挺 曾毓敏 王蓉蓉 卞乐 ZHOU Tingting;ZENG Yumin;WANG Rongrong;BIAN Le(School of Physics and Technology, Nanjing Normal University, Nanjing 210000, China)
出处 《应用声学》 CSCD 北大核心 2017年第3期212-219,共8页 Journal of Applied Acoustics
基金 江苏省科技项目(BE2014139)
关键词 ERB尺度 心理声学掩蔽 多子带谱减 ERB scale Psychoacoustic masking Multi-band spectral subtra
  • 相关文献

参考文献2

二级参考文献13

  • 1唐娟,行鸿彦.基于二次相关的时延估计方法[J].计算机工程,2007,33(21):265-267. 被引量:48
  • 2RYAN J G, GOUBRAN R A. Application of near-field optimum microphone arrays to hands-free mobile tele- phone[J]. IEEE Transactions on Vehic-ular Technology, 2003, 52(2): 390-400.
  • 3KNAPP C H, CARTER G C. The generalized correla- tion method for estimation of time delay[J]. IEEE Trans. Acoust, Speech, Signal Processing, 1976, 24(8): 320-327.
  • 4WIDROW B, STEARNS D. Adaptive signal process- ing[M]. Englewood Cliffs: Prentice-Hall. Inc., 1993.
  • 5LU B, FENG C, LONG G. A new varible step-size LMS adaptive based on marr function[C]//IEEE, International Conference on. Information Technology and Applications (ITA), 2013: 214-217.
  • 6POURMOHAMMAD A, AHADI S M. N-dimensional N- microphone sound source localization[J]. EURASIP Jour- nal on Audio, Speech, and Music Processing, 2013, 201a(1): 27.
  • 7HUANG N E, SHEN Z, LONG S R. The empirical mode decomposition and the Hilbert spectrum for nonlinear and nonstationary time series analysis[J]. Proceedings of the Royal Society of London series A, 1998, 454(1971): 903-995.
  • 8TAMIM N S M, GHANI F. Hilbert transform of FFT pruned cross correlation function for optimization in time delay estimation[C]//IEEE 9th Malaysia International Conference on Communications (MICC), IEEE, 2009: 811-812.
  • 9ALLEN J B, BERKELY D A. Image method for efficiently simulating small room acoustics[J]. Journal of Acoustical Society of America, 1979, 65(4): 943-950.
  • 10International Audio Laboratories Erlangen. Rir generator [EB/OL]. [2015-07-21]. https://www.audiolabs-erlangen. de/fau/professor/habets/software/rir-generator.

共引文献15

同被引文献6

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部