Close-talk speech enhancement algorithm based on auditory scene analysis (CASA)

Cited by: 1
Abstract: A two-microphone speech enhancement algorithm is proposed for close-talk scenarios. The algorithm uses the interaural time difference (ITD) and the interaural intensity difference (IID) as cues to identify the target speech, and exploits the auditory masking effect to separate the target speech from background noise. Compared with conventional single-microphone enhancement methods, it suppresses more types of noise while causing less speech distortion. Experiments show that, for noisy speech corrupted by white noise, music noise, or broadcast noise at input SNRs of 8 to 33 dB, the algorithm raises the SNR to above 36 dB while keeping the target speech highly intelligible. It also performs well on speech corrupted by impulsive noise.
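The abstract describes the method only at a high level. As a rough illustration of how inter-channel time and level cues can drive a time-frequency mask, the Python sketch below builds a binary mask from two microphone signals. It is not the authors' implementation: the STFT front end (used here in place of an auditory filterbank), the function names `stft`, `close_talk_mask`, and `demo`, the 6 dB and 100 µs thresholds, and the synthetic test signals are all assumptions chosen for illustration.

```python
import numpy as np

def stft(x, frame=256, hop=128):
    """Hann-windowed short-time Fourier transform, shape (frames, bins)."""
    window = np.hanning(frame)
    n_frames = 1 + (len(x) - frame) // hop
    frames = np.stack([x[i * hop:i * hop + frame] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def close_talk_mask(near, far, fs=16000, iid_threshold_db=6.0,
                    max_delay_s=1e-4):
    """Binary mask that keeps T-F units whose cues look like close-talk speech.

    near, far: STFTs of the microphone near the mouth and the one farther away.
    The thresholds are illustrative assumptions, not values from the paper.
    """
    eps = 1e-12
    frame = 2 * (near.shape[1] - 1)          # frame length implied by rfft size
    # Level cue (IID): the near microphone is much louder for the talker.
    iid_db = 20 * np.log10((np.abs(near) + eps) / (np.abs(far) + eps))
    # Time cue (ITD): inter-channel phase difference converted to a delay.
    # Phase wraps at high frequencies, so this estimate is ambiguous there.
    freqs = np.fft.rfftfreq(frame, d=1.0 / fs)
    phase = np.angle(near * np.conj(far))
    itd = np.zeros_like(phase)
    itd[:, 1:] = phase[:, 1:] / (2 * np.pi * freqs[1:])
    return (iid_db > iid_threshold_db) & (np.abs(itd) < max_delay_s)

def demo(fs=16000):
    """Synthetic example: 500 Hz 'talker' plus a 3 kHz interferer."""
    t = np.arange(fs) / fs
    target = np.sin(2 * np.pi * 500 * t)     # stands in for close-talk speech
    noise = np.sin(2 * np.pi * 3000 * t)     # stands in for a distant source
    near_sig = target + noise
    # Talker is attenuated at the far mic; the interferer arrives at equal
    # level but two samples (125 microseconds) later.
    far_sig = 0.3 * target + np.roll(noise, 2)
    near, far = stft(near_sig), stft(far_sig)
    mask = close_talk_mask(near, far, fs=fs)
    enhanced = near * mask                   # masked spectrum of the near mic
    return mask, enhanced

if __name__ == "__main__":
    mask, _ = demo()
    print("fraction of T-F units kept:", mask.mean())
```

In this toy setup the bin that holds the 500 Hz component passes both cues, while the bin that holds the 3 kHz interferer is rejected. A practical system would also need resynthesis (an inverse transform with overlap-add) and some smoothing of the mask, which are omitted here.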
Authors: HU Kui (胡奎), LIANG Weiqian (梁维谦)
Source: Journal of Tsinghua University (Science and Technology) (《清华大学学报(自然科学版)》), 2011, No. 9, pp. 1176-1179, 1186 (5 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: National "863" High-Tech Program of China (2008AA010700)
Keywords: speech enhancement; auditory scene analysis; interaural time difference (ITD); interaural intensity difference (IID); masking effect

References (8)

  • 1 Cherry E C. Some experiments on the recognition of speech, with one and with two ears [J]. J of ASA, 1953, 25: 975-979.
  • 2 Bregman A S. Auditory Scene Analysis [M]. Cambridge: MIT Press, 1990.
  • 3 Hansler E, Schmidt G. Topics in Acoustic Echo and Noise Control [M]. Berlin: Springer, 2006.
  • 4 Roman N, Wang D L, Brown G J. Speech segregation based on sound localization [J]. J of ASA, 2003, 114(4): 2236-2252.
  • 5 Meddis R. Simulation of mechanical to neural transduction in the auditory receptor [J]. J of ASA, 1988, 83(3): 1056-1063.
  • 6 Berouti M, Schwartz M, Makhoul J. Enhancement of speech corrupted by acoustic noise [J]. Proc IEEE Int Conf Acoust, Speech, Signal Process, 1979: 208-211.
  • 7 Scalart P, Filho J. Speech enhancement based on a priori signal to noise estimation [J]. Proc IEEE Int Conf Acoust, Speech, Signal Process, 1996: 629-632.
  • 8 Ephraim Y, Malah D. Speech enhancement using a minimum mean-square error log spectral amplitude estimator [J]. IEEE Trans Acoust, Speech, Signal Process, 1985, 33(2): 443-445.

Co-cited literature (14)

  • 1 JIN Zhaozhang, WANG Deliang. Reverberant speech segregation based on multipitch tracking and classification [J]. IEEE Trans Audio, Speech, and Language Processing, 2011, 19(8): 2328-2337.
  • 2 ZHANG Xiaojia, SHAO Yang, WANG Deliang. CASA-based robust speaker identification [J]. IEEE Trans Audio, Speech, and Language Processing, 2012, 20(5): 1608-1616.
  • 3 CHAO Ling, WANG Deliang, Jang R, et al. A tandem algorithm for singing pitch extraction and voice separation from music accompaniment [J]. IEEE Trans Audio, Speech, and Language Processing, 2012, 20(5): 1482-1491.
  • 4 HU Guoning, WANG Deliang. Auditory segmentation based on onset and offset analysis [J]. IEEE Trans Audio, Speech, and Language Processing, 2007, 15(2): 396-405.
  • 5 Martin C, Hershey J, Rennie S. Monaural speech separation and recognition challenge [J]. Computer Speech and Language, 2010, 24(1): 1-15.
  • 6 Sue H, Jon B, Grown B. Mask estimation for missing data speech recognition based on statistics of binaural interaction [J]. IEEE Trans Audio, Speech, and Language Processing, 2006, 14(1): 58-67.
  • 7 Nima Y, Philipos L. A dual microphone speech enhancement algorithm based on the coherence function [J]. IEEE Trans Audio, Speech, and Language Processing, 2012, 20(2): 599-609.
  • 8 Fathi K, Mondher F, Mohamed G, et al. Dual-channel spectral subtraction algorithms based speech enhancement dedicated to a bilateral cochlear implant [J]. Appl Acoust, 2012, 73(1): 12-20.
  • 9 WANG Deliang, Brown G. Computational Auditory Scene Analysis: Principles, Algorithms, and Applications [M]. New Jersey: IEEE Press, 2006.
  • 10 Young S, Evermann G, Gales M, et al. The HTK Book (HTK version 3.4.1) [M]. Cambridge: Cambridge University, 2009.

Citing literature (1)

Second-level citing literature (1)
