期刊文献+

基于波束形成与最大似然估计的近距离双麦克风语音增强算法 被引量:3

Beamforming and maximum likelihood estimation for speech enhancement using dual closely-spaced microphones
原文传递
导出
摘要 为了解决波束形成法直接应用于近距离双麦克风系统时存在的问题,如目标信号低频段能量损失、多方向的竞争性语音噪声难以被有效抑制等,该文提出一种基于波束形成与最大似然估计的两步去噪方法。该方法首先使用加权叠加滤波器对混合声源进行时频分解,然后通过2个零点分别在0°和180°的波束形成图的幅频响应比值,设置各时频单元所对应的初步掩蔽值,在避免低频滚降现象出现的情况下,抑制本底噪声;最后根据统计模型和简化的最大似然估计法,抑制多方向的竞争性语音噪声,进一步增强目标信号。测试结果表明:在低信噪比、多种类型噪声源共同存在的情境下,该方法可以在无需低通滤波或宽带波束补偿的情况下,恢复原始信号的能量分布特点,明显提升信噪比。 Traditional beamforming systems using dual closely-spaced microphones have various problems such as low-frequency-roll-off and limitations in suppressing competitive speech noises from multiple directions. This paper presents a two-step beamforming and maximum likelihood estimation algorithm. The algorithm first uses a WOLA filter for the time-frequency analysis for the speech mixture and then sets mask values to suppress background noise without low-frequency-roll-off based on the ratio of the two beamforming patterns, which have zeros at 0°and 180°. A statistical model and the maximum likelihood estimation are then used to further enhance thespeech. Tests indicate that the algorithm effectively recovers the energy distribution of the target signal and improves the signal-to-noise ratio without a low-pass filter or broadband compensation when the signal-to-ratio is low or multiple kinds of noises exist.
作者 宫琴 郑硕 GONG Qin;ZHENG Shuo(Department of Biomedical Engineering, Tsinghua University, Beijing 100084, China;Research Center of Biomedical Engineering, Graduate School at Shenzhen, Tsinghua University Shenzhen 518055, China)
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2018年第6期603-608,共6页 Journal of Tsinghua University(Science and Technology)
基金 国家自然科学基金资助项目(61271133) 高等学校博士学科点专项科研基金资助项目(20120002110054) 清华信息科学与技术国家实验室(筹)学科交叉基金资助项目(042003172) 深圳市自然科学基金资助项目(JCYJ20140827160046745)
关键词 近距离双麦克风 波束形成 最大似然估计 低频滚降 多方向竞争性语音噪声 closely-spaced dual microphones beamforming maximum likelihood estimation low-frequency roll-off competitive speech noises from multiple directions
  • 相关文献

参考文献1

二级参考文献13

  • 1King Chung. Challenges and Recent Developments in Hearing Aids: Part I. Speech Understanding in Noise, Microphone Technologies and Noise Reduction Algorithms. Trends In Amplification, 2004, 8(3): 83-124.
  • 2R. E. Crochiere, L. R. Rabiner. Multirate Digital Signal Processing. NJ: Englewood Cliffs, Prentice-Hall, 1983: 289-341.
  • 3P. P. Vaidyanathan. Multrirate Systems and Filterbanks. NJ: Englewood Cliffs, Prentice-Hall, 1993: 286-392.
  • 4S.M. Ahadi, H. Sheikhzadeh, R.L. Brennan, et al. An Efficient Front-End for Automatic Speech Recognition. IEEE International Conference on Electronics, Circuits and Systems, 2003, 1: 128-131.
  • 5O. Romain,B. Denby. Prototype of a Software Defined Broadcast Media Indexing Engine. IEEE International Conference on Acoustics, Speech, and Signal Processing 2007, 2: 11813-11816.
  • 6Z. Cvetkovie, M. Vetterli. Oversampled Filter Banks. IEEE Transactions on signal processing, 1998, 46(5): 1245-1255.
  • 7M. R. Wilbur, T. N. Davidson, J. P. Reilly. Efficient Design of Oversampled NPR GDFT Filterbanks. IEEE Transactions on signal processing, 2004, 52(7): 1947-1963.
  • 8C. D. Creusere, S. K. Mitra. A Simple Method for Designing High-Quality Prototype Filters of M-band Pseudo QMF banks. IEEE Transactions on signal Processing, 1995, 43(4): 1005-1007.
  • 9Y. E Lin, E P. Vaidyanathan. A Kaiser Window Approach for the Design of Prototype Filters of Cosine Modulated Filterbanks. IEEE Signal Processing Letters, 1998, 5(6): 132-134.
  • 10D. Hermann, E. Chau, R. D. Dony, et al. Window Based Prototype Filter Design for Highly Oversampled Filter Banks in Audio Applications. IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007, 2: 11405-11408.

共引文献6

同被引文献40

引证文献3

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部