期刊文献+

基于声音定位和听觉掩蔽效应的语音分离研究 被引量:16

Speech Separation Based on Sound Localization and Auditory Masking Effect
下载PDF
导出
摘要 人耳具有在嘈杂环境中将感兴趣的语言信息提取出来的能力 ,而双耳听觉特性有助于这种能力的加强 .据此本文提出了一种基于声音定位和听觉掩蔽效应的混叠语音分离方法 .根据声音到达双耳的时间差和强度差在时频域内确定相应的掩蔽系数 ,该系数是二值的 ,以直接去除干扰信号 ,保留有用信号并达到语音分离的目的 .实验表明 ,本文提出的方法是有效的 .该方法不仅适用于混叠语音为浊音情形 ,对清音的情况同样适用 ,因而比基于基音提取的语音分离方法的适用范围更广 . Human has the ability to attend to a single interested speech in a noised condition and this ability can be improved in the presence of binaural cues. In this paper a speech separation method is presented based on sound localization and auditory masking effect. By two important parameters-the interaural time differences (ITD) and interaural intensity differences (IID)-we estimate the binary masking coefficients in corresponding time-frequency regions. The coefficients are helpful of speech separation by holding interested signal and reducing noise signal. Experiments indicate that the approach described here is efficient not only for voiced speech but also for unvoiced speech and it has more extensive applications than pitch-based speech separation algorithms.
出处 《电子学报》 EI CAS CSCD 北大核心 2005年第1期158-160,共3页 Acta Electronica Sinica
基金 国家自然科学基金 (No 60 1 72 0 1 6)
关键词 双耳时间差 双耳强度差 声音定位 语音分离 掩蔽效应 Algorithms Audition Estimation Signal processing Speech intelligibility
  • 相关文献

参考文献11

  • 1D L Wang, G J Brown.Separation of speech from inlerfering sounds based on oscillatory correlation[J].IEEE Trans,1999,NN-10(3):684- 697.
  • 2G J Brown, M Cooke. Computational auditory scene analysis [J].Computer Speech and Language, 1994,8(24) :297 - 336.
  • 3D F Rosenthal, H G Okuno. Computational Auditory Scene Analysis[M]. Mahwah: Lawrence Erlbaum, 1998.
  • 4W Roman, D L Wang. Speech segregation based on sound localization[A]. Proc IJCNN[C]. Washington DC : IEEE,2001. 2861 - 2866.
  • 5A J W Kouwe, D L Wang. A Comparison of Auditory and Blind Separation Techniques for Speech Segregation[J]. IEEE Trans,2001,SAP-9(3) : 189 - 194.
  • 6W Gardner, K Martin. HRTF measurements of a KEMAR[J] .J Acoust Soc Am, 1995,97(6) :3901 - 3908.
  • 7R Patterson, et al. An efficient auditory filterbank based on the gammatone functions[R]. APU Report No. 2341, Cambrige, Applied Psychology Unit. 1988.
  • 8F Wightman, D Kistler. The dominant role of low-frequency interaural time differences in sound localization[J] .J Acoust Soc Am. 1992,91(3) : 1648 - 1660.
  • 9J Blauert. Spatial Hearing-The Psychophysics of Human Sound Localization[ M]. Cambridge: MIT Press, 1997.
  • 10M P Cooke, P Green, L Josifovski amt A Vizinho. Robust automatic speech recognition with missing and unreliable acoustic data [J].Speech Comm, 2001,34:264 - 285.

同被引文献185

引证文献16

二级引证文献53

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部