
Monaural Speech Segregation Based on Improved Pitch Tracking Method (cited by: 4)
Abstract: Speech segregation systems based on computational auditory scene analysis (CASA) process a mixed signal by modeling the human auditory perception system and separate out the target speech of interest. Such systems have developed considerably in recent years. Extracting and tracking pitch correctly in the presence of interfering noise has long been a central problem in CASA research. This paper proposes an improved pitch tracking algorithm based on the target speech source. The algorithm obtains the final pitch tracks by iterating between two steps: estimating the target units and detecting the pitch periods. Experiments comparing it with a conventional pitch tracking algorithm under various interference conditions show that the proposed algorithm effectively suppresses the interference and improves both the average output signal-to-noise ratio (SNR) and the quality of the segregated speech.
Source: Journal of East China University of Science and Technology (Natural Science Edition), 2013, No. 3, pp. 338-344 (7 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (60903186, 61271349)
Keywords: speech segregation; computational auditory scene analysis; target units estimation; pitch tracking
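The abstract describes the method only at a high level: alternate between estimating which units belong to the target and detecting the pitch, until the pitch track settles. The following is a minimal single-frame sketch of that kind of alternation, not the paper's actual algorithm. The function names `channel_autocorr` and `iterative_pitch`, the threshold `theta`, the lag range, and the use of a plain per-channel autocorrelation in place of a full auditory front end are all assumptions made for illustration.

```python
import numpy as np


def channel_autocorr(frame, max_lag):
    """Normalized autocorrelation of each filterbank channel for one frame.

    frame: array of shape (channels, samples).
    Returns an array of shape (channels, max_lag + 1) with value 1 at lag 0.
    """
    n = frame.shape[1]
    acf = np.empty((frame.shape[0], max_lag + 1))
    for lag in range(max_lag + 1):
        acf[:, lag] = np.sum(frame[:, : n - lag] * frame[:, lag:], axis=1)
    return acf / np.maximum(acf[:, :1], 1e-12)  # divide by lag-0 energy


def iterative_pitch(acf, min_lag, max_lag, theta=0.5, max_iter=10):
    """Alternate pitch detection and target-channel estimation for one frame.

    theta is a hypothetical periodicity threshold, not a value from the paper.
    Returns (pitch lag in samples, boolean mask of channels kept as target).
    """
    mask = np.ones(acf.shape[0], dtype=bool)  # start with every channel as target
    lag = min_lag
    for _ in range(max_iter):
        # Pitch detection: peak of the autocorrelation summed over target channels.
        summary = acf[mask].sum(axis=0)
        lag = min_lag + int(np.argmax(summary[min_lag : max_lag + 1]))
        # Target estimation: keep channels that are periodic at the detected lag.
        new_mask = acf[:, lag] >= theta
        if not new_mask.any() or np.array_equal(new_mask, mask):
            break  # converged, or no channel agrees with the current pitch
        mask = new_mask
    return lag, mask
```

In this sketch an interfering channel can pull the first pitch estimate away from the true period. Once that channel is dropped from the mask, the next pass re-estimates the pitch from the remaining channels only, which is the intuition behind iterating the two steps.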


