基于改进基音跟踪算法的单通道语音分离被引量：4

Monaural Speech Segregation Based on Improved Pitch Tracking Method

下载PDF

导出

摘要基于计算听觉场景分析(Computational Auditory Scene Analysis,CASA)的语音分离系统通过模拟人耳的听觉感知系统对混合信号进行处理并分离出感兴趣的目标语音,近年来得到了很大的发展。如何在干扰噪声存在的情况下进行正确的基音提取跟踪一直是CASA系统研究的重点。提出了一种基于目标语音源的改进基音跟踪算法。该算法通过对目标源估计和基音检测两个步骤的反复迭代计算,得到最终的基音轨迹。通过在不同噪声干扰条件下与传统基音跟踪算法对比的实验结果证明,该算法能够有效地抑制噪声,提高输出语音的信噪比和语音质量。 By means of the feature that humans can distinguish and track speech signal of interest under various noisy environments, the speech segregation system based on computational auditory scene analysis （CASA） has obtained considerable development in recent years. How to correctly pitch detection in noisy environment has been a challenge in CASA system. Hence, in this paper, an improved pitch tracking algorithm based on target source is proposed. By estimating the target units and detecting the pitch periods iteratively, the proposed algorithm obtains pitch tracks. By comparing with conventional pitch tracking method under various interferences, it is shown that the proposed algorithm can effectively suppress the interferences and improve the average output SNR and the quality of segregated speech.

作者王雨林家骏袁文浩陈宁

机构地区华东理工大学信息科学与工程学院

出处《华东理工大学学报（自然科学版）》 CAS CSCD 北大核心 2013年第3期338-344,共7页 Journal of East China University of Science and Technology

基金国家自然科学基金(60903186 61271349)

关键词语音分离计算听觉场景分析目标源估计基音跟踪 speech segregation computational auditory scene analysis target units estimation pitch tracking

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献14

1陈雪勤,赵鹤鸣,陈小平.基于计算听觉场景分析的强噪声背景下基音检测方法[J].电路与系统学报,2003,8(3):128-131. 被引量：5
2张学良,刘文举,李鹏,徐波.改进谐波组织规则的单通道浊语音分离系统[J].声学学报,2011,36(1):88-96. 被引量：7
3Wang Deliang, Brown G J. Computational Auditory Scene Analysis [M]. USA: IEEE Press ,2006.
4Hu Guoning, Wang Deliang. An Auditory Scene Analysis Approach to Monaural Speech Segregation[M]//Topics in Acoustic Echo and Noise Control. Berlin Heidelberg: Spring er, 2006 : 485-515.
5Hu Guoning, Wang Deliang. Monaural speech segregation based on pitch tracking and amplitude modulation [J]. IEEE Transactions on Neural Networks, 2004, 15(5):1135-1149.
6Hu Guoning. Monaural speech organization and segregation [D]. USA: The Ohio State University,2006.
7Meddis R. Simulation of auditory neural transduction: Fur- ther studies[J]. Journal of the Acoustical Society of America, 1988,88(3) :1056-1063.
8虞晓,胡光锐,崔玉红.基于CASA简化模型的语音增强算法[J].上海交通大学学报,2001,35(11):1635-1639. 被引量：3
9Wang Deliang, Hu Guoning. Unvoiced speech segregation [C]//IEEE International Conference on Acoustics, Speech and Signal Processing. USA: IEEE, 2006 : 953-956.
10Hu Guoning, Wang Deliang. Segregation of unvoiced speech from non speech interference [J]. Journal of the Acoustical Society of America, 2008, 124 : 1306-1319.

二级参考文献32

1Kadambe S, et al. Application of the wavelet transform for pitch detection of speech signals[J]. IEEE Trans. on IT, 1992, 38(2): 917-924.
2Jackson P, Shadle CH. Pitch-Scaled Estimation of Simultaneous Voiced and Turbulence Components in Speech[J]. IEEE Trans. on Speech and Audio Processing, 2001,9(7): 713-726.
3Brown G J, Cooke M. Computational auditory scene analysis[J]. Computer Speech and Language, 1994, 8: 297-336.
4Patterson R, et al. An efficient auditory filterbank based on the gammatone functions. SVOS final report, Part B:The auditory filter bank[R].APU report. 1998, 2341.
5Meddis R. Simulation of Mechanical to Neural Transduction in the Auditory Receptor[J]. JASA, 1986, 79(3): 702-711.
6虞晓，学位论文，1999年
7Cooke M,Hershey J R,Rennie S J.Monaural speech separation and recognition challenge. Computer Speech and Language . 2010
8Klapuri A.Auditory-model based methods for multiple fundamental frequency estimation. Signal Processing Methods for Music Transcription . 2006
9de Boer E,de Jongh H R.On cochlear encoding:potentialities and limitations of the reverse-correlation techniques. The Journal of The Acoustical Society of America . 1978
10Kohlrausch A,Fassel R,Dau T.The influence of carrier level and frequency on modulation and beat-detection thresholds for sinusoidal carriers. The Journal of The Acoustical Society of America . 2000

共引文献11

1赵彩华,刘琚,孙建德,闫华.基于小波变换和独立分量分析的含噪混叠语音盲分离[J].电子与信息学报,2006,28(9):1565-1568. 被引量：14
2胡连锋,夏秀渝,张佩,李志昌.一种改进的强噪声背景下基音检测算法[J].通信技术,2009,42(12):164-166. 被引量：2
3赵立恒,汪增福.基于谐波和能量特征的单声道浊语音分离方法[J].声学学报,2012,37(2):218-224. 被引量：3
4王雨,林家骏,袁文浩.基于计算听觉场景分析的语音增强改进算法[J].华东理工大学学报（自然科学版）,2012,38(5):617-621. 被引量：2
5王雨,林家骏,袁文浩,陈宁.基于计算听觉场景分析的改进清音分离方法[J].华东理工大学学报（自然科学版）,2014,40(2):212-217. 被引量：3
6屈俊玲,李鸿燕.基于计算听觉场景分析的混合语音信号分离算法研究[J].计算机应用研究,2014,31(12):3822-3824. 被引量：6
7李鸿燕,屈俊玲,张雪英.基于信号能量的浊语音盲信号分离算法[J].吉林大学学报（工学版）,2015,45(5):1665-1670. 被引量：2
8李然军,李辉,李冬冬.改进听觉组织方法的单声道浊语音分离[J].小型微型计算机系统,2016,37(3):637-640.
9杨登舟,刘加,夏善红.基于计算听觉场景分析的说话人转换检测[J].计算机工程,2018,44(2):316-321. 被引量：1
10唐伟,张二华,张丽娜.基于计算听觉场分析的单声道的双人语音浊音分离[J].计算机与数字工程,2021,49(4):704-710.

同被引文献24

1Hu Guo-ning, Wang De-liang. Monaural speech segregation based on pitch tracking and amplitude modulation [ J]. IEEE Transactions on Neural Networks,2004,15 (5) : 1135-1149.
2Wang De-liang, Brown G J. Computational auditory scene analysis: principles,algorithms,and applications [M]. USA:IEEE, Press ,2006.
3Hu Guo-ning, Wang De-liang. An auditory scene analysis approach to monaural speech segregation [ M]. Topics in Acoustic Echo and Noise Control. Berlin Heidelberg: Springer,2006:485-515.
4Meddis R. Simulation of auditory-neural transduction: further studies [J]. Journal of the Acoustical Society of America, 1988,88 (3) : 1056-1063.
5Tolonen T, Karjalalnen M. A computationally efficient multipitch a- nalysis model [ J ]. IEEE Transactions on Speech and Audio Pro- cessing, 2000,8 ( 6 ) :708 -716.
6Hu Guo-ning,Wang De-liang. Auditory segmentation based on on- set and offset analysis [ J ]. IEEE Transactions on Audio, Speech, and Language Processing,2007,15 (2) : 396-405.
7Hu Guo-ning, Wang De-liang. Segregation of unvoiced speech from non-speech interference [ J ]. Journal of the Acoustical Society of America,2008,124 : 1306-1319.
8Wang Yu, Lin Jia-jun, Yuan Wen-hao, et al. An improved unvoiced speech segregation based on computational auditory scene analysis [ J]. Journal of East China University of Science and Technology ( Natural Science Edition) ,2014,40 (2) :212-217.
9Zhao Li-heng. Monaural speech segregation based on computational auditory scene analysis [ D]. University of Science and Technology of China,2012.
10Bregman S. Auditory scene analysis [ M ]. MA: MIT Press, 1990.

引证文献4

1李然军,李辉,李冬冬.改进听觉组织方法的单声道浊语音分离[J].小型微型计算机系统,2016,37(3):637-640.
2蔡良,夏秀渝,陆雄,孙文慧.基于基音跟踪的语音增强研究[J].成都信息工程大学学报,2019,34(1):1-6.
3王凯龙,张二华,曹冠彬.基于计算听觉场景分析的单通道信噪分离方法[J].计算机与数字工程,2019,47(5):1049-1054. 被引量：1
4钱政.基于计算听觉场景分析的单声道语音分离研究[J].北京印刷学院学报,2020,28(S02):276-278.

二级引证文献1

1郑振峰.基于窗函数的弱电信号智能感知系统设计[J].电子设计工程,2020,28(10):27-31. 被引量：1

1黄姗姗,许钢,李远军.一种改进的MBE基音跟踪算法[J].安徽工程大学学报,2012,27(4):49-52.
2李鹏,关勇,刘文举,徐波.基于多基音跟踪的单声道混合语音分离[J].计算机应用研究,2008,25(6):1660-1662. 被引量：1
3王都生,铁满霞,樊昌信.一种实用的双向跟踪基音周期平滑算法[J].电子学报,1999,27(10):108-110. 被引量：6
4周群群,马泳,鲁瑞津,王宏远.多带激励声码器的改进双路径基音跟踪算法[J].华中科技大学学报（自然科学版）,2012,40(6):54-58.
5李煦,屠明,吴超,国雁萌,纳跃跃,付强,颜永红.基于NMF和FCRF的单通道语音分离[J].清华大学学报（自然科学版）,2017,57(1):84-88. 被引量：1
6杨敏芝.2．84kbps多原型波型编码新技术[J].电信资料,1997(4):27-32.
7余世经,李冬梅,刘润生.一种基于CASA的单通道语音增强方法[J].电声技术,2014,38(2):50-54. 被引量：3
8王雨,林家骏,袁文浩,陈宁.基于计算听觉场景分析的改进清音分离方法[J].华东理工大学学报（自然科学版）,2014,40(2):212-217. 被引量：3
9邓昊,李双田,成少锋.一种改进的音段声码器编码方法[J].信号处理,2003,19(5):448-452. 被引量：2
10孟晔,何培宇,潘帆.基于束搜索法的基音标注新方法[J].信号处理,2011,27(11):1769-1773.

华东理工大学学报（自然科学版）

2013年第3期

浏览历史

内容加载中请稍等...

基于改进基音跟踪算法的单通道语音分离被引量：4

参考文献14

二级参考文献32

共引文献11

同被引文献24

引证文献4

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于改进基音跟踪算法的单通道语音分离 被引量：4

参考文献14

二级参考文献32

共引文献11

同被引文献24

引证文献4

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于改进基音跟踪算法的单通道语音分离被引量：4