Voice activity robust detection of noisy speech in Toeplitz

Abstract: To address voice activity (speech endpoint) detection at low signal-to-noise ratios (SNR), a denoising endpoint detection method based on the maximum eigenvalue of a Toeplitz matrix is proposed. The method constructs a symmetric Toeplitz matrix from the autocorrelation sequence of the speech-band spectrum, computes its largest eigenvalue, and applies double-threshold endpoint detection to the speech signal using the information carried by that eigenvalue. Experimental results show that the algorithm effectively distinguishes speech from noise and remains robust under various noisy environments. Compared with a recent method based on signal recurrence rate analysis, it achieves higher accuracy, that is, a lower wrong-decision rate. The algorithm has low computational cost, good real-time performance, and is simple to implement.
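The pipeline outlined in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the 300-3400 Hz speech band, the matrix order of 8, the biased autocorrelation estimate, the power-iteration eigenvalue solver, and all function names are hypothetical choices made here, and the raw largest eigenvalue is thresholded directly because the abstract does not define the exact "information" measure derived from it.

```python
import cmath
import math

def band_spectrum(frame, fs, f_lo=300.0, f_hi=3400.0):
    """Magnitude spectrum of one frame, limited to an ASSUMED speech band (naive DFT)."""
    n = len(frame)
    k_lo = int(math.ceil(f_lo * n / fs))
    k_hi = int(math.floor(f_hi * n / fs))
    spec = []
    for k in range(k_lo, k_hi + 1):
        s = sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(frame))
        spec.append(abs(s))
    return spec

def autocorr(seq, order):
    """Biased autocorrelation estimate r[0..order-1] (gives a PSD Toeplitz matrix)."""
    n = len(seq)
    return [sum(seq[i] * seq[i + lag] for i in range(n - lag)) / n
            for lag in range(order)]

def toeplitz(r):
    """Symmetric Toeplitz matrix whose (i, j) entry is r[|i - j|]."""
    m = len(r)
    return [[r[abs(i - j)] for j in range(m)] for i in range(m)]

def max_eigenvalue(mat, iters=200):
    """Largest eigenvalue of a symmetric PSD matrix by power iteration."""
    m = len(mat)
    v = [1.0] * m
    for _ in range(iters):
        w = [sum(mat[i][j] * v[j] for j in range(m)) for i in range(m)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:          # all-zero matrix, e.g. a silent frame
            return 0.0
        v = [x / norm for x in w]
    av = [sum(mat[i][j] * v[j] for j in range(m)) for i in range(m)]
    return sum(v[i] * av[i] for i in range(m))  # Rayleigh quotient, v has unit norm

def frame_feature(frame, fs, order=8):
    """Per-frame feature: largest eigenvalue of the band-spectrum Toeplitz matrix."""
    return max_eigenvalue(toeplitz(autocorr(band_spectrum(frame, fs), order)))

def double_threshold(feature, low, high):
    """Return (start, end) frame-index pairs, end inclusive.

    A segment is triggered where the feature exceeds `high` and is then
    extended in both directions while the feature stays above `low`.
    """
    segments, i, n = [], 0, len(feature)
    while i < n:
        if feature[i] > high:
            s = e = i
            while s > 0 and feature[s - 1] > low:
                s -= 1
            while e + 1 < n and feature[e + 1] > low:
                e += 1
            segments.append((s, e))
            i = e + 1
        else:
            i += 1
    return segments

if __name__ == "__main__":
    fs, n = 8000, 64
    silence = [0.0] * n
    tone = [math.sin(2 * math.pi * 1000 * t / fs) for t in range(n)]
    frames = [silence] * 3 + [tone] * 4 + [silence] * 3
    feats = [frame_feature(f, fs) for f in frames]
    peak = max(feats)
    print(double_threshold(feats, low=0.1 * peak, high=0.5 * peak))
```

In this toy run the thresholds are set as fractions of the peak feature value purely for illustration; in practice they would more plausibly be derived from a noise-only estimate at the start of the recording.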

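Authors: WANG Jingfang (王景芳), NING Kuangfeng (宁矿凤)

Affiliations: Department of Electrical Engineering, Hunan International Economics University; Department of Computer Science, Hunan International Economics University

Source: Computer Engineering and Applications (《计算机工程与应用》), CSCD, 2013, No. 18, pp. 217-222 (6 pages)

Keywords: voice activity detection; speech bandwidth spectrum; maximum eigenvalue; robustness

Classification number: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]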


References (17)

[1] Raj B, Singh R. Classifier-based non-linear projection for adaptive endpointing of continuous speech[J]. Computer Speech and Language, 2003, 17: 5-26.
[2] Tanyer S G, Ozer H. Voice activity detection in nonstationary noise[J]. IEEE Transactions on Speech and Audio Processing, 2000, 8(4): 478-482.
[3] Karray L, Martin A. Towards improving speech detection robustness for speech recognition in adverse conditions[J]. Speech Communication, 2003, 40: 261-276.
[4] Kuroiwa S, Naito M, Yamamoto S, et al. Robust speech detection method for telephone speech recognition system[J]. Speech Communication, 1999, 27: 135-148.
[5] Ramirez J, Segura J C, Benitez C, et al. Efficient voice activity detection algorithms using long-term speech information[J]. Speech Communication, 2004, 42: 271-287.
[6] Ramirez J, Segura J C, Benitez C, et al. An effective subband OSF-based VAD with noise reduction for robust speech recognition[J]. IEEE Transactions on Speech and Audio Processing, 2005, 13(6): 1119-1129.
[7] Nemer E, Goubran R, Mahmoud S. Robust voice activity detection using higher-order statistics in the LPC residual domain[J]. IEEE Transactions on Speech and Audio Processing, 2001, 9(3): 217-231.
[8] Shen J, Hung J, Lee L. Robust entropy-based endpoint detection for speech recognition in noisy environments[C]//Proc of International Conference on Spoken Language Processing, Sydney, Australia, 1998: 232-238.
[9] Ephraim Y, Van Trees H L. A signal subspace approach for speech enhancement[J]. IEEE Transactions on Speech and Audio Processing, 1995, 3(4): 251-266.
[10] Klein M, Kabal P. Signal subspace speech enhancement with perceptual post filtering[C]//IEEE-ICASSP'02, Orlando, Florida, USA, 2002: 537-540.

