The Continuous Speech Endpoint Detection Algorithm Based on Improved Energy-Zero Method

Cited by: 3

Abstract: Endpoint detection is a critical step in speech recognition and speech emotion recognition systems; its accuracy directly affects the subsequent parameter computation and the recognition results. After analyzing the classic endpoint detection algorithm based on short-time energy and short-time zero-crossing rate (the energy-zero method), the paper summarizes its shortcomings and proposes improvements. The improved algorithm determines the starting point of the speech signal from the forward difference of the short-time energy of adjacent frames, and the ending point from the backward difference. It then corrects the ending point using the ratio of the short-time zero-crossing rate of the signal to that of the background noise. MATLAB simulation results show that the improved algorithm achieves good endpoint detection performance.
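The abstract describes the procedure only in outline, so the following is a minimal Python sketch of one possible reading of it, not the authors' MATLAB implementation. It assumes that the first few frames contain only background noise, that "forward difference" means E[i] - E[i-1] scanned from the beginning and "backward difference" means E[i] - E[i+1] scanned from the end, and that the end point is pushed later while the frame's zero-crossing rate stays well above the noise level. All function names, the frame length, and the thresholds `energy_factor` and `zcr_ratio` are hypothetical choices made for illustration; the record gives no numeric values.

```python
import math


def frame_signal(x, frame_len, hop):
    """Split samples into frames; a trailing partial frame is dropped."""
    return [x[i:i + frame_len] for i in range(0, len(x) - frame_len + 1, hop)]


def short_time_energy(frame):
    """Sum of squared samples in one frame."""
    return sum(s * s for s in frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def detect_endpoints(x, frame_len=200, hop=200, noise_frames=5,
                     energy_factor=5.0, zcr_ratio=3.0):
    """Return (start_frame, end_frame), or None if no energy jump is found.

    Thresholds are illustrative assumptions, not values from the paper.
    """
    frames = frame_signal(x, frame_len, hop)
    energy = [short_time_energy(f) for f in frames]
    zcr = [zero_crossing_rate(f) for f in frames]

    # Background statistics from the leading frames (assumed speech-free).
    noise_energy = sum(energy[:noise_frames]) / noise_frames
    noise_zcr = sum(zcr[:noise_frames]) / noise_frames
    jump = energy_factor * noise_energy

    # Forward pass: first frame whose energy rises sharply over its predecessor.
    start = next((i for i in range(1, len(energy))
                  if energy[i] - energy[i - 1] > jump), None)
    if start is None:
        return None

    # Backward pass: scanning from the end, the first frame whose energy
    # is sharply higher than that of the frame following it.
    end = next((i for i in range(len(energy) - 2, -1, -1)
                if energy[i] - energy[i + 1] > jump), len(energy) - 1)
    end = max(end, start)

    # ZCR correction: extend the end point over low-energy frames whose
    # zero-crossing rate is still well above the background level.
    while end + 1 < len(zcr) and zcr[end + 1] > zcr_ratio * noise_zcr:
        end += 1
    return start, end


def demo_signal(fs=8000):
    """Synthetic test input: 50 Hz hum, a 200 Hz 'voiced' tone,
    a weak 3 kHz 'unvoiced' tail, then hum again."""
    x = []
    for n in range(8800):
        if 2000 <= n < 6000:
            x.append(math.sin(2 * math.pi * 200 * n / fs))
        elif 6000 <= n < 6800:
            x.append(0.02 * math.sin(2 * math.pi * 3000 * n / fs))
        else:
            x.append(0.01 * math.sin(2 * math.pi * 50 * n / fs))
    return x


print(detect_endpoints(demo_signal()))
```

On this synthetic signal the energy differences alone place the end point at the last loud frame, and the zero-crossing check then extends it over the weak high-frequency tail. Real speech and real noise would need overlapping frames and thresholds tuned to the recording conditions.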

Authors: Guo Zhenxing, Luo Zhongming, Wang Lili, Xu Weiping

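Affiliation: School of Measurement-Control Technology and Communications Engineering, Harbin University of Science and Technology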

Source: Journal of Harbin University of Science and Technology (indexed in CAS; Peking University Core Journal), 2009, Issue A01, pp. 86-88, 91 (4 pages)

Keywords: endpoint detection; short-time energy; short-time zero-crossing rate; energy-zero method

Classification: TP29 [Automation and Computer Technology: Detection Technology and Automatic Equipment]
