The Continuous Speech Endpoint Detection Algorithm Based on Improved Energy-Zero Method

Cited by: 3
Abstract: Endpoint detection is a critical step in speech recognition and speech emotion recognition systems; its accuracy directly affects the subsequent parameter computation and the recognition results. After analyzing the classic endpoint detection algorithm based on short-time energy and short-time zero-crossing rate (the energy-zero method), we summarize its shortcomings and propose improvements. The improved algorithm locates the starting point of the speech signal by differencing the short-time energy of adjacent frames in the forward direction, and locates the ending point by differencing in the backward direction. It then corrects the ending point using the ratio of the signal's short-time zero-crossing rate to that of the background noise. MATLAB simulation results show that the improved algorithm detects endpoints well.
Source: Journal of Harbin University of Science and Technology (indexed in CAS and the Peking University Core Journals list), 2009, Issue A01, pp. 86-88, 91 (4 pages)
Keywords: endpoint detection; short-time energy; short-time zero-crossing rate; energy-zero method
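
The abstract describes the improved method only in outline and gives no frame sizes or thresholds. A minimal Python sketch of that outline, under stated assumptions, might look like the following. The non-overlapping 200-sample frames, the assumption that the first few frames contain only background noise, the threshold factors `energy_factor` and `zcr_ratio`, and the helper `synthetic_utterance` are all hypothetical choices made for illustration, not values taken from the paper.

```python
import math


def frames(signal, frame_len, hop):
    """Split a list of samples into frames of frame_len, advancing by hop."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]


def short_time_energy(frame):
    """Sum of squared samples in one frame."""
    return sum(x * x for x in frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def detect_endpoints(signal, frame_len=200, hop=200, noise_frames=5,
                     energy_factor=5.0, zcr_ratio=2.0):
    """Return (start_frame, end_frame), or None if no speech is found.

    All thresholds are hypothetical; the abstract does not state them.
    """
    fr = frames(signal, frame_len, hop)
    if len(fr) <= noise_frames:
        return None
    energy = [short_time_energy(f) for f in fr]
    zcr = [zero_crossing_rate(f) for f in fr]

    # Assumption: the leading frames contain only background noise.
    noise_energy = sum(energy[:noise_frames]) / noise_frames
    noise_zcr = max(sum(zcr[:noise_frames]) / noise_frames, 1e-9)
    jump = energy_factor * noise_energy

    # Start point: scan forward for the first large rise E[i+1] - E[i].
    start = next((i + 1 for i in range(len(energy) - 1)
                  if energy[i + 1] - energy[i] > jump), None)
    if start is None:
        return None

    # End point: scan backward for the first large rise E[i-1] - E[i].
    end = next((i - 1 for i in range(len(energy) - 1, start, -1)
                if energy[i - 1] - energy[i] > jump), start)

    # Correction: extend the end while the frame's zero-crossing rate is
    # well above that of the background noise (e.g. a weak unvoiced tail).
    while end + 1 < len(fr) and zcr[end + 1] / noise_zcr > zcr_ratio:
        end += 1
    return start, end


def synthetic_utterance(fs=8000):
    """Hypothetical test signal: hum, loud tone, weak high-pitched tail, hum."""
    def tone(amp, freq, first, count):
        return [amp * math.sin(2 * math.pi * freq * n / fs)
                for n in range(first, first + count)]
    return (tone(0.01, 50, 0, 2000)        # background hum
            + tone(0.5, 200, 2000, 4000)   # voiced segment
            + tone(0.02, 3000, 6000, 800)  # weak, high zero-crossing tail
            + tone(0.01, 50, 6800, 2000))  # background hum


if __name__ == "__main__":
    print(detect_endpoints(synthetic_utterance()))
```

In this synthetic signal the weak tail has too little energy to register in the backward energy difference, so the energy step alone places the end at the last loud frame; the zero-crossing ratio then extends the end across the tail. Real speech would need overlapping frames and thresholds tuned to the recording conditions.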