期刊文献+

语音端点检测中能零比方法的改进 被引量:7

Speech Endpoint Detection Method Based on Improved Energy-Zero Ratio
下载PDF
导出
摘要 传统的基于语音信号短时能量与短时过零率之比的单参数双门限端点检测方法对高信噪比的语音信号能实现较好的检测,而在低信噪比的情况下检测正确率却很低。本文在研究了语音信号的非线性分析方法后,提出了一种改进的端点检测方法。首先,对分帧加窗后的每一帧带噪语音信号进行经验模态分解求其短时Teager能量;然后,求每一帧的短时过零率,平滑处理之后进行归一化;最后,求出短时Teager能量与归一化短时过零率之比用于端点检测。经过仿真实验证明,本文提出的改进方法能够在低信噪比的带噪环境下实现比传统能零比方法更好的端点检测效果。 The traditional single-parameter double-threshold endpoint detection method based on the ratio of short-time energy and short-time zero-crossing rate can achieve better performance of high signal-to-noise ratio speech signal,but the correct rate is very low under low SNR circumstance.In this paper,after studying the nonlinear analysis method of speech signal,an improved endpoint detection method is proposed.Firstly,the framed and windowed noisy speech signal is subjected to empirical mode decomposition to obtain the short-time Teager energy;then,the short-time zero-crossing rate of each frame is obtained,and normalized after smoothing;finally,calculate the ratio of the short-time Teager energy to the normalized short-time zero-crossing rate for endpoint detection.After a lot of simulation experiments,the improved method proposed in this paper can achieve better performance than the traditional energy-zero ratio method in the noisy environment with low SNR.
作者 唐俊龙 刘远治 禹智文 张竣 Tang Junlong;Liu Yuanzhi;Yu Zhiwen;Zhang Jun(Physics and Electronic Science Academy,Changsha University of Science&Technology,Changsha Hunan,410114)
出处 《电子测试》 2020年第7期47-49,共3页 Electronic Test
基金 国家科技支撑计划项目(2014BAH28F04) 湖南省教育厅创新平台开放基金项目(17K004).
关键词 语音端点检测 能零比 短时Teager能量 短时过零率 speech endpoint detection energy-zero ratio short-time Teager energy short-time zero-crossing rate
  • 相关文献

参考文献5

二级参考文献26

  • 1周维权.调制域分析技术[J].电子对抗技术,1995(3):38-43. 被引量:5
  • 2刘淑华,胡强,覃团发,梁琳.基于自相关函数最大值的语音端点检测方法[J].电声技术,2006,30(12):47-50. 被引量:10
  • 3江铭虎,袁保宗,林碧琴.神经网络语音识别的研究及进展[J].电信科学,1997,13(7):1-6. 被引量:10
  • 4Lamel L F, Rabiner L R, Rosenberg A E, et al. An improved endpoint detector for isolated word recognition [ J ]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1981,29(4) :777 -785.
  • 5Lu L, Jiang H, Zhang H J. A robust audio classification and segmentation method [ C ]//Proceedings of the 9th ACM In- ternational Conference on Multimedia. 2001.
  • 6Shen J L, Hung J W, Lee L S. Robust entropy-based endpoint detection for speech recognition in noisy environments [ C ]//5th International Conference on Spoken Language Process- ing. 1998:232 - 235.
  • 7Tian Y, Wu J, Wang Z Y, et al. Fuzzy clustering and bayes- ian information criterion based threshold estimation for robust voice activity detection [ C ]//IEEE International Conference on Acoustics, Speech and Signal Processing. 2003 : 444 - 447.
  • 8Chen S S, Gopalakrishnan P S. Clustering via the Bayesian information criterion with applications in speech recognition [ C ]//IEEE International Conference on Acoustics, Speech and Signal Processing. 1998:645 -648.
  • 9Zhao H, Zhao L X, Zhao K et al. Voice activity detection based on distance entropy in noisy environment[ C]//5th International Joint Conference on INC, IMS and IDC. 2009 : 1364 - 1367.
  • 10刘华平,李昕,徐柏龄,姜宁.语音信号端点检测方法综述及展望[J].计算机应用研究,2008,25(8):2278-2283. 被引量:40

共引文献41

同被引文献68

引证文献7

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部