期刊文献+

基于多特征的语音活动检测技术分析 被引量:1

Analysis on Voice Activity Detection Technology Based on Multi-features
下载PDF
导出
摘要 针对高强度噪声背景下活动话音无法准确检测的问题,提出了基于多特征的语音活动检测算法,详细论述了该算法中语音信号的采样量化、预加重、分帧和加窗等预处理技术,分析了检测算法设计中的动态门限更新、短暂停顿平滑等关键因素,并总结出了多特征语音活动检测算法的流程图。通过基于硬件平台的算法测试和仿真分析,结果验证了该算法的合理性和有效性,对于复杂背景噪声环境下的活动话音检测有着重要的实用意义。 Under the high-intensity background noise,the voice activity can't be detected accurately,Aiming at this problem,a voice activity detection algorithm based on multi-features is proposed.In this paper,the technologies such as speech signal sample quantization,pre-emphasis,frame windows and other pre-processing are discussed in detail,the key factors such as dynamic threshold updating,pause smooth in algorithm design are analyzed,and the program flow chart is concluded.The result of test and simulation analysis on hardware platform proves the rationality and effectiveness of algorithm,which has important practical significance for voice activity detection under complex background noise environment.
作者 杨咏剑 冀峰
出处 《无线电工程》 2011年第10期24-26,共3页 Radio Engineering
关键词 语音活动检测(VAD) 语音预处理 语音检测算法 多特征检测 voice activity detection voice preprocessing voice detection algorithm multi-feature detection
  • 相关文献

参考文献4

二级参考文献10

  • 1杨胜跃,周宴宇,黄深喜.语音信号端点检测方法与展望[J].信息技术,2005,29(7):5-8. 被引量:4
  • 2Wilpon J G, Rabiner L R, Martin T. An Improved Word-detection Algorithm for Telephone-quality Speech Incorporating Both Syntactic and Semantic Constraints.AT&T Bell Labs. Tech. J., 1984, 63: 479-498.
  • 3Chengalvarayan R, Robust Energy Normalization using Speech/nonspeech Discriminator for German connected Digit Recognition. Proc. Eurospeech'99, Budapest, Hungary, 1999. 61-64.
  • 4Haigh J A, Mason J S. Robust Voice Activity Detection Using Cepstral Features. in Proc. IEEE TENCON, 1993.321-324.
  • 5Deller J R, Proakis J G, Hansen J H L. Discrete-Time Processing of Speech Signals [M]. New York: Macmillan,1993.
  • 6Petrou M, Kittler J. Optimal Edge Detectors for ramp Edges, IEEE Transactions on Pattern Analysis and Machine Intelligence, 1991, 13(5) :483-491.
  • 7Qi Li, Jinsong Zheng, Tsai A, Zhou Qiru. Robust Endpoint Detection and Energy Normalization for Realtime Speech and Speaker Recognition, IEEE Transactions on Speech and Audio Processing, 2002, 10(3):146-157.
  • 8Annex B to ITU-T Recommendation G.729: A silence Compression Scheme for G.729 Optimized for Terminals Conforming to Recommendation V.70, 1996.
  • 9吴大正.信号与线性系统分析[M]高等教育出版社,1986.
  • 10胡光锐,韦晓东.基于倒谱特征的带噪语音端点检测[J].电子学报,2000,28(10):95-97. 被引量:70

共引文献47

同被引文献7

引证文献1

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部