期刊文献+

非平稳噪声环境下基于谐波能量的语音检测 被引量:2

A Speech Endpoint Detection Method Based on Consonance Energy
下载PDF
导出
摘要 语音端点检测的鲁棒性,对于构建实际语音识别系统具有重要的意义。谐波成分是语音信号的一个基本特点,为此提出了一种基于谐波成分能量的端点检测算法。通过sobel算子计算窄带语谱图的方向场,通过Gabor滤波增强谐波区域,通过门限方法得到二值化图,去除方向大于45度和依赖度低的点,得到连续的水平方向的带状分布,即谐波分布区域,求取谐波分布区域内的能量,以此作为门限判决的特征。实验结果表明,在不同信噪比、多种非平稳噪声环境下都能够达到较好的语音检出效果。其优点为,不需要噪声的先验知识,充分利用了语音在频率域和时间域的相关性,适应于各种非平稳复杂噪声。 The robustness of VAD(voice activity detection) is crucial to the construction of a practical automatic speech recognition system. This paper presents a new VAD algorithm based on the energy of consonance, which serves as a basic feature of speech. The consonance region is attained through the following stages: orientation estimation, consonance enhancement, binarisation and post - process via Sobel operator, Gabor filter, threshold classifier and pruning respectively. The pruning is performed mainly through discarding those regions with orientation greater than 45 degree or low dependency. The consonance energy, sum of the consonance regions energy, is then used as the feature of the threshold decision argument of VAD. The experimental results show that the proposed algorithm holds the following advantages: independent of the prior information, fully utilizing the correlation of frequency and time domain of speech, applicable for various complex non - stable noises.
出处 《计算机仿真》 CSCD 2008年第11期305-308,共4页 Computer Simulation
关键词 语音检测 谐波成分能量 语谱图 图像增强 Voice activity detection Consonance energy Spectrogram Image enhancement
  • 相关文献

参考文献5

  • 1L Hong, Y Wan and A K Jain. Fingerprint image enhancement: Algorithm and performance evaluation [ C ]. IEEE Transactions on Pattern Analysis and Machine Intelligence 1995, 20 (8) :777 - 789.
  • 2H LIN, YIFEI W,ANIL J. Fingerprint image enhancement: algorithm and performance valuation [ J ]. IEEE PA--MI, 1998,20 (8) :777-789.
  • 3L Jiang. and X D Huang. Vocabulary - Independent Word Confidence Measure Using Subword Features[ C] , Int. Conf. on Spoken Language Processing, Svndey, Austrilia, 1998.
  • 4田野,王作英,陆大.基于子带能量线性映射的噪声中端点检测算法[J].清华大学学报(自然科学版),2002,42(7):953-956. 被引量:17
  • 5G D Wu, C T Lin. Word boundary detection with mel - scale frequency bank in noisy environment [ C ]. IEEE Transactions on Speech and Audio Processing, 2000, 8(5) : 541 -554.

二级参考文献8

  • 1[1]Junqua J C, Mak B, Reaves B. A Robust Algorithm for Word Boundary Detection in the Presence of Noise [J]. IEEE Transactions on Speech and Audio Processi ng, 1994, 2(3): 406412.
  • 2[2]Lamel, Rabiner L, Rosenberg A, et al. An Improved Endpoint Detector for Isol ated Word Recognition [J]. IEEE Transactions on Acoustic, Speech and Signal Processing, 1981, 29(8): 777785.
  • 3[3]Deller J R, Proakis J G, Hansen J H L, Discrete-Time Processing of Speech Si gnals [M]. New York: Macmillan, 1993.
  • 4[4]Hamada M, Takizawa Y, Norimatsu T. A Noise Robust Speech Recognition [A]. Hiro ya F. 19 90 International Conference on Speech Language Processing [C]. Kobe: Science U niversity of Japan, 1990, 893896.
  • 5[5]Wu GinDer, Lin ChinTeng, Word Boundary Detection with Mel-Scale Frequency Ba nk in Noisy Environment [J]. IEEE Transactions on Speech and Audio Processin g, 2000, 8(5): 541554.
  • 6[6]Fukunaga K. Introduction to Statistical Pattern Recognition [M]. Boston: Aca demic Press, 1990.
  • 7[7]Rabiner L, Juang B H, Fundamentals of Speech Recognition [M]. Englewood Clif fs: PTR Prentice Hall, 1993.
  • 8[8]The Signal Processing Information Base Noise Data [OL]. http: //spib.rice.edu /spib/data/signals/noise/, 2000.

共引文献16

同被引文献17

引证文献2

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部