期刊文献+

基于子带能量特征的最优化语音端点检测算法研究 被引量:22

Optimization of speech endpoint detection base on sub-band energy feature
原文传递
导出
摘要 为了提高噪声环境下语音端点检测的鲁棒性,提出了一种结合多子带能量特征和最优化边缘检测判决准则的算法。该算法的突出优点在于:在不同信噪比情况下,其端点检测滤波器的输出基本不变,从而避免了门限调整所带来的困难。实验结果表明,这种算法在多种噪声环境下都能够达到较好的语音检出效果。这种算法克服了传统语音端点检测以短时能量、基频、过零率等作为检测特征时,需要动态调整门限且在低信噪比情况下鲁棒性较差的缺点。 In order to detect more robustly and precisely the endpoints of speech under noisy environments, an algorithm was proposed in this paper by combining the multiple sub-bands energy as feature and optimal edge detection as decision criteria. The algorithm highlights itself by exemption of the adjustment of the decision threshold due to the stable output of the filters under the environments with different signal-to-noise ratio. Experiments showed that, while easy to tune the parameters, the algorithm can work more robustly under various noisy environments. Thus overtaking the traditional short-time energy, zero crossing rate and pitch based methods.
作者 陈振标 徐波
出处 《声学学报》 EI CSCD 北大核心 2005年第2期171-176,共6页 Acta Acustica
  • 相关文献

参考文献12

  • 1田野,王作英,陆大.基于子带能量线性映射的噪声中端点检测算法[J].清华大学学报(自然科学版),2002,42(7):953-956. 被引量:17
  • 2胡光锐,韦晓东.基于倒谱特征的带噪语音端点检测[J].电子学报,2000,28(10):95-97. 被引量:70
  • 3果永振,何遵文.一种多特征语音端点检测算法及实现[J].通信技术,2003,36(1):8-10. 被引量:8
  • 4高升,徐波,黄泰翼.基于决策树的汉语三音子模型[J].声学学报,2000,25(6):504-509. 被引量:20
  • 5Wu G D, Lin C T. Word boundary detection with mel-scale frequency bank in noisy environment. IEEE Transactions on Speech and Audio Processing, 2000; 8(5): 541-554.
  • 6Ramalingam Hariharan et al. Robust end of utterance detection for real-time speech recognition applications. In Proc. ICASSP'2001.
  • 7CHEN Shaoyan et al. A robust method based on likelihood estimation for speech signal detection. International Symposium on Chinese Spoken Language Processing, 2000.
  • 8HUANG Liangsheng et al. A novel approach to robust speech endpoint detection in car environments. International Conference on Acoustics Speech and Signal Processing, 2000.
  • 9Johan de Veth e~ al. Comparison of channel normalization techniques for automatic speech recognition over the phone. Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP96), 1996; 4:2332-2335.
  • 10Li Qi et al. A Robust real-time endpoint detector with energy normalization for ASR in adverse environments. In Proc. ICASSP'2001, Salt Lake City, 2001.

二级参考文献15

  • 1林焘 王理嘉.语音学教程[M].北京:北京大学出版社,..
  • 2徐波 张亮 等.基于决策树方法的语境有关HMM建模.第八届全国声学学术会议[M].,1998.421-424.
  • 3[1]Junqua J C, Mak B, Reaves B. A Robust Algorithm for Word Boundary Detection in the Presence of Noise [J]. IEEE Transactions on Speech and Audio Processi ng, 1994, 2(3): 406412.
  • 4[2]Lamel, Rabiner L, Rosenberg A, et al. An Improved Endpoint Detector for Isol ated Word Recognition [J]. IEEE Transactions on Acoustic, Speech and Signal Processing, 1981, 29(8): 777785.
  • 5[3]Deller J R, Proakis J G, Hansen J H L, Discrete-Time Processing of Speech Si gnals [M]. New York: Macmillan, 1993.
  • 6[4]Hamada M, Takizawa Y, Norimatsu T. A Noise Robust Speech Recognition [A]. Hiro ya F. 19 90 International Conference on Speech Language Processing [C]. Kobe: Science U niversity of Japan, 1990, 893896.
  • 7[5]Wu GinDer, Lin ChinTeng, Word Boundary Detection with Mel-Scale Frequency Ba nk in Noisy Environment [J]. IEEE Transactions on Speech and Audio Processin g, 2000, 8(5): 541554.
  • 8[6]Fukunaga K. Introduction to Statistical Pattern Recognition [M]. Boston: Aca demic Press, 1990.
  • 9[7]Rabiner L, Juang B H, Fundamentals of Speech Recognition [M]. Englewood Clif fs: PTR Prentice Hall, 1993.
  • 10[8]The Signal Processing Information Base Noise Data [OL]. http: //spib.rice.edu /spib/data/signals/noise/, 2000.

共引文献106

同被引文献164

引证文献22

二级引证文献94

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部