期刊文献+

基于语音学知识的鲁棒性两级语音起点检测方法 被引量:3

Robust Two-stage Starting-point Detection Method Based On Phonetic Knowledge
下载PDF
导出
摘要 语音识别系统的实用化,需要对噪声有很强的鲁棒性,而噪声环境下的端点检测对整个识别系统性能起着关键的作用。提出一种基于语音学知识的两级起点检测方法,其中第一级选取短时能零比和短时谱幅作为初检特征,并采取自适应门限,第二级根据语音起点能量变化和语音性持续时间进行起点的确定。实验结果表明该方法在常见噪声环境下鲁棒性较好,且适于实时应用。 While speech recognition system is put into use,it must be robust to noise.The endpoint detection in noisy background plays an important role in the whole recognition system.A kind of method is presented in this paper which is based on phonetic knowledge and includes two-stage starting-point detection.Short-time EZQ(Energy Zero Quotient)and short-time spectra amplitude are adopted as initiative detection features in the first stage;the accurate starting-point is determined according to the variation of endpoint phonetic energy and the time lasting of the phonetic features in the second stage.The experiments show that its better robustness can be obtained in common noisy environments and its real-time application.
作者 于迎霞
出处 《电声技术》 北大核心 2004年第5期51-54,共4页 Audio Engineering
关键词 语音识别 端点检测 起点检测 短时能零比 短时谱幅 speech recognition phonetic endpoint detection starting-point detection short-time EZQ short-time spectra amplitude
  • 相关文献

参考文献5

  • 1田野,王作英,陆大.基于子带能量线性映射的噪声中端点检测算法[J].清华大学学报(自然科学版),2002,42(7):953-956. 被引量:17
  • 2王仁华 陈永彬.语言信号处理[M].北京:中国科学技术出版社,1990..
  • 3高慧,周笃强,黄端生.噪声对说话人语音的影响[J].航天医学与医学工程,1999,12(1):72-75. 被引量:9
  • 4贾川 张健.噪声环境下的端点检测算法研究[A]..第六届全国人机语音通讯学术会议论文集[C].,2001..
  • 5.[EB/OL].NOISEX-92噪声库下载URL1:http://spib.ece.rice.edu/ spib/data/signals/noise, URL2: http ://spib. rice.edu/spib/ select noise.html.,.

二级参考文献8

  • 1[1]Junqua J C, Mak B, Reaves B. A Robust Algorithm for Word Boundary Detection in the Presence of Noise [J]. IEEE Transactions on Speech and Audio Processi ng, 1994, 2(3): 406412.
  • 2[2]Lamel, Rabiner L, Rosenberg A, et al. An Improved Endpoint Detector for Isol ated Word Recognition [J]. IEEE Transactions on Acoustic, Speech and Signal Processing, 1981, 29(8): 777785.
  • 3[3]Deller J R, Proakis J G, Hansen J H L, Discrete-Time Processing of Speech Si gnals [M]. New York: Macmillan, 1993.
  • 4[4]Hamada M, Takizawa Y, Norimatsu T. A Noise Robust Speech Recognition [A]. Hiro ya F. 19 90 International Conference on Speech Language Processing [C]. Kobe: Science U niversity of Japan, 1990, 893896.
  • 5[5]Wu GinDer, Lin ChinTeng, Word Boundary Detection with Mel-Scale Frequency Ba nk in Noisy Environment [J]. IEEE Transactions on Speech and Audio Processin g, 2000, 8(5): 541554.
  • 6[6]Fukunaga K. Introduction to Statistical Pattern Recognition [M]. Boston: Aca demic Press, 1990.
  • 7[7]Rabiner L, Juang B H, Fundamentals of Speech Recognition [M]. Englewood Clif fs: PTR Prentice Hall, 1993.
  • 8[8]The Signal Processing Information Base Noise Data [OL]. http: //spib.rice.edu /spib/data/signals/noise/, 2000.

共引文献26

同被引文献23

引证文献3

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部