期刊文献+

动态时间规正与差别子空间相结合的变异语音识别方法 被引量:2

Stressful speech recognition method based on difference subspace integrated with dynamic time warping
下载PDF
导出
摘要 分析了由于说话人受到重力加速度变化而产生的变异语音(应力影响下的变异语音)的特点,把变异语音分成主体部分和变异部分两方面进行研究,提出一种动态时间规正与差别子空间相结合的孤立词语音识别方法。该方法构造差别子空间去除变异部分的影响,利用语音的主体部分进行识别,采用动态时间规正技术对语音特征矢量进行长度对齐,并提出了相应的判别标准。实验结果表明,本方法对应力影响下的变异语音具有良好识别效果,对15个词的小词表,系统平均识别率达到98.3%,与正常语音的识别率基本相当。克服了话者在应力影响下由于心理紧张和生理情况的变化,语音发生严重变异,导致常规语音识别系统性能急剧下降的缺点。 Speech under G-Force was analyzed and considered as principal part and stressful part to research, which produced when speaker was under different acceleration of gravity. An isolated word recognition approach was proposed which integrated difference subspace means with dynamic time warping technique. The method recognized speech under G-Force by constructing a difference subspace to remove the stressful part. Dynamic time warping technique was adopted to make all feature vectors of one word in the training set have equal length, and a corresponding decision criterion was suggested. The experiments showed that for a small vocabulary including 15 words, the method obtained the average recognition rate of 98.3% , which almost equal to the rate in normal environment. The performance of general recognition system was degraded violently for the stressful speech, since G-Force had a direct physical impact on human speech production in addition to the influence on psychology. The method overcame the shortcoming perfectly, not only worked well in normal conditions but also had good performance for speech under G-Force.
出处 《声学学报》 EI CSCD 北大核心 2005年第3期229-234,共6页 Acta Acustica
基金 国家自然科学基金资助项目(60085001)
关键词 变异语音识别 重力加速度 动态时间规正 平均识别率 Feature extraction Learning algorithms Word processing
  • 相关文献

参考文献7

二级参考文献36

  • 1吕成国 张磊 等.G-Stress和Lombard效应作用下的变异语音谱图[J].高技术通讯增刊,2000,:223-226.
  • 2[1]R.P.Lippmann, E.A.Martin, D.B.Paul. Multi-Style Train-ing for Robust Isolated-Word Speech Recognition,ICASSP87, 1987:705-708
  • 3[2]J.H.L.Hansen, M.A.Clements. Stress Compensation and Noise Reduction Algorithms for Robust Speech Recogni-tion, ICASSP'89, 1989:266-269
  • 4[3]Y. Chen. Cepstral Domain Talker Stress Compensation for Robust Speech Recognition, IEEE Transactions On Acoustics, Speech and Signal Processing, 1988, 36(4):433-439
  • 5[5]H.M. Teager, S.M.Teager. Evident for Nonlinear Produc-tion Mechanisms in the Vocal Tract. Study Inst.Onspeech Production and Speech Modeling 1990.
  • 6[6]Firas Jabloun, A.Enis Cetin. The Teager Energy Based Feature Parameters For Robust Speech Recognition In Car noise. ICASSP'99,1999:273-276
  • 7[8]J.F.Kaiser. On a Simple Algorithm to Calculate the Energy of a Signal. ICASSP0, 1990:381-384
  • 8[10]S.Bou-Ghazale and J.H.L Hansen.. Speech Feature Mode-ling for Robust Stressed Speech Recognition.ICSLP'98, 1998:918-921
  • 9Lin Q,1990年
  • 10张家--,Proc ICPhS,1987年,3卷,390页

共引文献31

同被引文献28

  • 1刘海滨,吴镇扬,赵力,曾毓敏.噪声环境下基于最大后验非线性变换的隐马尔可夫模型自适应算法[J].声学学报,2004,29(5):467-471. 被引量:4
  • 2赵蕤,王作英.语音识别中信道和噪音的联合补偿[J].声学学报,2006,31(5):466-470. 被引量:11
  • 3张家禄 齐士钤 宋美珍 等.汉语声调在言语可懂度中的重要作用.声学学报,1981,7:237-237.
  • 4王成友,汤叔祺,梁甸农,陈辉煌,唐朝京.语音识别中多种特征信息综合利用的方法[J].声学学报,1997,22(2):111-115. 被引量:6
  • 5Jonathan Foote, An overview of audio information retrieval. Multimedia Systems, 1999; 7(1): 2-11.
  • 6Smoliar S W, Baker J D, Nakayama T, Wilcox L. Multimedia search: An authoring perspective. In: Proceedings of the First International Workshop on Image Databasesand Multimedia Search, 1996; 1:1-8.
  • 7John Makhoul, Francis Kubala et al. Speech and Language Technologies for Audio Indexing and Retrieval. Proceedings of the IEEE, 2000; 88(8): 1338-1353.
  • 8Smith G, Murase H, Kashino K. Quick audio retrieval using active search. In: Proc. Int. Conf. Acoustics, Speech,Signal Processing, 1998; 6:3777-3780.
  • 9Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase. Feature fluctuation absorption for a quick audio retrieval from long recordings. In: Proc. Int. Conf. Pattern Recognition,2000; 3:3102-3106.
  • 10Johnson S E, Woodland P C. A Method for direct audio search with applications to indexing and retrieval. In:Proc, of International Conference on Acoustics, Speech and Signal Processing (ICASSP '2000), 2000; 3:1427-1430.

引证文献2

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部