期刊文献+

基于特征加权的应力影响下顽健语音识别方法 被引量:2

Recognition of Speech under G-force Based on the Weighted Feature
下载PDF
导出
摘要 通过对应力影响下语音数据的分析 ,发现不同的特征维对变异的敏感程度不同。一般低维特征对变异比较敏感 ,相应的高维特征敏感程度差些。在此基础上 ,提出一种新的基于特征加权的变异语音识别方法。该方法通过对不同维特征加不同的权值来消除变异因素对语音特征的影响 ,从而提高系统的识别性能。文中提出对线性权值用最大相对熵估计方法获得权值。对航空模拟飞行器中采集的特定话者小词表孤立词的实验 ,最大相对熵估计方法的识别率可达到 89 9% ,与多重风格训练方法相比 ,识别率提高了 13 1%。 Based on the analysis of stressful speech,an interesting fact that the different dimension of MFCC feature has different sensitivity of G force is found.Generally,the lower dimensions are more sensitive to stress,and the sensitivity of higher dimensions is less.Therefore,a new approach named weighted MFCC feature is proposed for the recognition under G force in the paper.Using the weighted feature to emphasize the influence of higher dimensions,the better performance of recognition system can be achieved.In order to obtain the weights,a new method named maximum relative entropy weights is proposed in which the initial weights are the linear weights.For a small vocabulary speaker dependent system,the recognition rates of these methods are better than that of traditional multi style training method.Among these methods,maximum relative entropy weights can reach the best performance with 89 9% recognition rate,which improves 13 1% comparing with the multi style training method.
出处 《中文信息学报》 CSCD 北大核心 2002年第1期7-12,共6页 Journal of Chinese Information Processing
基金 国家自然科学基金 (项目号 :6 0 0 85 0 0 1)
关键词 应力影响 特征加权 最大熵相对估计 顽健语音识别 Speech recognition G force Weighted feature Maximum relative entropy
  • 相关文献

参考文献6

  • 1[1]R.P. Lippmann, E. A. Martin and D. B. Paul. Multi-Style Training for Robust Isolated-Word Speech Recognition, ICASSP' 87,1987,705-708
  • 2[2]J.H.L. Hansen and B. D. Womack. Classification of Speech Under Stress using Target Driven Features.Speech Communication, 1996,20:131-150
  • 3[3]Y. Chen. Cepstral Domain Talker Stress Compensation for Robust Speech Recognition. IEEE Trans. , On Acoustics, Speech and Signal Processing, 1988,36(4): 433-439
  • 4[4]J.H.L. Hansen. Adaptive Source Generator Compensation and Enhancement for Speech Recognition in Noisy Stressful Environment. ICASSP' 93,1993,2: 95-98
  • 5[5]T. Cover,J. Thomas. Elements of Information Theroy. John Wiley&Sons, Inc. , 1991,90-95
  • 6[6]S. Furui. Cepstral analysis technique for automatic speaker verification. IEEE Trans., On Acoustics,Speech and Signal Processing, 1981,29(4) :254-272

同被引文献21

  • 1张家騄.超音段特征间的相互作用[J].声学学报,1993,18(4):263-271. 被引量:3
  • 2韩纪庆,张磊,王承发.心理紧张情况下的Robust语音识别方法[J].计算机科学,2000,27(9):44-46. 被引量:1
  • 3吕成国 张磊 韩纪庆 等.G-Stress和Lombard效应作用下的变异语音语谱图[J].高技术通讯增刊,2000,:223-226.
  • 4Kaiser J F.On a simple algorithm to calculate the ‘energy'' of a signal [A]..I CASSP''90 [C].USA:IEEE Press,1990.381-384.
  • 5潘胜昔 刘加 江金涛 等.基于多模式及集成判决的稳健电话语音识别算法研究[A].王承发张凯.第五届全国人机语音通讯学术会议论文集[C].,1998.154-159.
  • 6马永林.[D].哈尔滨:哈尔滨工业大学工学,20 01.
  • 7马永林 韩纪庆 张磊 等.应力影响下的变异语音分类[A]..863计划智能计算机主题学术会议论文集[C].,2001.374-378.
  • 8Ghazale S E, Hansen J H L. A comparative study of traditional and newly proposed feature for recognition of speech under stress. IEEE Transaction on Speech and Audio Processing, 2000; 8(4): 429-442.
  • 9Hansen J H L. Morphological constrained feature enhancement with adaptive cepstral compensation (mce-acc) for speech recognition in noise and lombard effect. IEEE Transaction on Speech and Audio Processing, 1994; 2(4):598-614.
  • 10Hansen J H L. Analysis and compensation of speech under stress and noise for environment robustness in speech recognition. Speech Communication, 1996; 20(1/2): 151-173.

引证文献2

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部