期刊文献+

语音识别特征提取中对特征方法的对比

Comparison of feature methods in feature extraction for speech recognition
下载PDF
导出
摘要 人工智能概念的提出,让语音识别迎来了新的生机。随着相关知识与技能的飞速发展,神经网络带动了语音识别领域相关知识的革新。文章使用语音识别中常见的LPCC特征、MFCC特征和PLP特征对同一段语音进行特征提取,通过特征图像化可以直观展示其特征的优劣势。其中,LPCC特征对频谱包络变化较为敏感;MFCC特征具有较好语音信号的短时频谱,对信号的语音干扰和音量变化等抗干扰能力较好,但高频细节不够清晰;PLP特征具有较好的鲁棒性,对信号的语音干扰和音量变化等有很好的抗干扰能力,且对高频部分的细节信息表示更为准确。 The introduction of the concept of artificial intelligence has ushered in new vitality for speech recognition.With the rapid development of related knowledge and skills,neural networks have driven the innovation of relevant knowledge in the field of speech recognition.This article uses common LPCC features,MFCC features,and PLP features in speech recognition to extract features from the same segment of speech.Through feature visualization,the advantages and disadvantages of these features can be visually displayed.Among them,LPCC features are more sensitive to changes in spectral envelope.MFCC features have a good short-term spectrum of speech signals,and have good anti-interference ability against speech interference and volume changes,but high-frequency details are not clear enough.PLP features have good robustness and have good anti-interference ability against speech interference and volume changes in signals,and are more accurate in representing detailed information in high-frequency parts.
作者 郭明琦 GUO Mingqi(Yellow River Conservancy Technical Institute,Kaifeng,Henan 475000,China)
出处 《计算机应用文摘》 2024年第2期96-99,共4页 Chinese Journal of Computer Application
关键词 语音识别 特征提取 LPCC MFCC PLP speech recognition feature extraction LPCC MFCC PLP
  • 相关文献

参考文献7

二级参考文献112

  • 1林坤辉,息晓静,周昌乐.基于HMM与神经网络的声学模型研究[J].厦门大学学报(自然科学版),2006,45(1):44-46. 被引量:13
  • 2陈方,高升.语音识别技术及发展[J].电信科学,1996,12(10):54-57. 被引量:26
  • 3Sambur M R. Selection of Acoustic Features for Speaker Identification [C]. IEEE Trans On ASSP, 1975: 176-182.
  • 4Rabineer L R, Juang B H. Fundamentals of Speech Processing and Recognition[M]. Prentice-HalL 1993.
  • 5Junqua J C, Wakital H, Hermansky H. Evaluation and Optimization of perceptualyy-based ASR front-end[j]. IEEE Tran. ASSP-1, 1993, (3):39-48.
  • 6Gersho A, Gray R M. Vector Quantization and Signal compression[M].Boston:Kluwer Academic Pubfishers, 1992.
  • 7易克初,田斌,付强.语音信号处理.北京:国防工业出版社,2003:160-197
  • 8Lippmann R, Singer E. Hybrid Neural HMM Approaches Wordspotting ICASSP
  • 9van Bezooijen R,Otto SA,Heenan TA. Recognition of vocal expressions of emotion:A three-nation study to identify universal characteristics[J].{H}JOURNAL OF CROSS-CULTURAL PSYCHOLOGY,1983,(04):387-406.
  • 10Tolkmitt FJ,Scherer KR. Effect of experimentally induced stress on vocal parameters[J].Journal of Experimental Psychology Human Perception Performance,1986,(03):302-313.

共引文献427

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部