期刊文献+

基于样本熵与MFCC融合的语音情感识别 被引量:7

Speech Emotion Recognition Based on Fusion of Sample Entropy and MFCC
下载PDF
导出
摘要 提出一种基于样本熵与Mel频率倒谱系数(MFCC)融合的语音情感识别方法。利用支持向量机分别对样本熵统计量与MFCC进行处理,计算其属于高兴、生气、厌烦和恐惧4种情感的概率,采用加法规则和乘法规则对情感概率进行融合,得到识别结果。仿真实验结果表明,该方法的识别率较高。 This paper proposes a method of speech emotion recognition based on fusion of sample entropy and Mel-frequency Cepstral Coefficients(MFCC).Sample entropy statistic and MFCC are modeled with Support Vector Machine(SVM) respectively to obtain the probabilities of happy,angry,bored and afraid.The sum and product rules are used to fuse the probabilities to gain the final decision.Simulation results demonstrate that the recognition rate obtained with the proposed method is high.
出处 《计算机工程》 CAS CSCD 2012年第7期142-144,共3页 Computer Engineering
基金 国家自然科学基金资助项目(61075008)
关键词 语音情感识别 样本熵 MEL频率倒谱系数 支持向量机 speech emotion recognition sample entropy Mel-frequency Cepstral Coefficients(MFCC) Support Vector Machine(SVM)
  • 相关文献

参考文献7

二级参考文献25

  • 1蒋丹宁,蔡莲红.基于语音声学特征的情感信息识别[J].清华大学学报(自然科学版),2006,46(1):86-89. 被引量:37
  • 2王治平,赵力,邹采荣.基于基音参数规整及统计分布模型距离的语音情感识别[J].声学学报,2006,31(1):28-34. 被引量:26
  • 3林琳,王树勋,郭纲.短语音说话人识别新方法的研究[J].系统仿真学报,2007,19(10):2272-2275. 被引量:10
  • 4Huang N E, Shen Zheng, Steven R L. The Empirical Mode Decomposition and the Hilbert Spectrum for Nonlinear and Non-stationary Time Series Analysis[EB/OL]. (1998-03-08). http:/www.citeulike.org/user/elferdo/article/2681232.
  • 5Delechelle E, Lemoine J, Oumar N. Empirical Mode Decomposition: An Analytical Approach for Sifting Process[J]. IEEE Signal Processing Letters, 2005, 12(11): 764-767.
  • 6Young K H, et al. Pitch Detection with Average Magnitude Difference Function Using Adaptive Threshold Algorithm for Estimating Shimmer and Jitter. In: Proc of the 20th IEEE International Annual Conference on Engineering in Medicine and Biology Society. Hong Kong, China, 1998, Ⅵ:3162-3164.
  • 7Wang Y R, Wong I J, Tsao T C. A Statistical Pitch Detection Algorithm. In.. Proc of the IEEE International Conference on Acoustics, Speech, and Signal Processing. Orlando, USA,2002,Ⅰ:13--17.
  • 8Hung W W, Wang H C. On the Use of Weighted Filter Bank Analysis for the Derivation of Robust MFCCs. IEEE Signal Processing Letters, 2001, 8(3):70--73.
  • 9Molau S, Pitz M, Schluter R, Ney H. Computing Mel-Frequency Cepstral Coefficients on the Power Spectrum. In: Proc of the IEEE International Conference on Acoustics, Speech, and Signal Processing. Salt Lake City, USA, 2001, Ⅰ: 73--76.
  • 10Reynolds D A, Rose R C. Robust Text-Independent Speaker Ⅰ-dentification Using Gaussian Mixture Speaker Models. IEEE Trans on Speech and Audio Processing, 1995, 3(1): 72--83.

共引文献38

同被引文献54

  • 1柳长源,孙雨涵,李文强,兰朝凤.基于MAResnet的脑电情感识别研究[J].仪器仪表学报,2020,41(7):235-242. 被引量:5
  • 2张会云.语音情感识别研究综述[J].信息通信,2019,0(11):58-60. 被引量:2
  • 3董志峰,汪增福.基于动态MFCC的说话人识别算法[J].模式识别与人工智能,2005,18(5):596-601. 被引量:7
  • 4白冬梅,邱天爽,鲍海平.基于经验模式分解与样本熵的癫痫预测方法[J].中国生物医学工程学报,2006,25(5):527-531. 被引量:12
  • 5雷雄国,曾以成,李凌.基于近似熵的语音端点检测[J].声学技术,2007,26(1):121-125. 被引量:11
  • 6RICHMAN J S, MOORMAN J R. Physiological time - series analysis using approximate entropy and sample entropy [ J ]. American Journal of Physiology Heart and Circulatory Physi- ology ,2000,278 (6) :2039 -2049.
  • 7Wolpaw J R,Birbaumer N,McFarland D J.Brain-computer interface for communication and control[J].Clinical Neuro-physiology,2002,113(6):767-791.
  • 8Pincus S M.Approximate entropy as a measure of system complexity[J].Proceeding of the National Academy Sciences,1991,88(6):2297-2301.
  • 9Richman J S,Moomaan J R.Physiologica time series analysis using approximate entropy and sample entropy[J].Am J Physio:Heart Circ Physio,2000,278(61):2039-2049.
  • 10Feng Y H, Schlindwein F S. Normalized wavelet packets quantifiers for condition monitoring[J]. Mechanical Systems and Signal Processing,2009,23 ( 3 ) : 712 - 723.

引证文献7

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部