
Application Research of Speech Emotion Recognition Based on an Elman Neural Network
Abstract: To address the dynamic nature of speech emotion, this paper uses a dynamic recurrent Elman neural network to build a speech emotion recognition system. State feedback in the Elman model is realized by feeding the memorized state from the previous time step into the network together with the current input. On this basis, a speech emotion recognition system is designed that allows the network type to be changed in the back end and supports both single-utterance and batch recognition modes. Experiments on this system show that, under the same model parameter settings, the Elman network achieves better recognition results than a BP neural network, and that the BP network is more sensitive to parameter settings than the Elman network.
Source: Application Research of Computers (《计算机应用研究》), CSCD, Peking University Core Journal; 2012, No. 5, pp. 1809-1814 (6 pages)
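Funding: Fundamental Research Funds for the Central Universities (2012QNZT060); China Postdoctoral Science Foundation, 49th batch, General Program (20110491272); Central South University Postdoctoral Fund (2010-2012); Youth Fund of the Hunan Provincial Department of Education (11B070)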
Keywords: speech emotion recognition; Elman network; back propagation (BP) network; mel frequency cepstrum coefficient (MFCC)
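
The state feedback described in the abstract (the previous hidden state is stored in context units and fed back in alongside the current input) can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation: the class name `ElmanNetwork`, the tanh and softmax activations, the random weight initialization, and the sizes used (12 features per frame, 8 hidden units, 4 emotion classes) are all assumptions chosen for illustration, and training is omitted.

```python
import math
import random


def softmax(scores):
    """Convert raw output scores into probabilities that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class ElmanNetwork:
    """Hypothetical minimal Elman network (forward pass only, untrained)."""

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)

        def matrix(rows, cols):
            return [[rng.uniform(-0.5, 0.5) for _ in range(cols)]
                    for _ in range(rows)]

        self.n_hidden = n_hidden
        self.w_in = matrix(n_hidden, n_in)       # input -> hidden
        self.w_ctx = matrix(n_hidden, n_hidden)  # context -> hidden (feedback)
        self.w_out = matrix(n_out, n_hidden)     # hidden -> output

    def forward(self, frames):
        """Process a sequence of feature frames; return class probabilities."""
        context = [0.0] * self.n_hidden  # context units start at zero
        for x in frames:
            hidden = []
            for j in range(self.n_hidden):
                s = sum(w * xi for w, xi in zip(self.w_in[j], x))
                # State feedback: the previous hidden state enters here.
                s += sum(w * c for w, c in zip(self.w_ctx[j], context))
                hidden.append(math.tanh(s))
            context = hidden  # remember this state for the next frame
        scores = [sum(w * h for w, h in zip(row, context))
                  for row in self.w_out]
        return softmax(scores)


# Assumed sizes: 12 features per frame (e.g. MFCC-like), 4 emotion classes.
net = ElmanNetwork(n_in=12, n_hidden=8, n_out=4)
frames = [[0.1 * t] * 12 for t in range(5)]  # synthetic placeholder frames
probs = net.forward(frames)
```

Because of the context connection, the output depends on the whole frame sequence rather than only on the last frame, which is the property that distinguishes this model from a feed-forward BP network with the same layer sizes.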











