期刊文献+

基于语音信号与文本信息的双模态情感识别 被引量:8

Multimodal Emotion Recognition Based on Speech Signal and Text Information
下载PDF
导出
摘要 情感识别已成为人机交互不可或缺的部分,目前单模态情感识别具有识别率低、可靠性差的特点,故提出一种融合语音信号与文本信息的双模态情感识别方法。首先,采集特定情感状态下的语音信号及文本信息;然后提取语音相关特征参数以及文本情感关键词特征参数并对其进行优化;最后,对两个单模态识别器的输出结果进行加权融合获得识别结果。针对所提算法进行了相关实验研究,结果表明双模态情感识别技术具有更高识别精度。 Emotion recognition has become an indispensable part of human-computer interaction. This paper propsesa fusion method of speech signal and the text information in emotion recognition,because of the low recognition rate and poor reliability of single modal emotion recognition. First of all,collecting specific emotional state of the speech signal and text information;then extracting the speech feature parameters and keywords emotional char acteristic parameters of text information and optimize it; finally, recognition results are obtained by weighted fusion of the output results of two single modal identification devices. According to the results of experimaental,it showed that the dualmodal emtoion recognition technology has higher recognition accuracy.
作者 陈鹏展 张欣 徐芳萍 Chen Pengzhan Zhang Xin Xu Fangping(School of electrical and Automation Engineering, East China Jiaotong University, Nanchang 330013, China)
出处 《华东交通大学学报》 2017年第2期100-104,共5页 Journal of East China Jiaotong University
基金 国家自然科学基金资助项目(61164011) 江西省研究生创新专项资金项目(YC2015-S242) 江西省博士后科研择优资助项目(2015KY19)
关键词 语音信号 文本识别 参数优化 高斯混合模型 speech signal text recognition parameter optimization gauss mixture model
  • 相关文献

参考文献5

二级参考文献115

  • 1胡江华,柏连发,张保民.象素级多传感器图像融合技术[J].南京理工大学学报,1996,20(5):453-456. 被引量:14
  • 2李军,林宗坚.基于特征的遥感影像数据融合方法[J].中国图象图形学报(A辑),1997,2(2):103-107. 被引量:51
  • 3JOHN R J.陈晓玲(译).遥感数字影像处理导论[M].北京:机械工业出版社,2007.378-384.
  • 4Zeng Z,Pantic M,Roisman G I,et al.A survey of affect recognition methods:audio,visual and spontaneous expressions[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2009,31(1):39-58.
  • 5Hoch S,Althoff F,McGlaun A,et al.Bimodal fusion of emotional data in an automotive environment[C]//Proceedings of the 2005 IEEE International Conference on Acoustics,Speech,and Signal Processing.Philadelphia,Pennsylvania,USA,2005:1085-1088.
  • 6Busso C,Deng Z,Yildirim S,et al.Analysis of emotion recognition using facial expressions,speech and multimodal information[C]//Proceedings of the Sixth International Conference on Multimodal Interfaces.Pennsylvania,USA,2004:205-211.
  • 7Wagner J,Kim J,Andre E.From physiological signals to emotions:implementing and comparing selected methods for feature extraction and classification[C]//Proceedings of the 2005 IEEE International Conference on Multimedia & Expo.Amsterdam,the Netherlands,2005:940-943.
  • 8Khiet T.How does real affect affect affect recognition in speech?[D].Enschede,the Netherlands:Center for Telematics and Information Technology of University of Twente,2009.
  • 9Tato R,Santos R,Kompe R,et al.Emotion space improves emotion recognition[C]//Proceedings of the 2002 International Conference on Speech and Language Processing.Denver,Colorado,USA,2002:2029-2032.
  • 10Schuller B,Rigoll G,Lang M.Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture[C]//Proceedings of the 2004 IEEE International Conference on Acoustics,Speech,and Signal Processing.Montreal,Canada,2004:577-580.

共引文献206

同被引文献42

引证文献8

二级引证文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部