Multimodal continuous emotion recognition based on EEG and facial video
Abstract: To address the difficulty of capturing changes in emotional intensity across EEG (electroencephalogram) channels and over time, as well as the difficulty of mining the emotional similarity of facial features across different subjects, this paper proposes a multimodal continuous emotion recognition model based on EEG and facial video. A combined convolutional and bidirectional long short-term memory network with a spatial-temporal attention mechanism (STA-CNN-BiLSTM) performs deep feature learning and emotion classification on the power spectral density (PSD) features extracted from EEG. A pre-trained convolutional neural network with a self-attention mechanism (SA-CNN) learns facial geometric features and classifies emotions. A decision-level fusion algorithm then iteratively learns from and fuses the classification results of the two modalities to obtain the final multimodal emotion classification. Extensive comparative experiments were conducted on the public MAHNOB-HCI dataset, with the SA-CNN model pre-trained on facial geometric features from the FER2013 dataset. In subject-independent experiments, the proposed model achieved an average binary classification accuracy of 75.50% on the valence dimension and 79.00% on the arousal dimension, both higher than the best average accuracy of either single modality. Compared with the popular LSSVM, SE-CNN and AM-LSTM models, the proposed model also classifies more accurately, verifying that the proposed spatial-temporal attention mechanism captures more spatiotemporal EEG features and that the self-attention mechanism attends to the similarity of facial features across subjects, thereby improving multimodal emotion recognition performance.
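A minimal PyTorch sketch of the pipeline the abstract describes: an EEG branch with spatial-temporal attention followed by convolution and a BiLSTM over PSD windows, a facial branch with self-attention over landmark coordinates, and decision-level fusion of the two sets of class probabilities. All layer sizes, the exact form of the attention modules, the landmark representation, and the fixed fusion weight `w_eeg` are illustrative assumptions, not the authors' implementation.

```python
# Illustrative sketch only; shapes and attention forms are assumed, not from the paper.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SpatialTemporalAttention(nn.Module):
    """Learns one weight per EEG channel (spatial) and one per time step (temporal)."""

    def __init__(self, n_channels: int, n_steps: int):
        super().__init__()
        self.spatial = nn.Linear(n_channels, n_channels)
        self.temporal = nn.Linear(n_steps, n_steps)

    def forward(self, x):                                        # x: (batch, steps, channels)
        s = torch.softmax(self.spatial(x.mean(dim=1)), dim=-1)   # channel weights
        t = torch.softmax(self.temporal(x.mean(dim=2)), dim=-1)  # time-step weights
        return x * s.unsqueeze(1) * t.unsqueeze(2)


class EEGBranch(nn.Module):
    """Spatial-temporal attention + 1-D convolution + BiLSTM over PSD features."""

    def __init__(self, n_channels=32, n_steps=20, n_classes=2):
        super().__init__()
        self.sta = SpatialTemporalAttention(n_channels, n_steps)
        self.conv = nn.Conv1d(n_channels, 64, kernel_size=3, padding=1)
        self.lstm = nn.LSTM(64, 64, batch_first=True, bidirectional=True)
        self.fc = nn.Linear(2 * 64, n_classes)

    def forward(self, x):                                  # x: (batch, steps, channels)
        x = self.sta(x)
        x = F.relu(self.conv(x.transpose(1, 2)))           # (batch, 64, steps)
        out, _ = self.lstm(x.transpose(1, 2))              # (batch, steps, 128)
        return self.fc(out[:, -1])                         # class logits


class FaceBranch(nn.Module):
    """Self-attention over facial landmark coordinates followed by a small CNN."""

    def __init__(self, n_points=68, n_classes=2):
        super().__init__()
        self.attn = nn.MultiheadAttention(embed_dim=2, num_heads=1, batch_first=True)
        self.conv = nn.Conv1d(2, 32, kernel_size=3, padding=1)
        self.fc = nn.Linear(32, n_classes)

    def forward(self, x):                                  # x: (batch, points, 2) coords
        x, _ = self.attn(x, x, x)
        x = F.relu(self.conv(x.transpose(1, 2)))           # (batch, 32, points)
        return self.fc(x.mean(dim=2))                      # class logits


def decision_level_fusion(eeg_logits, face_logits, w_eeg=0.5):
    """One simple decision-level rule: weighted average of per-modality probabilities."""
    return w_eeg * F.softmax(eeg_logits, dim=-1) + (1 - w_eeg) * F.softmax(face_logits, dim=-1)


if __name__ == "__main__":
    eeg = torch.randn(4, 20, 32)    # 4 trials, 20 PSD windows, 32 EEG channels
    face = torch.randn(4, 68, 2)    # 4 trials, 68 facial landmarks (x, y)
    probs = decision_level_fusion(EEGBranch()(eeg), FaceBranch()(face))
    print(probs.argmax(dim=-1))     # fused binary valence/arousal predictions
```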
Authors: XUE Wen (雪雯); CHEN Jing-xia (陈景霞); HU Kai-lei (胡凯蕾); LIU Yang (刘洋), School of Electronic Information and Artificial Intelligence, Shaanxi University of Science & Technology, Xi'an 710021, China
Source: Journal of Shaanxi University of Science & Technology (《陕西科技大学学报》), 2024, No. 1, pp. 169-176 (8 pages); indexed in the Peking University Chinese Core Journals list
Funding: National Natural Science Foundation of China (61806118); Doctoral Scientific Research Start-up Fund of Shaanxi University of Science & Technology (2020BJ-30)
Keywords: EEG; multimodal emotion recognition; CNN-BiLSTM; spatial-temporal attention mechanism; self-attention mechanism