期刊文献+

融合动态梯度和多视图协同注意力的情感分析

Sentiment Analysis Combining Dynamic Gradient and Multi-view Co-attention
下载PDF
导出
摘要 针对多模态情感分析中模态间优化不平衡和多模态特征融合不充分的问题,提出一种融合动态梯度机制和多视图协同注意力机制的多模态情感分析模型(DG-MCM),能够有效挖掘单模态特征并充分融合多模态信息。首先,模型使用预训练模型BERT和堆叠式长短期记忆神经网络(SLSTM)学习文本、音频和视频的特征,并提出动态梯度机制,通过监测各模态对学习目标的贡献差异和学习速度辅助各模态的特征学习。其次,将得到的不同模态的特征使用多视图协同注意力机制进行特征融合,通过将每两个模态投影到多个空间执行交互获得更加充分的融合特征。最后,拼接融合特征和单模态特征进行情感预测。在CMU-MOSI和CMUMOSEI数据集的实验结果表明,该模型能够充分学习单模态和不同模态之间的信息,有效提升多模态情感分析的准确率。 Aiming at the problems of unbalanced inter-modal optimization and inadequate fusion of multimodal features in multimodal sentiment analysis,a multimodal sentiment analysis model combining dynamic gradient mechanism and multi-view co-attention mechanism(DG-MCM)is proposed,which can effectively mine single-modal representation and fully integrate multimodal information.Firstly,the model uses pre-trained model BERT(bidirectional encoder representation from transformers)and stacked long short-term memory(SLSTM)to learn the features of text,audio and video,and proposes a dynamic gradient mechanism.By monitoring the contribution difference and learning speed of each mode,the feature learning of each mode is assisted.Secondly,the features of different modes obtained are fused using the multi-view co-attention mechanism.By projecting every two modes into multiple spaces for interaction,more adequate fusion features are obtained.Finally,fusion features and single-modal features are spliced together for sentiment prediction.Experimental results on CMU-MOSI and CMU-MOSEI datasets show that this model can fully learn information between single mode and different modes,and effectively improve the accuracy of multimodal sentiment analysis.
作者 王香 毛力 陈祺东 孙俊 WANG Xiang;MAO Li;CHEN Qidong;SUN Jun(School of Artificial Intelligence and Computer Science,Jiangnan University,Wuxi,Jiangsu 214122,China;College of Internet of Things Engineering,Wuxi University,Wuxi,Jiangsu 210044,China)
出处 《计算机科学与探索》 CSCD 北大核心 2024年第5期1328-1338,共11页 Journal of Frontiers of Computer Science and Technology
基金 国家自然科学基金(61673194,61672263),国家自然科学基金面上项目(62272202) 国家自然科学基金联合基金项目(U1836218)。
关键词 情感分析 多模态 注意力机制 特征融合 sentiment analysis multimodal attention mechanism feature fusion
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部