Multimodal Sentiment Analysis Based on Cross-Modal Joint-Encoding
Abstract: How to improve the effectiveness of multimodal fusion features is one of the hot issues in the field of multimodal sentiment analysis. Most previous studies obtain fused feature representations by designing complex fusion strategies. These methods often ignore the complex correlations between modalities, and the effectiveness of the fused features is reduced by inconsistent information across modalities, which in turn degrades model performance. To address these problems, this paper proposes a multimodal sentiment analysis model based on cross-modal joint-encoding. For feature extraction, the pre-trained BERT and Facet models are used to extract text and visual features respectively, and unimodal feature representations of the same dimension are obtained through a one-dimensional convolution operation. For feature fusion, a cross-modal attention module is used to obtain joint features of the two modalities; the joint features are then used to adjust the weights of each unimodal feature, the two are concatenated to obtain the multimodal fusion features, and the result is finally fed into a fully connected layer for sentiment recognition. Extensive experiments on the public CMU-MOSI dataset show that the sentiment analysis results of this model are superior to those of most existing state-of-the-art multimodal sentiment analysis methods, effectively improving sentiment analysis performance.
Authors: SUN Bin; JIANG Tao; JIA Li; CUI Yiming (Key Laboratory of Language and Cultural Computing of Ministry of Education, Northwest Minzu University, Lanzhou 730030, China; School of Computer Science, Nanjing University of Information Science & Technology, Nanjing 210044, China)
Source: Computer Engineering and Applications (CSCD, PKU Core Journal), 2024, No. 18, pp. 208-216 (9 pages)
Funding: Young Doctor Fund of the 2022 Gansu Province Education Science and Technology Innovation Project (2022QB-025).
Keywords: multimodal sentiment analysis; joint-encoding; cross-modal attention; multimodal fusion