期刊文献+

适用于方面级情感分析的多级数据增强方法

Multi-Level Data Augmentation Method for Aspect-Based Sentiment Analysis
下载PDF
导出
摘要 【目的】方面级情感分析能够更好地洞察用户评论,是近年来研究的热点。针对方面级情感分析领域中标签数据较难获取的问题,设计简单而有效的多级数据增强方法。【方法】在不改变情感极性的前提下,针对一个评论中特定几个目标方面进行句子级相邻词、领域级同类词和词向量级同义词替换,既保证了标签不变性,又能够生成多样化的合成训练样本。每种数据增强方法能够单独运用或者随机组合运用。【结果】提出的方案分别运用在基于注意力机制+预训练模型和基于依赖树+预训练模型上,并应用于对比学习框架。在SemEval 2014 Task 4 Sub Task 2上进行实验,实验结果表明提出的数据增强方法是有效的,Accuracy和Macro-f1指标优于基准指标。【结论】多级数据增强方法可以有效缓解方面级情感分析任务中数据不足问题,既可以作为原训练数据的有效补充实施共同训练,也可以构建正样本用于对比学习实施多任务训练。 [Objective]Aspect-level sentiment analysis provides better insights into user reviews and has become a research hotspot in recent years.This paper designs a simple and effective triple-level data augmentation method,addressing the problem that label data is difficult to obtain in the field of aspect-level sentiment analysis.[Methods]Under the premise of not changing the emotional polarity,sentence-level adjacent words,domain-level similar words,and word vector-level synonyms are replaced for specific target aspects in a comment,which not only ensures label invariance but also generates diverse Synthetic training samples.Each enhancement method in the multi-level data enhancement method can be used either individually or in random combinations.[Results]The proposed schemes are applied to the attention mechanism with the pre-trained model and the dependency tree with the pre-trained model respectively,and tested in the contrastive learning framework.The experiments are carried out on SemEval 2014 Task 4 Sub Task 2.The experimental results show that the proposed data enhancement method is effective,and the values of indicators of Accuracy and Macro-f1 are better than the baseline ones.[Conclusions]Multi-level data augmentation method can effectively alleviate the problem of insufficient data in aspect-level sentiment analysis tasks.It can be used as an effective supplement to the original training data for joint training,and can also be constructed for contrastive learning to implement multi-task training.
作者 张蓉 刘渊 ZHANG Rong;LIU Yuan(School of Internet of Things Engineering,JiangSu Vocational College of Information Technology,WuXi,JiangSu 214153,China;School of Artificial Intelligence and Computer,JiangNan University,WuXi,JiangSu 214122,China)
出处 《数据与计算发展前沿》 CSCD 2023年第5期140-153,共14页 Frontiers of Data & Computing
基金 国家自然科学基金资助项目“面向天地一体化信息网络的可伸缩与可重构仿真技术”(61972182) 江苏省高等职业教育高水平专业群建设项目“物联网应用技术”(苏教职函[2021]1号) 江苏省高校“青蓝工程”优秀教学团队“物联网应用技术”(苏教办师函[2021]23号)。
关键词 方面级情感分析 预训练模型 数据增强 依赖树 注意力机制 对比学习 aspect-based sentiment analysis pre-trained model data augmentation dependency parse tree attention mechanism contrastive learning
  • 相关文献

参考文献1

二级参考文献9

共引文献31

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部