期刊文献+

基于梯度数据选择的跨领域情感分析

Cross-domain Sentiment Analysis Based on Gradient Data Selection
下载PDF
导出
摘要 情感分析研究往往依赖特定场景的大量标注数据,领域迁移会导致模型需要重新进行训练,因此跨领域情感分析任务应运而生。然而,目前跨领域情感分析方法大多考虑学习域的不变特征,或进行特征适应以处理源领域与目标领域的差距,并没有充分、高效地利用源领域数据,所以需要从数据迁移和利用的角度研究领域适应问题。不同于之前从源领域提取可迁移样本的启发式方法,提出一种通用的可学习的替代方法——梯度数据选择(GDS),可在训练过程中自动分配样本权重,由此使不可迁移的样本很容易在采样过程中被丢弃,而可迁移的样本会被更加频繁地采样到。在英文和中文两个数据集上进行实验,并将GDS应用到BERT、RoBERTa等模型进行通用性分析,实验结果证实了该方法的有效性。 The research of sentiment analysis often relies on a large amount of labeled data in specific domain,and domain transfer will lead to the retraining of models.Therefore,cross-domain sentiment analysis tasks emerge as the times require.However,most of the cross-domain sentiment analysis methods mainly consider learning domain-invariant features or perform feature adaptation to deal with the gap between the source domain and the target domain,and do not fully and efficiently utilize the source domain data,so study domain adaptation from the per‐spective of data transfer and utilization.Unlike previous heuristics that extract transferable samples from the source domain,this paper propos‐es a general and learnable alternative,gradient data selection(GDS),to automatically assign weights to samples during training.As a result,non-transferable samples are easily discarded in the sampling process,while transferable samples are sampled more frequently.Related exper‐iments are carried out on two data sets of English and Chinese,and GDS is applied to different models(BERT,RoBERTa,etc.)for generality analysis,the results confirm the effectiveness of this method.
作者 苏仪 周夏冰 SU Yi;ZHOU Xia-bing(School of Computer Science and Technology,Soochow University,Suzhou 215006,China)
出处 《软件导刊》 2023年第5期50-56,共7页 Software Guide
基金 国家自然科学基金面上项目(62176174)。
关键词 跨领域情感分析 领域适应 数据选择 半监督学习 迁移学习 预训练模型 cross-domain sentiment analysis domain adaptation data selection semi-supervised learning transfer learning pretrained model
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部