期刊文献+

基于ELMo和Bi-SAN的中文文本情感分析 被引量:12

Chinese text sentiment analysis based on ELMo and Bi-SAN
下载PDF
导出
摘要 目前情感分析模型通常使用word2vec、GloVe等方法生成静态词向量,并且传统的卷积或循环深度模型无法完整地关注上下文,提取特征不充分,影响情感判断。针对上述问题,提出基于ELMo(embedding from language model)和双向自注意力网络(bidirectional self-attention network,Bi-SAN)的中文文本情感分析模型。首先通过ELMo语言模型训练得到融合词语本身和上下文信息的词向量,解决了一词多义的问题;同时使用预训练的skip-gram算法代替随机初始化的ELMo模型的嵌入层,提高模型的收敛速度;之后使用Bi-SAN提取特征,由于自注意力机制,Bi-SAN可以完整地关注每个词的上下文,提取特征更为全面。同现有的多个情感分析模型对比,该模型在酒店评论数据集上和NLPCC2014 task2中文数据集取得了更高的F 1值,验证了模型的有效性。 Current sentiment analysis models usually use word2vec,GloVe and other methods to generate static word embedding,and traditional convolutional or recurrent depth models cannot fully focus on the context,extract insufficiently features,and reduce the accuracy of sentiment judgment.This paper proposed a Chinese text sentiment analysis model based on ELMo and Bi-SAN.Firstly,through ELMo language model training,the model got the word vector that integrated the word itself and context information to solve the problem of ambiguity of a word.Meanwhile,it used pre-trained skip-gram algorithm to replace the embedding layer of the randomly initialized ELMo model and improved the convergence speed of the model.Then the mo-del used Bi-SAN to extract features.Due to the self-attention mechanism,Bi-SAN could fully focus on the context of each word and extract features more comprehensively.Compared with multiple existing sentiment analysis models,the proposed model achieves higher F 1 in the hotel review dataset and the NLPCC2014 task2 Chinese dataset,which validates the effectiveness of the model.
作者 李铮 陈莉 张爽 Li Zheng;Chen Li;Zhang Shuang(School of Information Science&Technology,Northwest University,Xi’an 710127,China)
出处 《计算机应用研究》 CSCD 北大核心 2021年第8期2303-2307,共5页 Application Research of Computers
基金 国家重点研发资助项目(2020YFC1523301) 陕西省重点研发计划资助项目(2019ZDLGY10-01)。
关键词 情感分析 词向量 ELMo 自注意力机制 sentiment analysis word embedding ELMo self-attention
  • 相关文献

参考文献6

二级参考文献57

  • 1李健,曹垚,王宗敏,王广印.融合k-means聚类和Hausdorff距离的散乱点云精简算法[J].武汉大学学报(信息科学版),2020,45(2):250-257. 被引量:16
  • 2许云,樊孝忠,张锋.一种不需分词的中文文本分类方法[J].北京理工大学学报,2005,25(9):778-781. 被引量:5
  • 3朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:326
  • 4Turney P. Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews[C]//Proceedings of ACL 02, 2002: 417-424.
  • 5Pang B, L Lee, S Vaithyanathan. Thumbs up? Senti ment Classification using Machine Learning Techniques [C]//Proceedings of EMNLP-02, 2002:79-86.
  • 6Kennedy A, D Inkpen. Sentiment Classification of Movie Reviews using Contextual Valence Shifters[J]. Computational Intelligence, 2006,22(2) : 110-125.
  • 7Wiebe J, R Mihalcea. Word Sense and Subjectivity [C]//Proceeding of ACL-COLING-06, 2006: 1065- 1072.
  • 8Hatzivassiloglou V, K McKeown. Predicting the Se mantic Orientation of Adjectives[C]//Proceedings of ACL-97, 1997: 174-181.
  • 9Wiebe J. Learning Subjective Adjectives from Corpora [C]//Proeeedings of AAAI-2000, 2000: 735-740.
  • 10Pang B, L Lee. A Sentimental Education: Sentiment Analysis using Subjectivity Summarization based on Minimum Cuts [C]//Proceedings of ACL-04, 2004: 271-278.

共引文献267

同被引文献162

引证文献12

二级引证文献53

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部