期刊文献+

基于K-BERT和残差循环单元的中文情感分析 被引量:2

Chinese Sentiment Analysis Based on K-BERT and Residual Recurrent Units
下载PDF
导出
摘要 【目的】利用自然语言处理技术可以为网络舆论安全提供技术支持。为解决文本情感分析中存在的循环神经网络无法获取深层加浅层的特征信息,以及动态词向量偏离核心语义的问题,本文提出了基于K-BERT和残差循环单元的K-BERT-BiRESRU-ATT的情感分析模型。【方法】首先使用K-BERT模型获取包含背景知识的语义特征向量;之后使用提出的双向残差简单循环单元(Bidirectional Residual Simple Recurrent Unit,BiRESRU),对上下文特征进行序列提取,获取深层和浅层的特征信息;然后利用注意力机制对BiRESRU的输出进行关键词权重增强;最后使用softmax进行结果分类。【结果】在ChnSentiCorp和weibo数据集上,分别达到了95.6%和98.25%的准确率;在计算速度上较使用其他循环网络每轮迭代减少了接近5分钟,提高了计算效率。【结论】K-BERT-BiRESRU-ATT解决了动态词向量偏离核心语义的问题,获得了深层加浅层的特征信息,加速模型计算的同时也提高了分类准确率,但仍对计算能力有较大需求。 [Objective]The use of natural language processing technology can provide technical support for the security of network public opinion.In order to solve the problem that the recurrent neural network in text sentiment analysis cannot obtain the feature information of deep and shallow layers,and the dynamic word vector deviates from the core semantics,a K-BERT-BiRESRU-ATT based on K-BERT and the residual recurrent unit is proposed.[Methods]First,the K-BERT model is used to obtain the semantic feature vector containing background knowledge;Then,the proposed Bidirectional Residual Simple Recurrent Unit(BiRESRU)is used to extract the sequence of the contextual features to obtain deep and shallow feature information;After that,the attention mechanism is used to enhance the keyword weight of the output of BiRESRU;Finally softmax is used to classify the results.[Results]On the ChnSentiCorp and Weibo datasets,the accuracy rates were 95.6%and 98.25%,respectively;the calculation time was reduced by nearly 5 minutes per iteration compared with other recurrent networks,and the computational efficiency was improved.[Conclusions]K-BERT-BiRESRU-ATT solves the problem of the dynamic word vector deviation from the core semantics,obtains the feature information of deep and shallow layers,accelerates the model calculation,and improves the classification accuracy.But it still has a large demand for computing ability.
作者 王桂江 黄润才 黄勃 WANG Guijiang;HUANG Runcai;HUANG Bo(School of Electrical and Electronic Engineering,Shanghai Engineering University,Shanghai 201620,China)
出处 《数据与计算发展前沿》 CSCD 2023年第4期127-138,共12页 Frontiers of Data & Computing
基金 国家自然科学基金(61603242)。
关键词 简单循环单元 K-BERT 情感分析 网络舆论安全 simple recurrent unit K-BERT sentiment analysis security of network public opinion
  • 相关文献

参考文献9

二级参考文献72

  • 1朱远平,戴汝为.基于SVM决策树的文本分类器[J].模式识别与人工智能,2005,18(4):412-416. 被引量:23
  • 2姚天昉,娄德成.汉语语句主题语义倾向分析方法的研究[J].中文信息学报,2007,21(5):73-79. 被引量:77
  • 3BALAHUR A, STEINBERGER R, KABADJOV M, et al. Sentiment analysis in the news[ J]. Infrared Physics and Technology, 2014, 65:94-102.
  • 4JIANG Long, YU Mo, ZHOU Ming, et al. Target-dependent twitter sentiment classification[ C ]//Proc of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Techno- logies . 2011.
  • 5王金刚,于潇,宋丹丹,等.基于中文bag-of-opinions方法的微博情感分析[C]//NLP&CC.2012.
  • 6PAK A, PAROUBEK P. Twitter as a corpus for sentiment analysis and opinion mining [ C ]//Proc of International Conference on Lan- guage Resources and Evaluation. 2010.
  • 7TABOADA M, BROOKE J, TOFILOSKI M, et al. Lexicon-based methods for sentiment analysis [ J ]. Computational Linguistics, 2011, 37(2) : 267-307.
  • 8LUCIANO B, FENG Jun-lan. Robust sentiment detection on twitter from biased and noisy data[ C]//Proc of the 23rd International Con- ference on Computational Linguistics. 2010.
  • 9PANG Bo, LEE L, VAITHYANATHAN S. Thumbs up? Sentiment classification using machine learning techniques [ C ]//Proc of Confe- rence on Empirical Methods in Natural Language Processing. 2002: 79- 86.
  • 10CUI Hang, MITYAL V, DATAR M. Comparative experiments on senti- ment classification for online product reviews [ C ]//Proc of the 21st National Conference on Artificial Intelligence. 2006: 1265-1270.

共引文献219

同被引文献15

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部