期刊文献+

基于CNN-BIGRU的中文文本情感分类模型 被引量:6

Chinese Comment Sentiment Classification Model Based on CNN-BIGRU
下载PDF
导出
摘要 在当今商业领域,对网络评论的情感分类一直是一个比较热门的研究方向,而为了克服传统机器学习方法所构建分类器会产生较大计算开销,精度表现较差的缺点,提出一种基于深度学习模型中卷积神经网络(CNN)与循环神经网络(RNN)模型的情感分类方法。在以往的研究中,卷积神经网络往往被用来提取文本的局部特征信息,但却容易忽视文本的长距离特征,而RNN则往往被用来提取句子的长距离依赖信息,但容易陷入梯度爆炸问题。因此,结合卷积神经网络对于局部特征信息的良好提取能力与循环神经网络对于长距离依赖信息的记忆能力,构建了一个CNN-BIGRU混合模型,用以提取文本的局部特征以及文本的长距离特征。其中循环神经网络模型使用了双向GRU模型,以避免RNN模型的梯度爆炸与梯度消失问题。在谭松波的酒店评论数据集上的实验结果表明,利用该模型,实验分类的准确率比单独使用卷积神经网络模型最高提升了26.3%,比单独使用循环神经网络模型最高提升了7.9%,从而提高了对中文文本情感分类的精度,并减少了计算开销。 In today’s business field,the sentiment classification of online comments has always been a hot research direction.In order to overcome the shortcomings of the classifier constructed by the traditional machine learning method,such as large computational overhead and poor accuracy,a sentiment classification method based on the convolutional neural network(CNN)and recurrent neural network(RNN)in the deep learning model is proposed.In previous studies,CNN is often used to extract the local feature information of the text,but it is easy to ignore the long-distance feature of the text,while RNN is often used to extract the long-distance dependent information of the sentence,but it is easy to fall into the gradient explosion.Therefore,combining the great local feature information extraction of CNN and the memory of RNN to long-distance dependent information,we construct a CNN-BIGRU hybrid model to extract local feature and long-distance feature of text.A two-way GRU model is used in RNN model to avoid the gradient explosion and gradient disappearance of the RNN model.The experiment on Tan Songbo’hotel reviews data set shows that the classification accuracy of the proposed model is the highest by 26.3% compared with the CNN alone,and the highest by 7.9% compared with RNN alone,so as to improve the accuracy of the affection of Chinese text classification and reduce the computational overhead.
作者 宋祖康 阎瑞霞 SONG Zu-kang;YAN Rui-xia(School of Management,Shanghai University of Engineering Science,Shanghai 201620,China)
出处 《计算机技术与发展》 2020年第2期166-170,共5页 Computer Technology and Development
基金 国家自然科学基金(71301100) 上海市教委科研创新(14YZ140)
关键词 卷积神经网络 循环神经网络 文本分析 情感分类 convolutional neural network recurrent neural network text analysis sentiment classification
  • 相关文献

参考文献11

二级参考文献83

  • 1靳小波.文本分类综述[J].自动化博览,2006,23(z1):24-29. 被引量:16
  • 2陈富国.多维标度法的理论与方法[J].心理科学通讯,1990,13(4):38-42. 被引量:23
  • 3苏金树,张博锋,徐昕.基于机器学习的文本分类技术研究进展[J].软件学报,2006,17(9):1848-1859. 被引量:383
  • 4卢苇,彭雅.几种常用文本分类算法性能比较与分析[J].湖南大学学报(自然科学版),2007,34(6):67-69. 被引量:31
  • 5Jansen B J, Zhang M, Sobel K, Chowdury A. Twitter power: Tweets as electronic word of mouth. Journal of the American Society for Information Science and Technology, 2009, 60(11): 2169-2188.
  • 6Bollen J, Mao H, Zeng X. Twitter mood predicts the stock market. Journal of Computational Science, 2011, 2(1): 1-8.
  • 7Zhao J, Dong L, Wu J, Xu K. MoodLens: An emoticon- based sentiment analysis system for Chinese tweets. In Proc. the 18th KDD, Aug. 2012, pp.1528-1531.
  • 8Jiang L, Yu M, Zhou M, Liu X, Zhao T. Target-dependent Twitter sentiment classification. In Proc. the 49th ACL, Jun. 2011, pp.151-160.
  • 9Liu K L, Li W J, Guo M. Emoticon smoothed language models for Twitter sentiment analysis. In Proe. the 26th AAAI. Jul. 2012.
  • 10Bermingham A, Smeaton A F. Classifying sentiment in mi- croblogs: Is brevity an advantage? In Proc. the 19th ACM International Conference on Information and Knowledge Management, Oct. 2010, pp.1833-1836.

共引文献290

同被引文献43

引证文献6

二级引证文献29

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部