
Optimization of Word2Vec and LSTM Multi-Category Sentiment Classification Algorithm (cited by: 4)
Abstract: As the number of Internet users keeps growing, the volume of user-generated data is multiplying and comment data of every kind is now ubiquitous, so an efficient sentiment classification model is needed. This paper combines Word2Vec with an LSTM neural network to build a three-class sentiment classification model. First, a Word2Vec word-vector model is trained to produce a sentiment dictionary. The dictionary is then used to build word vectors for the training data. Finally, the model is trained while varying the main parameters that affect the accuracy of the LSTM network. The experiments show that when the data are not normalized, the weights use He initialization, the learning rate is 0.001, the loss function is mean squared error, the optimizer is RMSProp, the activation function is tanh, and training runs for 30 rounds, the overall accuracy on the test set reaches 92.28%. Compared with the traditional Word2Vec+SVM method, accuracy improves by about 10%, a clear gain in classification quality that offers a new approach to sentiment classification with LSTM models.
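The training settings named in the abstract (He initialization, tanh activation, mean squared error loss, RMSProp with a learning rate of 0.001, 30 training rounds) can be illustrated with a minimal standard-library sketch. This is not the authors' implementation: a single tanh layer stands in for the LSTM, the abstract does not say where in the network tanh is applied, and the 6-dimensional "sentence vectors", the `rho` and `eps` values, and all function names are hypothetical.

```python
import math
import random


def he_init(fan_in, fan_out, rng):
    """He (Kaiming) normal initialization: zero mean, std = sqrt(2 / fan_in)."""
    std = math.sqrt(2.0 / fan_in)
    return [[rng.gauss(0.0, std) for _ in range(fan_in)] for _ in range(fan_out)]


def forward(W, x):
    """One tanh layer (a stand-in for the LSTM): y_k = tanh(sum_j W[k][j] * x[j])."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in W]


def mse(y, t):
    """Mean squared error between outputs y and one-hot targets t."""
    return sum((yi - ti) ** 2 for yi, ti in zip(y, t)) / len(y)


def rmsprop_step(W, cache, x, t, lr=0.001, rho=0.9, eps=1e-8):
    """Apply one RMSProp update to W in place; return the loss before the update."""
    y = forward(W, x)
    n = len(y)
    for k, row in enumerate(W):
        # Chain rule through MSE and tanh: dL/dz_k = (2/n) * (y_k - t_k) * (1 - y_k^2)
        dz = 2.0 / n * (y[k] - t[k]) * (1.0 - y[k] ** 2)
        for j in range(len(row)):
            g = dz * x[j]
            # Running average of squared gradients scales each weight's step.
            cache[k][j] = rho * cache[k][j] + (1.0 - rho) * g * g
            row[j] -= lr * g / (math.sqrt(cache[k][j]) + eps)
    return mse(y, t)


# Hypothetical toy data: one made-up vector per class with a one-hot target.
# In the paper's setting the inputs would come from Word2Vec word vectors.
DATA = [
    ([0.8, 0.6, 0.0, 0.0, 0.0, 0.0], [1.0, 0.0, 0.0]),  # negative
    ([0.0, 0.0, 0.6, 0.8, 0.0, 0.0], [0.0, 1.0, 0.0]),  # neutral
    ([0.0, 0.0, 0.0, 0.0, 0.8, 0.6], [0.0, 0.0, 1.0]),  # positive
]

rng = random.Random(0)  # fixed seed so the run is repeatable
W = he_init(fan_in=6, fan_out=3, rng=rng)
cache = [[0.0] * 6 for _ in range(3)]

losses = []
for epoch in range(30):  # 30 training rounds, as in the abstract
    epoch_loss = sum(rmsprop_step(W, cache, x, t) for x, t in DATA) / len(DATA)
    losses.append(epoch_loss)

print(f"mean MSE, round 1: {losses[0]:.4f}, round 30: {losses[-1]:.4f}")
```

With a learning rate this small, RMSProp moves each weight by only a few thousandths per step, so the loss falls slowly over 30 rounds. A real replication would swap the single layer for an LSTM over sequences of Word2Vec vectors.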
Authors: WU Ming-Qiang (邬明强); WU Jia-Ming (邬佳明); XIN Wei-Bin (辛伟彬) (Digital Media and Design Academy, Neusoft Institute Guangdong, Foshan 528200, China; Software Vocational and Technical College, Kaifeng University, Kaifeng 475004, China)
Source: Computer Systems & Applications (《计算机系统应用》), 2020, No. 1, pp. 130-136 (7 pages)
Funding: Foshan Science and Technology Innovation Project (2017AG100132)
Keywords: Word2Vec; LSTM; sentiment classification; learning rate; loss function; activation function
