期刊文献+

结合扩充词典与自监督学习的网络评论情感分类 被引量:11

Sentiment Classification of Network Reviews Combining Extended Dictionary and Self-supervised Learning
下载PDF
导出
摘要 在高速发展的互联网时代,网络评论情感分析对分析舆情、监控电商有着重要作用。现有分类方法主要有情感词典方法和机器学习方法。情感词典方法过于依赖词典中的情感词,情感词典越完备,网络评论情感倾向越显著,分类效果越好,但对于情感倾向不易区分的评论,其分类效果欠佳。机器学习方法是一种有监督的方法,其分类效果依赖于大量事先标注的语料,目前语料标注是通过人工完成,工作量极大。文中综合了情感词典和机器学习两种方法的特点,构建了一个网络评论情感分类模型,利用相关领域网络评论对情感词典进行扩充,基于情感词典方法的分类结果,通过自监督学习训练一个分类器,进而提高情感倾向模糊文本的分类正确率。实验表明,与情感词典方法和机器学习方法相比,所提模型在酒店评论、京东评论两个数据集上都获得了更好的情感分类效果。 In the rapidly developing Internet era,sentiment analysis of online reviews plays an important role in analyzing public opinion and monitoring e-commerce.Existing classification methods mainly include sentiment dictionary methods and machine learning methods.The sentiment dictionary method relies too much on the sentiment words in the dictionary.The more complete the sentiment dictionary,the more pronounced the sentiment tendency of online comments and the better classification effect.The classification effect of comments is not good when the sentiment tendencies are not easy to distinguish.The machine learning method is a supervised method,and its classification effect relies on a large number of pre-annotated corpora.Currently,the corpus annotation is done manually,and the workload is extremely large.This paper combines characteristics of the two methods to build a new sentiment classification model of network reviews.First,the sentiment dictionary is expanded based on the domain of online reviews,and the sentiment value of each online comment is calculated according to the extended sentiment dictionary.According to the preset sentiment threshold,the comments with significant is sentiment tendencies and higher accuracy are selected as the definite set,and the rest that are not easily distinguished are used as uncertain sets.The classification result of the definite set is directly determined by the sentiment value.Second,according to the definite set from the sentiment dictionary method,a classifier is trained through self-supervised learning,and the training data do not require manual annotation.Finally,the trained classifier is used to classify the uncertain set again,and an improved algorithm is used to improve the classification result of the uncertain set.Experiments show that,compared with the sentiment dictionary method and the machine learning method,the proposed model achieves a better sentiment classification effect for the sentiment classification of hotel reviews and Jingdong reviews.
作者 景丽 李曼曼 何婷婷 JING Li;LI Man-man;HE Ting-ting(School of Computer and Information Engineering,Henan University of Economics and Law,Zhengzhou 450046,China)
出处 《计算机科学》 CSCD 北大核心 2020年第S02期78-82,91,共6页 Computer Science
基金 国家自然科学基金(61806073,31700858,61802110)。
关键词 网络评论 情感分类 词向量 情感词典 机器学习 Internet reviews Sentiment classification Word vectors Sentiment dictionary Machine learning
  • 相关文献

参考文献7

二级参考文献81

  • 1朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:326
  • 2Turney P. Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews[C]//Proceedings of ACL 02, 2002: 417-424.
  • 3Pang B, L Lee, S Vaithyanathan. Thumbs up? Senti ment Classification using Machine Learning Techniques [C]//Proceedings of EMNLP-02, 2002:79-86.
  • 4Kennedy A, D Inkpen. Sentiment Classification of Movie Reviews using Contextual Valence Shifters[J]. Computational Intelligence, 2006,22(2) : 110-125.
  • 5Wiebe J, R Mihalcea. Word Sense and Subjectivity [C]//Proceeding of ACL-COLING-06, 2006: 1065- 1072.
  • 6Hatzivassiloglou V, K McKeown. Predicting the Se mantic Orientation of Adjectives[C]//Proceedings of ACL-97, 1997: 174-181.
  • 7Wiebe J. Learning Subjective Adjectives from Corpora [C]//Proeeedings of AAAI-2000, 2000: 735-740.
  • 8Pang B, L Lee. A Sentimental Education: Sentiment Analysis using Subjectivity Summarization based on Minimum Cuts [C]//Proceedings of ACL-04, 2004: 271-278.
  • 9Cui H, V Mittal, M Datar. Comparative Experiments on Sentiment Classification for Online Product Reviews [C]//Proceedings of AAAI-06, 2006: 1265-1270.
  • 10Andrea E. Determining the Semantic Orientation of Terms through Gloss Classification[C]//Proceedings of CIKM 05, 2005: 617-624.

共引文献229

同被引文献156

引证文献11

二级引证文献38

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部