Sentiment Analysis of Microblog Text Based on Sentiment Dictionary and Semantic Rule Set
Abstract: In recent years, Chinese media platforms such as microblogs have become increasingly integrated into people's lives. Every day, people post their views, feelings, and other subjective information on these platforms. Extracting valuable sentiment information from this content and putting it to use is the task of sentiment analysis. This paper proposes a sentiment analysis method for microblog text based on a sentiment dictionary and a semantic rule set. The method combines several existing basic sentiment dictionaries, constructs a microblog-domain sentiment dictionary from statistical information, and, to account for the semantic characteristics of Chinese, adds a custom semantic rule set. To verify the effectiveness of the method, we used web crawler technology to collect 100,000 microblog comments about COVID-19 and ran experiments on this data set. The experimental results show that, compared with the traditional sentiment-dictionary-based method, our method achieves higher accuracy and more stable performance: the recognition accuracy for positive, negative, and neutral sentiment reaches 79.4%, 82.5%, and 77.3%, respectively. In conclusion, the proposed method based on a sentiment dictionary and a semantic rule set has high accuracy and generalization ability, can effectively identify sentiment in microblog text, and has practical application value.
Authors: Wang Weixian (王伟贤), Wu Jun (吴俊)
Source: Computer Science and Application (《计算机科学与应用》), 2023, No. 4, pp. 754-763 (10 pages)
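
The abstract describes the pipeline only at a high level: merged sentiment dictionaries, a domain dictionary, and a semantic rule set. As a rough illustration of how such pieces could fit together, here is a minimal Python sketch. Everything in it is an assumption made for illustration rather than the authors' implementation: the toy lexicon entries and their weights, the two rules shown (negation flips polarity, degree adverbs scale it), the punctuation reset, the zero threshold, and all function names. Input is assumed to be already segmented into words by a Chinese word segmenter.

```python
from typing import Dict, Iterable, List

# Hypothetical toy lexicons. A real system would merge several published
# sentiment dictionaries and a domain lexicon mined from microblog data.
BASE_LEXICON: Dict[str, float] = {"好": 1.0, "喜欢": 1.0, "差": -1.0, "担心": -1.0}
DOMAIN_LEXICON: Dict[str, float] = {"加油": 1.0, "确诊": -0.5}

# Hypothetical rule resources: negation words and degree adverbs.
NEGATIONS = {"不", "没", "没有"}
DEGREE_ADVERBS: Dict[str, float] = {"很": 1.5, "非常": 2.0, "有点": 0.5}
PUNCTUATION = {"，", "。", "！", "？", ",", ".", "!", "?"}


def merge_lexicons(*lexicons: Dict[str, float]) -> Dict[str, float]:
    """Merge lexicons left to right; later ones override earlier entries."""
    merged: Dict[str, float] = {}
    for lexicon in lexicons:
        merged.update(lexicon)
    return merged


def score_tokens(tokens: Iterable[str], lexicon: Dict[str, float]) -> float:
    """Sum word polarities, applying pending negation and degree modifiers."""
    score = 0.0
    sign, weight = 1.0, 1.0  # modifiers waiting for the next sentiment word
    for token in tokens:
        if token in PUNCTUATION:
            sign, weight = 1.0, 1.0  # modifiers do not cross clause boundaries
        elif token in NEGATIONS:
            sign = -sign  # negation flips polarity; double negation cancels
        elif token in DEGREE_ADVERBS:
            weight *= DEGREE_ADVERBS[token]
        elif token in lexicon:
            score += sign * weight * lexicon[token]
            sign, weight = 1.0, 1.0  # modifiers are consumed once applied
    return score


def classify(tokens: List[str], lexicon: Dict[str, float],
             threshold: float = 0.0) -> str:
    """Map the score to one of the three classes named in the abstract."""
    score = score_tokens(tokens, lexicon)
    if score > threshold:
        return "positive"
    if score < -threshold:
        return "negative"
    return "neutral"


LEXICON = merge_lexicons(BASE_LEXICON, DOMAIN_LEXICON)

if __name__ == "__main__":
    print(classify(["武汉", "加油"], LEXICON))  # positive
    print(classify(["不", "喜欢"], LEXICON))    # negative
    print(classify(["今天", "开会"], LEXICON))  # neutral
```

Letting the domain lexicon override the base one reflects one plausible design choice, namely that domain-specific polarity should win when the two disagree; the paper may resolve such conflicts differently.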