期刊文献+

基于双语词典的微博多类情感分析方法 被引量:30

A Bilingual Lexicon-Based Multi-class Semantic Orientation Analysis for Microblogs
下载PDF
导出
摘要 现有微博文本情感分析方法多面向单一语种语料,如:中文语料.但是,中英文搭配使用的表达习惯已逐渐成为个体意见表达的重要形式.本文提出一种基于双语词典的多类情感分析方法,通过构建双语多类情感词典对微博文本进行多分类语义倾向性分析,以便更准确有效捕捉群体意见,及时发现社会舆论倾向.通过与多数投票算法、支持向量机算法、基于余弦距离的K近邻分类算法相比,本文提出的基于双语词典的多类情感分析模型具有良好的分类效果,其在分类准确率、F1值等方面都有明显提高. Most of the existing Weibo sentiment analysis focuses on monolingual corpus like Chinese. However,a mixed use of Chinese and English becomes a popular form of expression. To better capture the social attention on public events,this paper proposes a bilingual lexicon based multi-class semantic orientation analysis for bilingual microblogs. We compare our proposed methodologies with majority vote,support vector machine( SVM) and K-nearest neighbor( KNN)by using cosine similarity which are competitive baseline methods. The experimental results showthat our proposed methods outperform the three approaches we mentioned in terms of the accuracy and F1 score.
出处 《电子学报》 EI CAS CSCD 北大核心 2016年第9期2068-2073,共6页 Acta Electronica Sinica
基金 国家重点基础研究发展规划(973计划)项目(No.2013CB329605) 国家自然科学基金(No.61300178)
关键词 双语语义倾向性分析 半监督高斯混合模型 相对熵 情感词典 bilingual semantic orientation analysis semi-supervised gaussian mixture model(Semi-GMM) Kull back-Leibler divergence sentiment lexicon
  • 相关文献

参考文献11

  • 1Melville P,Gryc W,Lawrence R D.Sentiment analysis of blogs by combining lexical knowledge with text classification[A] .Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining[C] .New York:ACM SIGKDD Explorations Newsletter,2009.1275-1284.
  • 2Wan X.Bilingual co-training for sentiment classification of Chinese product reviews[J] .Computational Linguistics,2011,37(3):587-616.
  • 3Meng X,Wei F,Liu X,et al.Cross-lingual mixture model for sentiment classification[A] .Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics:Long Papers-Volume 1[C] .Stroudsburg:Association for Computational Linguistics,2012.572-581.
  • 4Pang B,Lee L.Opinion mining and sentiment analysis[J] .Foundations and Trends in Information Retrieval,2008,2(1-2):1-135.
  • 5Li Y,Li X,Li F,et al.A lexicon-based multi-class semantic orientation analysis for microblogs[A] .Web Technologies and Applications[C] .Cham:Springer International Publishing,2014.81-92.
  • 6Dong Z,Dong Q.HowNet and the Computation of Meaning[M] .Singapore:World Scientific,2006.
  • 7Miller G A.WordNet:a lexical database for English[J] .Communications of the ACM,1995,38(11):39-41.
  • 8Hu M,Liu B.Opinion extraction and summarization on the web[A] .Proceedings of the 21st National Conference on Artificial Intelligence(AAAI 2006)[C] .California:AAAI Press,2006.1621-1624.
  • 9Zhu Y L,Min J,Zhou Y,et al.Semantic orientation computing based on HowNet[J] .Journal of Chinese Information Processing,2006,20(1):14-20.
  • 10Chen J,Xue N,Palmer M S.Using a smoothing maximum entropy model for Chinese nominal entity tagging[A] .Natural Language Processing-IJCNLP 2004[C] .Heidelberg:Springer-Verlag Berlin Heidelberg,2004.493-499.

同被引文献286

引证文献30

二级引证文献376

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部