期刊文献+

中文博客多方面话题情感分析研究 被引量:17

Multi-aspect Topic Sentiment Analysis of Chinese Blog
下载PDF
导出
摘要 博客是Web环境中个人表达观点和情感的一种重要载体,一般涉及较宽泛的话题,蕴含丰富的舆情信息。现有针对有关社会事件的用户产生内容进行情感分析的研究多数以篇章级为处理粒度,尚不能满足博客文本深度情感分析的需求。该文提出一种基于LDA话题模型与Hownet词典的中文博客多方面话题情感分析方法。该方法首先利用数据语料训练LDA话题模型,然后以滑动窗口为基本处理单位,利用训练好的LDA模型对博客文本进行话题识别与划分;在此基础上,基于Hownet词典对划分后的话题段落进行情感倾向计算。该方法有助于同时识别博客文本所涉及的多方面子话题及每个子话题上的情感倾向。实验结果表明,该方法不仅能获得较好的话题划分结果,也有助于改善情感分析的准确率。 Weblog is an important media for people to express their personal opinions and sentiment, which generally involve several topics or implied public opinions. The existing sentiment analysis researches on these user generation content are mostly in document level instead of fine granalarities. This paper proposes a novel method based on LDA topic model and HowNet lexicon to determine the sentiment orientation of blogs with multi-aspect topics. The new method utilizes data corpus to train the LDA topic model at first. Then it identifies and segments topics with the trained topic model, which taking a slide window as the basic processing unit. After that, the topics of paragraphs can be identified. And then the method conducts the sentiment analysis on topic paragraphs with HowNet lexicon. The new method can help to simultaneous identify multi-aspect topics and the sentiment orientation of these topics. The experiment results show that this approach can not only obtain a good topic partitioning results, but also help to improve sentiment analysis accuracy.
出处 《中文信息学报》 CSCD 北大核心 2013年第1期47-55,共9页 Journal of Chinese Information Processing
基金 国家自然科学基金资助项目(60903114 61003271 61001185) 广东省自然科学基金资助项目(7301329) 深圳市科技计划资助项目(JC201005280463A)
关键词 多方面情感分析 博客情感分析 LDA模型 HowNet词典 multi-aspect sentiment analysis blog sentiment analysis LDA topic model HowNet lexicon
  • 相关文献

参考文献28

  • 1杨宇航,赵铁军,于浩,郑德权.Blog研究[J].软件学报,2008,19(4):912-924. 被引量:19
  • 2Agarwal Nitin,Huan Liu. Blogosphere:research issues,tools,and applications[J].SIGKDD Explor Newsl,2008,(01):18-31.
  • 3顾明毅,周忍伟.网络舆情及社会性网络信息传播模式[J].新闻与传播研究,2009,16(5):67-73. 被引量:103
  • 4赵妍妍,秦兵,刘挺.文本情感分析[J].软件学报,2010,21(8):1834-1848. 被引量:541
  • 5Pang Bo,Lillian Lee. Opinion Mining and Sentiment Analysis[J].Found Trends Inf Retr,2008,(1-2):1-135.
  • 6黄萱菁,张奇,吴苑斌.文本情感倾向分析[J].中文信息学报,2011,25(6):118-126. 被引量:61
  • 7Turney,Peter D. Thumbs up or thumbs down?:semantic orientation applied to unsupervised classification of reviews[A].2002.417-424.
  • 8Min,Hye-Jin,Jong C Park. Toward finer-grained sentiment identification in product reviews through linguistic and ontological analyses[A].2009.169-172.
  • 9Ding Xiaowen,Bing Liu,Lei Zhang. Entity discovery and assignment for opinion mining applications[A].2009.1125-1134.
  • 10Su Qi,Xinying Xu,Honglei Guo. Hidden sentiment association in chinese web opinion mining[A].2008.959-968.

二级参考文献115

共引文献1109

同被引文献201

引证文献17

二级引证文献164

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部