期刊文献+

面向新浪微博文本的情感度判断及其探索性空间分析 被引量:5

Sentimental Judgment and Exploratory Spatial Data Analysis Based on Weibo
原文传递
导出
摘要 提出了一种基于决策树的微博情感度判断方法,并对微博情感做了探索性空间分析,给中文微博平台的海量文本规律研究提供了一个新的视角。以新浪微博数据作为基础,先利用ICTCLAS(Institute of Computing Technology,Chinese Lexical Analysis System)文本分词系统分词、HowNet知网知识库来进行词语相似度计算,再利用ID3(iterative dichotomiser 3)算法训练决策树作为分类器进行微博文本的情感度判断,最后对情感度判断结果进行探索性空间分析。结果表明,基于决策树的微博情感度判断方法的准确度为71.5%,微博用户情绪在空间上存在正的全局空间自相关特性,对局域自相关的分析也揭示了其时空聚集规律。 This paper proposes a method of sentimental judgment of Sina Weibo based on decision tree and makes exploratory spatial data analysis for the emotion of Weibo.Taking Sina Weibo as research data,we use ICTCLAS(Institute of Computing Technology,Chinese Lexical Analysis System)to process the Weibo text for word segmentation and part-ofspeech tagging,calculate the similarity between words based on HowNet system,and ID3(iterative dichotomiser 3)algorithm can be used as a Weibo text sentimental classifier to do exploratory spatial data analysis.Result shows that the method of our proposed has higher accuracy as 71.5%.There exists significant positive spatial autocorrelation(Moran's I)in Weibo sentiment.The analysis of local autocorrelation also shows the regularity of time and space accumulation.
出处 《测绘地理信息》 2018年第1期123-126,共4页 Journal of Geomatics
基金 国家自然科学基金资助项目(41471327)
关键词 空间自相关 情感度判断 决策树 分类算法 探索性空间分析 spatial autocorrelation sentimental judgment decision tree classification algorithm exploratory spatial data analysis
  • 相关文献

参考文献4

二级参考文献95

  • 1朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:326
  • 2H Y Tan. Chinese place automatic recognition research. In: C N Huang, Z D Dong, eds. Proc of Computational Language.Beijing: Tsinghua University Press, 1999
  • 3Zhang Huaping, Liu Qun, Zhang Hao, et al. Automatic recognition of Chinese unknown words recognition. First SIGHAN Workshop Attached with the 19th COLING, Taipei, 2002
  • 4S R Ye, T S Chua, J M Liu. An agent-based approach to Chinese named entity recognition. The 19th Int'l Conf on Computational Linguistics, Taipei, 2002
  • 5J Sun, J F Gao, L Zhang, et al. Chinese named entity identification using class-based language model. The 19th Int'l Conf on Computational Linguistics, Taipei, 2002
  • 6Lawrence R Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. Proc of IEEE, 1989,77(2): 257~286
  • 7Shai Fine, Yoram Singer, Naftali Tishby. The hierarchical hidden Markov model: Analysis and applications. Machine Learning,1998, 32(1): 41~62
  • 8Richard Sproat, Thomas Emerson. The first international Chinese word segmentation bakeoff. The First SIGHAN Workshop Attached with the ACL2003, Sapporo, Japan, 2003. 133~143
  • 9J Hockenmaier, C Brew. Error-driven learning of Chinese word segmentation. In: J Guo, K T Lua, J Xu, eds. The 12th Pacific Conf on Language and Information, Singapore, 1998
  • 10Andi Wu, Zixin Jiang. Word segmentation in sentence analysis.1998 Int'l Conf on Chinese Information Processing, Beijing, 1998

共引文献832

同被引文献39

引证文献5

二级引证文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部