期刊文献+

基于双向量模型的自适应微博话题追踪方法 被引量:4

Self-adaptive Method Based on Double-vector Model for Microblog Topic Tracking
下载PDF
导出
摘要 针对微博文本篇幅短小、网络新词层出不穷等特点以及在话题发展过程中产生的漂移问题,提出了基于双向量模型的自适应微博话题追踪方法.该方法首先提出双向量模型,将文本用词嵌入和VSM向量空间模型两种方法分别向量化,保留文本语义的同时也解决了微博新词问题.其次,将话题和微博分别用双向量模型表示,计算话题双向量模型和微博双向量模型的余弦相似度作为话题与微博的相似度.接着,将话题与微博的相似度与自适应学习获得的相似度阈值进行比较,判定微博是否为话题相关微博.最后,自适应更新话题模型,能够有效地应对微博话题发展所产生的漂移.实验结果表明,该方法能够实时地跟踪话题并降低了话题相关微博的漏检率和误检率. In order to handle the characteristics of microblog such as short texts,continuous emergence of network neologisms and topic drifting,an adaptive microblog topic tracking method based on Double-Vector model is proposed. Firstly,a Double-Vector model is proposed to transform texts into vectors with word embedding technology and VSM( Vector Space Model),so that the text semantics is preserved and the problem of microblog neologisms is solved. Secondly,the similarity between a microblog and a topic is represented by the cosine value of the Double-Vector model of the microblog and the Double-Vector model of the topic. Thirdly,the similarity between a microblog and a topic is compares with the similarity threshold that is obtained by self-adaptive learning to determine whether the microblog is topic relevant microblog or not. Finally,through self-adaptive updating the topic model,the topic drift aroused by the development of microblog topics can be effectively overcomed. Experimental results show that the proposed method can effectively track the changes of the topic in real time and reduce the missing rate and false positive rate of the topic related microblog.
作者 黄畅 郭文忠 郭昆 HUANG Chang;GUO Wen-zhong;GUO Kun(College of Mathematics and Computer Sciences,Fuzhou University,Fuzhou 350116,China;Fujian Provincial Key Laboratory of Network Computing and Intelligent Information Processing,Fuzhou 350116,China;Key Laboratory of Spatial Data Mining & Information Sharing,Ministry of Education,Fuzhou 350116,China)
出处 《小型微型计算机系统》 CSCD 北大核心 2019年第6期1203-1209,共7页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(61300104,61300103,61672158)资助 福建省高校杰出青年科学基金项目(JA12016)资助 福建省高等学校新世纪优秀人才支持计划项目(JA13021)资助 福建省杰出青年科学基金项目(2014J06017,2015J06014)资助 福建省科技创新平台计划项目(2009J1007,2014H2005)资助 福建省自然科学基金项目(2013J01230,2014J01232)资助 福建省高校产学合作项目(2014H6014,2017H6008)资助
关键词 话题追踪 微博 自适应 双向量 topic tracking microblog self-adaptive double-vector
  • 相关文献

参考文献8

二级参考文献139

共引文献259

同被引文献42

引证文献4

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部