
Discovering Similar Users for a Specific User on Microblog (微博中特定用户的相似用户发现方法)

Cited by: 9
Abstract: Analysis of user relationships on microblogs is a recent research focus, and computing the similarity between users is the basis of that analysis. Existing methods for finding similar users look mainly at a user's followees and followers, and their calculations of microblog similarity and interaction correlation make insufficient use of the dynamic nature of microblogs. This paper proposes a new method for discovering users similar to a specific user on a microblog platform. Its contributions are twofold: (1) in addition to followees and followers, visitors are introduced as a third class of candidate users, which extends the ego network model that existing methods build only from followees and followers and increases the diversity of the similar users found; (2) reflecting the dynamic character of microblog social activity, methods are proposed for computing dynamic microblog similarity and dynamic interaction correlation, using time slices to partition social activity and exponential decay as the accumulation strategy, which makes the similarity calculation more reasonable and the discovered similar users more accurate. Using Sina Weibo as a case study, 50 seed users were selected from five domains (academic research, business management, education, culture, and military affairs), and similar-user discovery was analyzed and compared experimentally with S@n (the score of the top n users) as the evaluation metric. The results show that visitors extend the range of similar users that can be found, accounting for 32% of the similar users discovered, and that the dynamic microblog similarity and interaction correlation methods improve the similarity calculation, raising S@n by 1.3 over the latest existing method.
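The abstract names the ingredients of the method (a candidate set drawn from followees, followers, and visitors; per-time-slice similarity; exponential-decay accumulation) but does not give the formulas. The Python sketch below is only one plausible reading and should not be taken as the authors' implementation. Everything in it is an assumption made for illustration: the function names `candidate_users` and `decayed_similarity`, the bag-of-words cosine similarity used within each slice, the `decay` rate, and the normalization by the sum of weights.

```python
import math
from collections import Counter


def candidate_users(followees, followers, visitors, seed):
    """Extended ego network: union of followees, followers, and visitors.

    The seed user is excluded from their own candidate set.
    """
    return (set(followees) | set(followers) | set(visitors)) - {seed}


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def decayed_similarity(slices_u, slices_v, decay=0.5):
    """Accumulate per-slice similarity with exponentially decaying weights.

    slices_u and slices_v are equal-length lists of term Counters, one per
    time slice, ordered oldest first. The newest slice gets weight 1 and a
    slice k steps older gets weight exp(-decay * k). Dividing by the sum of
    weights (an assumption, not from the paper) keeps the result in [0, 1].
    """
    if len(slices_u) != len(slices_v):
        raise ValueError("both users need the same number of time slices")
    latest = len(slices_u) - 1
    total = 0.0
    weight_sum = 0.0
    for t, (a, b) in enumerate(zip(slices_u, slices_v)):
        weight = math.exp(-decay * (latest - t))
        total += weight * cosine(a, b)
        weight_sum += weight
    return total / weight_sum if weight_sum else 0.0
```

Under these assumptions, a candidate whose recent posts match the seed user's topics scores higher than one whose only overlap is in older slices, which is the intended effect of the decay. The same weighting could be applied to per-slice interaction counts (for example replies or reposts) to sketch the dynamic interaction correlation.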
Source: Chinese Journal of Computers (《计算机学报》), indexed in EI, CSCD, and the Peking University Core Journals list; 2016, No. 4, pp. 765-779 (15 pages)
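Funding: National Natural Science Foundation of China (61403156); Jiangsu Province Prospective Joint Research Fund for Industry-University-Research Cooperation (BY2015248); Jiangsu Province Six Talent Peaks Program (XXRJ-013)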
Keywords: user relationship analysis; user similarity calculation; extended ego network; dynamic microblog similarity calculation; dynamic interaction correlation calculation; social media; social networks; data mining
