摘要
为了高效地分析挖掘新浪微博社交网络信息传播过程中的关键节点,以Hadoop云计算系统作为存储和处理平台,在X-RIME大规模社会网络分析工具开源框架基础上,针对社交网络中使用HITS(hypertext induced topic selection)链接分析算法挖掘关键节点时,未能体现节点和连接的社会属性问题进行改进.新算法充分考虑了社交网络节点和边的社会属性,对HITS算法节点和边的社会属性权值进行优化计算,提出适合社交网络特点的加权HITS算法.通过Hadoop云平台分别运行加权HITS算法和传统HITS算法对新浪微博社交网络数据进行分析.实验结果表明,加权HITS算法比传统HITS算法具有更高的执行效率和结果区分度,加权HITS算法更适合于大规模社交网络信息传播过程中关键节点的分析挖掘.
To efficiently analyze and mine the key nodes in the information dissemination process of Sina Weibo social networks,the Hadoop cloud computing system is used as the storage and process platform,based on the X-RIME massive social network analysis open source framework,the traditional hyperlink analysis algorithm HITS(hypertext induced topic selection)is improved by exploring the social attributes of nodes and edges.Based on the social attribute characteristics of the nodes and edges in social networks,the social attribute weight values of nodes and edges are computed and optimized in the new weighted HITS algorithm.The weighted HITS algorithm and the traditional HITS algorithm were implemented to analyze the Sina Weibo dataset in the Hadoop cloud platform.Experimental results show that the weighted HITS algorithm provides higher efficiency and better discrimination than the traditional HITS algorithm,and the weighted HITS algorithm is more suitable for analyzing and mining the key nodes of the information dissemination process in large-scale social networks.
作者
陈红松
王钢
张鹏
Chen Hongsong;Wang Gang;Zhang Peng(School of Computer and Communication Engineering,University of Science and Technology Beijing,Beijing 100083,China;Beijing Key Laboratory of Knowledge Engineering for Materials Science,Beijing 100083,China;Railway Police College,Zhengzhou 450053,China)
出处
《东南大学学报(自然科学版)》
EI
CAS
CSCD
北大核心
2018年第4期590-595,共6页
Journal of Southeast University:Natural Science Edition
基金
中央高校基本科研业务费专项资金资助项目(FRF-GF-17-B27)
国家重点基础研究发展计划(973计划)资助项目(2013CB329605)
公安部重大研究资助项目(201202ZDYJ017)
关键词
社交网络
新浪微博
云平台
关键节点
挖掘算法
social network
Sina Weibo
cloud platform
key nodes
mining algorithm