期刊文献+

在线社交网络的UNI64采样方法 被引量:2

UNI64 Sampling Method on Online Social Networks
下载PDF
导出
摘要 在对社交网络采样方法进行研究时,常以拒绝-接受采样法得到的样本作为对照来评价其他采样方法的优劣.由于各种在线社交网络陆续将其用户ID系统由32位升级为64位,导致拒绝-接受采样法的采样命中率近乎为零.本文根据在线社交网络的特点,以新浪微博为例,对其用户ID分布情况进行分析,提出了一种改进的拒绝-接受采样法UNI64.该方法通过分析网络有效ID样本的分布情况,结合聚类的方法将整个样本空间划分为有效区间和无效区间,并使采样算法避开无效区间,仅在有效区间内生成待测样本,从而有效提高了拒绝-接受采样法在有效样本极为稀疏的样本空间内采样的命中率. When studying the sampling methods on online social networks, samples collected by acceptance-rejection method are usually used as the "ground truth" to estimate the pros and cons of other sampling methods. The acceptance rate of the original acceptance-rejection method slumps dramatically since OSN sites updated their user ID from 32 bit to64bit. According to the characteristics of online social networks and taking Sina Weibo for example, we analyzed the distribution of user IDs in Sina Weibo, and proposed an improved acceptance-rejection method called UNI64. In this method, the user ID space is divided into valid intervals and vacant intervals by analyzing the distribution of valid sample IDs and utilizing cluster method. The sampling method generates candidate IDs only in valid intervals, so that the acceptance rate could be effectively improved even in a sparse-distributed user ID space.
出处 《计算机系统应用》 2014年第12期206-212,共7页 Computer Systems & Applications
基金 北京高等学校青年英才计划(YETP0506)
关键词 在线社交网络 采样方法 随机采样 新浪微博 层次聚类 online social networks sampling method random sampling Sina Weibo hierarchical cluster
  • 相关文献

参考文献10

  • 1Watts D J, Strogatz SH. Collective dynamics of 'small-world' networks. Nature, 1998, 393: 440-442.
  • 2Newman MEJ, Watts DJ. Renormalization group analysis of the small-world network model. Physics Letters A, 1999, 263(4-6): 341-346.
  • 3Barabasi AL, Albert R. Emergence of scaling in random networks. Science, 1999, 286(5439): 509-512.
  • 4Cha M, Haddadi H, Benevenuto F, Gummadi KE Measuring user influence in twitter: The million follower fallacy. 4th International AAAI Conference on Weblogs and Social Media. 2010.
  • 5Ahn YY, Hart S, Kwak H, Moon S, Jeong H. Analysis of topological characteristics of huge online social networking services. Proc. of the 16th international conference on World Wide Web. 2007~ 835-844.
  • 6Choudhary A, Hendrix W, Lee K, Palsetia D, Liao WK. Social media evolution of the Egyptian revolution. Commun. ACM, 2012, 55(5): 74-80.
  • 7Steeg GV, Galstyan A. Information-theoretic measures of influence based on content dynamics. Proc. of the 6th ACM International Conference on Web Search and Data Mining. New York, NY, USA. 2013.3-12.
  • 8Kurant M, Markopoulou A, Thiran P. On the bias of BFS. International Teletraffic Congress (ITC 22). 2010.
  • 9Gjoka M, Kurant M, Butts CT, Markoppoulou A. Practical recommendations on crawling online social networks. IEEE Journal on Selected Areas in Communications, 2011, 29(9): 1872-1892.
  • 10Leskovec J, Faloutsos C. Sampling from large graphs. Proc. of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2006. 631-636.

同被引文献17

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部