期刊文献+

基于结构紧密性的重叠社区发现算法 被引量:14

The Overlapping Community Discovery Algorithm Based on Compact Structure
下载PDF
导出
摘要 随着网络结构的不断扩大和日益复杂,传统的重叠社区发现算法已经不能有效地处理大规模网络数据,发现合理的社区结构.本文提出了顶点引力的概念,引入顶点凝聚度和社区凝聚度作为满足社区的外部结构稀疏性和社区内部结构紧密性的判定指标,构造了基于结构紧密性的重叠社区发现算法OCSC.该算法经过预处理,核心子图划分以及核心社区的扩展三个步骤,能有效地发现重叠社区,通过对人工合成网络和真实网络结构的社区发现实验,运用NMI和F1Score等指标验证OCSC算法的合理性和优越性. With the continuous expansion and complexity of network structure,the traditional overlapping community detection algorithm can not effectively discover reasonable community structure in large-scale network structure.Based on the concept of vertex gravity proposed in this paper,we introduce vertex cohesion and community cohesion as indexes for community structure-close internal structure and sparse external structure,and then put forward overlapping community structure algorithm OCSC.The steps of OCSC algorithm include pre-processing,core sub-mapping and core community expansion.Finally,NMI and F1Score confirm the rationality and superiority of OCSC algorithm by experimentation on synthetic and real network structures.
作者 潘剑飞 董一鸿 陈华辉 钱江波 戴明洋 PAN Jian-fei;DONG Yi-hong;CHEN Hua-hui;QIAN Jiang-bo;DAI Ming-yang(Faculty of Electrical Engineering and Computer Science,Ningbo University,Ningbo,Zhejiang 315211,China;Baidu Online Technology Co.Ltd.,Beijing 100084,China)
出处 《电子学报》 EI CAS CSCD 北大核心 2019年第1期145-152,共8页 Acta Electronica Sinica
基金 国家自然科学基金(No.61572266 No.61472194) 浙江省自然科学基金(No.LY16F020003) 宁波市自然科学基金(No.2017A610114)
关键词 社区发现 重叠社区 核心社区 大规模网络结构 SPARK community discovery overlapping community core community large-scale network structure spark
  • 相关文献

参考文献3

二级参考文献93

  • 1Big data. Nature, 2008, 455(7209): 1-136.
  • 2Dealing with data. Science,2011,331(6018): 639-806.
  • 3Holland J. Emergence: From Chaos to Order. RedwoodCity,California: Addison-Wesley? 1997.
  • 4Anthony J G Hey. The Fourth Paradigm: Data-intensiveScientific Discovery. Microsoft Research, 2009.
  • 5Phan X H, Nguyen L M,Horiguchi S. Learning to classifyshort and sparse text Web with hidden topics from large-scale data collections//Proceedings of the 17th InternationalConference on World Wide Web. Beijing, China,2008:91-100.
  • 6Sahami M, Heilman T D. A web-based kernel function formeasuring the similarity of short text snippets//Proceedingsof the 15th International Conference on World Wide Web.Edinburgh, Scotland, 2006: 377-386.
  • 7Efron M, Organisciak P,Fenlon K. Improving retrieval ofshort texts through document expansion//Proceedings of the35th International ACM SIGIR Conference on Research andDevelopment in Information Retrieval. Portland, OR, USA,2012: 911-920.
  • 8Hong L,Ahmed A, Gurumurthy S,Smola A J, Tsioutsiou-liklis K. Discovering geographical topics in the twitterstream//Proceedings of the 21st International Conference onWorld Wide Web(WWW 2012). Lyon, France, 2012:769-778.
  • 9Pozdnoukhov A,Kaiser C. Space-time dynamics of topics instreaming text//Proceedings of the 3rd ACM SIGSPATIALInternational Workshop on Location-Based Social Networks.Chicago-IL,USA, 2011: 1-8.
  • 10Sun Yizhou,Norick Brandon, Han Jiawei, Yan Xifeng, YuPhilip S,Yu Xiao. Integrating meta-path selection with user-guided object clustering in heterogeneous information net-works/ /Proceedings of the 18th ACM SIGKDD InternationalConference on Knowledge Discovery and Data Mining.Beijing, China, 2012: 1348-1356.

共引文献713

同被引文献78

引证文献14

二级引证文献24

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部