期刊文献+

Web of Science科研社区挖掘算法研究 被引量:5

Research on Web of Science Academic Community Mining Algorithm
下载PDF
导出
摘要 本文以1900-2019年"Web of Science(WOS)"核心合集中的中国科学院(中科院)部分论文数据为面板数据对热点学科、科研社区及相关权威专家进行了分析研究.首先对艺术与人文、生命科学与生物医学、自然科学、社会科学、应用科学五大学科数据进行分析,发现应用科学(Technology)发表论文年增速最快,且研究热点为计算机科学(Computer Science);其次针对研究热点应用Neo4j图数据库构建论文语义网络图,对实体关系进行优化,提升了社区内部关联度;并基于Louvain社区发现算法进行了相关优化和数据挖掘,分析了其背后的优秀科研团队;最后针对挖掘出的社区,利用PageRank算法筛选出高产出的权威科研人员,为科研合作和人才发现甚至国家学科布局提供参考.实验表明,通过Neo4j图数据库中实体数据索引设计,查询性能提升高达16倍;通过对Louvain算法关系属性weight添加机构影响维度,社区模块度提升了84%. Based on the’Web of Science(WOS)’core collection papers of the Chinese Academy of Sciences(CAS)from 1900 to2019 being regarded as the panel data,this study analyzes the issues concerning about the popular subjects,scientific research communities,and related authoritative experts.Firstly,after the data researching in the five subjects of Art and Humanities,Life Sciences and Biomedicine,Natural Sciences,Social Sciences,and Applied Sciences,it is found that the fastest annual grow th of papers published are articles related to Technology Applied Sciences and the popular research topic is the Computer Science.Secondly,for the research of the popular application,network diagram about papers semantics is constructed based on the Neo4 j graph database,and entity relationship is optimized w hich improves the internal relevance of the community.Then,according to the Louvain community discovery algorithm,we perform the related optimization and data mining and analyze the outstanding scientific research team making efforts for the achievements.Finally,targeting the mined communities,we screen out the high-output authoritative scientific researchers for providing the reference for scientific research cooperation,talent discovery,and even the national discipline layout.According to the experiments,the results show that search performance is improved by up to 16 times due to the design of the entity data index based on the Neo4 j graph database.Besides,since adding the dimension of institutional influence into the weight of Louvain algorithm relationship attributes,the community modularity is increased by 84%.
作者 杜伟静 李翀 王宇宸 刘学敏 DU Wei-jing;LI Chong;WANG Yu-chen;LIU Xue-min(Computer Network Information Center,Chinese Academy of Sciences,Beijing 100190,China;University of Chinese Academy of Sciences,Beijing 100190,China)
出处 《小型微型计算机系统》 CSCD 北大核心 2020年第12期2465-2469,共5页 Journal of Chinese Computer Systems
基金 中国科学院“十三五”信息化专项项目(XXH13504-03)资助.
关键词 Web of Science Neo4j图数据库 Louvain算法 算法优化 社区发现 人才挖掘 Web of Science Neo4j graph database Louvain algorithm algorithm optimization community detection talent mining
  • 相关文献

参考文献5

二级参考文献43

  • 1赵卓翔,王轶彤,田家堂,周泽学.社会网络中基于标签传播的社区发现新算法[J].计算机研究与发展,2011,48(S3):8-15. 被引量:37
  • 2Girvan M, Newman M E J. Community structure in social and biological networks[J]. Proceedings of National Academy of Science, 2002, 9(12): 7921-7826.
  • 3Newman M E J. Fast algorithm for detecting community structure in networks[J]. Physical Review E, 2004, 69(6): 1-8.
  • 4Palla G, Derenyi 1, Farkas 1, et al. Uncovering the overlapping community structures of complex networks in nature and society[J]. Nature, 2005, 435(7043): 814-818.
  • 5Shen H, Cheng X, Cai K, et al. Detect overlapping and hierarchical community structure in networks[J]. Physica A, 2009, 388(8): 1706-1712.
  • 6Lancichinetti A, Fortunato S, Kertesz J. Detecting the overlapping and hierarchical community structure in complex networks[J]. New Journal of Physics, 2009, 11(3): 1-8.
  • 7Gregory S. Finding overlapping communities in networks by label propagation[J]. New Journal of Physics, 2010, 12 (10): 1-9.
  • 8Jin D, Yang B, Baquero C, et al. A Markov random walk under constraint for discovering overlapping communities in complex networks[J]. Journal of Statistical Mechanics: Theory and Experiment, 2011, 8 (5): 1-11.
  • 9Blei D M, Ng A v . Jordan MI. Latent Dirichlet allocation[J]. Journal of Machine Learning Research, 2003, 21(3): 993-1022.
  • 10Zhang H, Qiu B, Giles C L, et al. An LDA-based community structure discovery approach for large-scale social networks[C]//Proc of the 7th Conf on Intelligence and Security Informatics. Piscataway, NJ: IEEE, 2007: 200-207.

共引文献94

同被引文献40

引证文献5

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部