摘要
大数据环境下的科研合作预测亟需基于海量数据资源来自动学习和发现研究者间的关联性,提高预测效率和效果。首先基于海量数据构建合著网络,并以合著关系表示科研合作;接着基于深度学习的网络表示学习方法(network embedding)学习研究者在所处网络的语境信息,形成每个研究者的稠密、低维向量表示;最后通过向量相似度指标计算研究者间的语义相似度,实现科研合作预测和推荐。在图书情报领域的实验验证了该方法能够提高科研合作预测的准确率和效果,更好地进行关联推荐。该方法从数据科学视角丰富和扩展了基于复杂网络的情报分析方法。
In order to improve the efficiency and effect of predicting research collaborations in a large data environment, correlations among researchers should be learned and discovered automatically from massive datasets. Firstly, the co-authorship network is built from a massive dataset where research collaborations are indicated by co-authorship. Then, the researchers' context in the network is learned by network embedding based on the deep machine learning method, and each researcher's dense, low-dimensional vector is formatted. Finally, the semantic similarities among authors are calculated through the vector similarity indices for research collaboration prediction. Experiments in the field of Library and Information Science verify that the method can improve the accuracy and efficiency of research collaboration prediction. This method enriches and expands the information analysis methods based on complex networks from the perspective of data science.
作者
张金柱
于文倩
刘菁婕
王玥
Zhang Jinzhu;Yu Wenqian;Liu Jingjie;Wang Yue(Department of Information Management, School of Economics and Management, Nanjing University of Science and Technology, Nanjing 210094)
出处
《情报学报》
CSSCI
CSCD
北大核心
2018年第2期132-139,共8页
Journal of the China Society for Scientific and Technical Information
基金
国家自然科学基金"基于被引科学知识突变的突破性创新动态识别及其形成机理研究"(71503125)
国家自然科学基金"基于聚合的社会化短文本信息处理与细粒度倾向性分析"(71503126)
江苏省社会科学基金"基于社团结构动态演化的主题突变监测与形成机制研究"(17TQC003)
关键词
科研合作预测
网络表示学习
合著网络
链路预测
research collaboration prediction
network embedding
co-authorship network
link prediction