Probability Matrix Factorization for Link Prediction Based on Information Fusion (cited by: 11)
Abstract Social-information networks such as Weibo and Twitter are a typical form of network big data: they contain both a complex network structure among users and the large volume of microblog posts or tweets that those users publish. Most existing link prediction algorithms use only one side, either the network topology or the non-topological information, and effective methods for fusing the two in social-information networks are still lacking. This paper therefore proposes a link prediction method that fuses topic similarity information, taking the users' topics as its starting point. First, a topic representation is extracted for each user from the user's text content, and a topic similarity between users is defined. Next, a sparse user topic-similarity network is built from that similarity. The topic-similarity network and the following/followed network are then fused within a unified probabilistic matrix factorization framework, from which the users' latent feature representations and the network link parameters are learned. Finally, the link likelihood between users is computed within the same framework from the learned latent feature representations and link parameters. The proposed model offers a general strategy and learning method for fusing information from multiple networks. Experiments on four Weibo and Twitter datasets that contain both network structure and text show that the proposed fused probabilistic matrix factorization method is more effective than other link prediction methods.
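The pipeline outlined in the abstract (topic similarity, a sparse similarity network, then a joint factorization) can be illustrated with a small sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the topic vectors `theta` are assumed to be given, cosine similarity with top-`k` pruning stands in for the paper's similarity definition, and the bilinear model `A ≈ U W Uᵀ`, `S ≈ U Uᵀ` with weight `alpha` stands in for the paper's fusion model. The squared-error loss with L2 penalties is the usual MAP objective of probabilistic matrix factorization under Gaussian noise and Gaussian priors.

```python
import numpy as np


def topic_similarity_network(theta, k=2):
    """Build a sparse, symmetric user-user network from topic vectors.

    theta: (n_users, n_topics) per-user topic weights, assumed given.
    Cosine similarity and top-k pruning are illustrative choices.
    """
    unit = theta / np.linalg.norm(theta, axis=1, keepdims=True)
    sim = unit @ unit.T
    np.fill_diagonal(sim, 0.0)                 # no self-links
    top = np.argsort(-sim, axis=1)[:, :k]      # k most similar users per row
    rows = np.arange(sim.shape[0])[:, None]
    sparse = np.zeros_like(sim)
    sparse[rows, top] = sim[rows, top]
    return np.maximum(sparse, sparse.T)        # keep an edge if either side chose it


def fit_fused_pmf(A, S, dim=2, alpha=0.5, lam=0.01, lr=0.01, epochs=2000, seed=0):
    """Fit shared latent factors U and link parameters W by gradient descent.

    Hypothetical model: A ~ U W U^T (follow network), S ~ U U^T (topic network).
    Returns U, W and the loss value recorded before each update.
    """
    rng = np.random.default_rng(seed)
    U = 0.1 * rng.standard_normal((A.shape[0], dim))
    W = np.eye(dim)
    history = []
    for _ in range(epochs):
        E_A = U @ W @ U.T - A                  # residual on the follow network
        E_S = U @ U.T - S                      # residual on the topic network
        history.append(
            0.5 * np.sum(E_A ** 2)
            + 0.5 * alpha * np.sum(E_S ** 2)
            + 0.5 * lam * (np.sum(U ** 2) + np.sum(W ** 2))
        )
        # Gradients of the loss above; E_S is symmetric because S is.
        grad_U = E_A @ U @ W.T + E_A.T @ U @ W + 2 * alpha * E_S @ U + lam * U
        grad_W = U.T @ E_A @ U + lam * W
        U -= lr * grad_U
        W -= lr * grad_W
    return U, W, history


# Toy data: 6 users, a directed follow matrix, and random topic vectors.
A = np.array([
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [0, 1, 0, 1, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
], dtype=float)
theta = np.random.default_rng(1).dirichlet(np.ones(3), size=6)

S = topic_similarity_network(theta, k=2)
U, W, history = fit_fused_pmf(A, S)
scores = U @ W @ U.T                           # higher score = more likely link i -> j
```

In this sketch `alpha` controls how strongly the topic network shapes the shared factors; setting it to 0 reduces the model to a factorization of the follow network alone, which is one way to see what the fused term contributes.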
Authors Wang Zhiqiang (王智强); Liang Jiye (梁吉业); Li Ru (李茹) (School of Computer & Information Technology, Shanxi University, Taiyuan 030006; Key Laboratory of Computational Intelligence & Chinese Information Processing (Shanxi University), Ministry of Education, Taiyuan 030006)
Source Journal of Computer Research and Development (《计算机研究与发展》); indexed in EI, CSCD, and the Peking University Core Journals list; 2019, No. 2, pp. 306-318 (13 pages)
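Funding National Natural Science Foundation of China (U1435212, 61432011, 61876103); Key Research and Development Program of Shanxi Province (201603D111014); Shanxi Province 1331 Project; Youth Science and Technology Foundation of Shanxi Province (201701D221098)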
Keywords social-information network; link prediction; probability matrix factorization; fusion model; network data analysis
