摘要
由于当前各类主流网络平台的发展呈现出"社交平台内容化、内容平台社交化"的趋势,用户分化也日趋明显,出现了拥有大规模粉丝的超级节点,因此,提出了一种基于社交关系的兴趣挖掘模型。结合矩阵分解和标签传播算法,将用户分为内容发布者和普通用户两类,并分别提取和计算兴趣话题,实现了在大规模异构网络中发现、挖掘用户兴趣。基于知乎数据集上设计的对比实验,验证了模型的有效性以及算法的性能优势。与基线方法相比,这种算法在查全率上最大提升约42%,F1值最大提升约33%。
Due to great advances in the mobile Internet,the Social Network Service(SNS)has become an indispensable service.The development of current mainstream social media shows a trend that social service and information service are combined and interwork to provide a better experience.Meanwhile there is increasing polarization among users.The heterogeneous features,say,the combination of information content and sociality as well as the polarization of user roles,present challenges to traditional research in social media.Some studies of social media are mainly based on the equal position among nodes or similar relations.If the algorithms brought about by these studies are applied directly to the networks where the users are highly polarized,the results may be distorted or even be quite different from the fact.A new model for interests mining based on social relations is proposed in this paper.Dealing with the polarization in social media,we incorporate matrix factorization and the label propagation algorithm to treat information disseminators and average users,respectively,in order to discover interests of average users in a large-scale heterogeneous network.The validness of the model and the performance and advantages of the algorithm are tested and verified in Zhihu datasets.Experiments show that the maximum increase in the recall of the proposed method,compared with the baseline,is 42%.
作者
屠守中
闫洲
卫玲蔚
朱小燕
TU Shouzhong;YAN Zhou;WEI Lingwei;ZHU Xiaoyan(School of Computer Science and Technology, Tsinghua Univ., Beijing 100084, China;Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China)
出处
《西安电子科技大学学报》
EI
CAS
CSCD
北大核心
2019年第2期83-88,共6页
Journal of Xidian University
基金
国家自然科学基金(61332007)
关键词
异构网络
社交网络
兴趣模型
非负矩阵分解
标签传播
heterogeneous network
social networks
interest model
non-negative matrix factorization
label Propagation