摘要
社交网络特征和用户关系是社交网络分析研究的重要内容。该文对移动社交网络中存在的幂律分布及用户亲属关系判别问题进行研究。在幂律分布的研究中,该文在度、连通子图规模及用户联系人数量的分布中找出存在的三个幂律分布,同时分析其中规律和结论,并与其他社交网络进行对比。在该文亲属关系判别研究中,通过提取用户通话行为的多种显著特征,采用GBDT(gradient boost decision tree)与LR(logistic regression)融合方法,提出一种用户亲属关系判别模型,并通过实验验证该模型能有效判别出用户间是否存在亲属关系,判别精确率达到81.01%。
Social network structure and user relationship are the important topicsin social network analysis. In this paper,we study the power-law distribution and the identification of kinship between users for the mobile social net- work. Three power law distributions are revealed in the distribution of degree, connected sub-graph scale and the us- er contacts,which are compared with other social networks. We study the identification model of kinship by using GBDT (Gradient Boost Decision Tree) and LR (Logistic Regression) fusion method by extracting a variety of salient features of user's call behavior. The experiment indicates that the model can determine whether there is a kinship be- tween users at a precision of 81.01%.
作者
张树森
魏玉党
梁循
窦勇
许媛
梁天新
ZHANG Shusen;WEI Yudang;LIANG Nun;DOU Yong;XU Yuan;LIANG Tianxin(School of Information,Renmin University of China,Beijing 100872,China;National Key Laboratory of Parallel and Distributed Processing(PDL),National University of Defense Technology,Changsha,Hunan 410073,China)
出处
《中文信息学报》
CSCD
北大核心
2018年第6期114-123,共10页
Journal of Chinese Information Processing
基金
国家自然科学基金(71531012)
国家自然科学基金(U1435219)
关键词
社交网络
幂律分布
亲属关系
social networks
power-law distribution
kinship