基于推文与属性的社交网络用户重识别方法

Method for Users Re-Identification across Social Networks Based on Tweets and Attributes

下载PDF

导出

摘要大数据隐私安全正成为各界关注的热点.攻击者通过识别用户不同网站的账户,可以构建用户的完整画像,对用户隐私形成威胁.模拟评估攻击者的重识别能力是进行用户隐私保护的前提.因此,本文提出一种高相似同天同行为算法.该算法通过检测账户在不同网站是否存在多次同天发表相近或相同内容的行为,判断账户是否属于同一用户,并通过为用户属性构建一种权重计算模型,进一步提高用户重识别的准确率.经过对两个国内主流社交网站的一万多用户进行实验,本文算法表现出良好的效果.实验表明,即使不考虑用户社交关系,用户的推文与属性依然提供了足够的信息使攻击者将用户不同网站的账户相关联,从而导致更多的隐私被泄露. Big data Privacy security is becoming the hot spot in the various social industries, because attackers can build an integrate portrait to threaten privacy of users by identifying accounts in different sites. Simulation assessment of the attacker re-identification ability is the precondition of users＇ privacy protection. Therefore, this paper proposes a high similarity algorithm in same day with same behaviors. The core idea of the algorithm is as follows： if a couple account issues similar or identical content on the same day, which also appears many times in different websites, then these two accounts may belong to a person with a high possibility. In addition, this paper builds a new weighting model for the users＇ attributes to improve the accuracy of user re-identification. After the experiment on more than ten thousand users of the two major domestic social networking site, this algorithm proves to be effective. Experimental results show that even if attacker don＇t consider users＇ social relations, the users＇ tweets, attributes, still provide enough information to make the attacker correlate their different accounts, which will lead to leak of more privacy.

作者高伟张敏

机构地区中国科学院大学中国科学院软件研究所

出处《计算机系统应用》 2017年第12期94-103,共10页 Computer Systems & Applications

基金国家自然科学基金重点项目(61232005) 国家自然科学基金(61402456)

关键词社交网络用户重识别推文属性相似度 social network users re-identification tweets attributes similarity

分类号 TP309 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

1张君燕.“众筹字典”欢乐多[J].发明与创新（高中生）,2017(11):60-60.
2范杰.尹丹:硕高,为中国品质代言[J].中华儿女,2017,0(23):90-91.
3杨萍,赵张燕,撒灵,乔意凡,邓小燕,张心歌.基于嵌入式Linux的智能家居机器人[J].电子世界,2017,0(20):167-168. 被引量：1
4俄罗斯拟禁止军人网上发布照片视频[J].保密工作,2017,0(10):57-57.
5严文博,姚远志,张卫明,俞能海.基于二维码和信息隐藏的物流系统隐私保护方案[J].网络与信息安全学报,2017,3(11):22-28. 被引量：9
6张海云.基于载波聚合场景下多点协作及多天线技术的分析[J].数码世界,2017,0(12):189-189.
7龚宁静,冷静.一种基于LBS的多用户位置共享方法MULS[J].软件导刊,2017,16(12):143-146. 被引量：1
8张娴.针对MySQL数据库的自动化工具的注入与防御[J].数码世界,2017,0(12):557-558.
9阚泓,傅君.Google交互纺织品专利分析[J].纺织导报,2017(10):87-88. 被引量：3
10余翔,刘志红,闫冰冰.基于社交关系的移动机会网络路由算法[J].计算机工程,2017,43(12):98-102. 被引量：2

计算机系统应用

2017年第12期

浏览历史

内容加载中请稍等...

基于推文与属性的社交网络用户重识别方法

相关作者

相关机构

相关主题

浏览历史