An Efficient Clustering Algorithm for k-Anonymisation (cited by: 4)

Abstract: K-anonymisation is an approach to protecting individuals from being identified from data. Good k-anonymisations should retain data utility and preserve privacy, but few methods have considered these two conflicting requirements together. In this paper, we extend our previous work on a clustering-based method for balancing data utility and privacy protection, and propose a set of heuristics to improve its effectiveness. We introduce new clustering criteria that treat utility and privacy on equal terms and propose sampling-based techniques to optimally set up its parameters. Extensive experiments show that the extended method achieves good accuracy in query answering and is able to prevent linking attacks effectively.
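To make the k-anonymity requirement concrete, the following is a minimal Python sketch, not the method proposed in the paper. It assumes a toy table with a single numeric quasi-identifier called `age`, and the helper names `is_k_anonymous` and `generalise_ages` are hypothetical. It groups records sorted by age into clusters of at least k members and replaces each age with its cluster's range, without any of the utility or privacy criteria the abstract refers to.

```python
from collections import Counter


def is_k_anonymous(rows, quasi_identifiers, k):
    """Return True if every combination of quasi-identifier values
    appears in at least k rows."""
    counts = Counter(tuple(row[q] for q in quasi_identifiers) for row in rows)
    return all(c >= k for c in counts.values())


def generalise_ages(rows, k):
    """Illustrative grouping (an assumption, not the paper's algorithm):
    sort rows by age, cut them into consecutive groups of k, and replace
    each age with the (min, max) range of its group."""
    order = sorted(range(len(rows)), key=lambda i: rows[i]["age"])
    groups = [order[i:i + k] for i in range(0, len(order), k)]
    # Fold an undersized final group into the one before it.
    if len(groups) > 1 and len(groups[-1]) < k:
        tail = groups.pop()
        groups[-1].extend(tail)
    out = [dict(r) for r in rows]  # copy so the input is left untouched
    for group in groups:
        low = rows[group[0]]["age"]
        high = rows[group[-1]]["age"]
        for i in group:
            out[i]["age"] = (low, high)
    return out


if __name__ == "__main__":
    table = [{"age": a} for a in [34, 23, 47, 25, 31]]
    print(is_k_anonymous(table, ["age"], 2))
    print(is_k_anonymous(generalise_ages(table, 2), ["age"], 2))
```

Wider ranges give stronger protection but less precise query answers, which is the utility and privacy trade-off the abstract describes.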

Author: Grigorios Loukides

Affiliation: School of Computer Science

Source: Journal of Computer Science & Technology (SCIE, EI, CSCD), 2008, Issue 2, pp. 188-202 (15 pages). Chinese title: 计算机科学技术学报（英文版）

Keywords: k-anonymisation, data privacy, greedy clustering

Classification: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]
