期刊文献+

面向高维数据发布的差分隐私算法及应用综述

Survey of Differential Privacy Algorithms and Applications for High-Dimensional Data Publishing
下载PDF
导出
摘要 随着大数据和机器学习技术的进一步发展,处理具有几十上百维特征的复杂结构和关系且蕴含丰富语义信息的高维数据成为一项挑战。在保障个人隐私不被泄露的前提下,如何安全地使用这些高维数据,成为当前的一个重要话题。我们查阅资料发现:关于差分隐私技术本身的综述很多,但是面向高维数据发布的差分隐私算法及应用的综述却很少。基于此,本文通过对差分隐私在高维数据领域的应用进行综述,深入了解不同方法在保护高维数据隐私方面的优劣,并指导面向高维数据发布的差分隐私算法未来研究的方向,从而更好地应对隐私保护和数据分析的挑战。本文首先介绍了差分隐私的原理和特性,总结了当前差分隐私技术本身的研究工作。然后从数据降维和数据合成两个角度分析了差分隐私在高维数据环境中的应用,探讨了差分隐私面临的问题和挑战,并提出了初步的解决方法,旨在更好地解决当前高维数据保护和使用的问题。最后,本文提出了未来可能的研究方向以促进技术交流,推动差分隐私在高维数据应用中的进一步突破。 With the further development of big data and machine learning technologies,handling high-dimensional data with complex structures,relationships,and rich semantic information containing dozens to hundreds of features has become a challenge.Safely utilizing such high-dimensional data,while ensuring the privacy of individuals,has become a significant topic today.Upon reviewing existing literature,we found numerous reviews on differential privacy technology itself,but few on the algorithms and applications of differential privacy specifically tailored for high-dimensional data.Therefore,this paper provides a review of the application of differential privacy in the field of high-dimensional data,aiming to delve into the strengths and weaknesses of different methods in protecting the privacy of high-dimensional data and to guide future research directions for differential privacy algorithms tailored for high-dimensional data publishing.Firstly,this paper introduces the principles and characteristics of differential privacy,summarizing the current research work on the technology itself.Then,it analyzes the application of differential privacy in high-dimensional data environments from the perspectives of data dimensionality reduction and data synthesis,discussing the challenges and issues faced by differential privacy and proposing preliminary solutions to better address the issues of privacy protection and data analysis in the current high-dimensional data landscape.Lastly,potential future research directions are proposed to facilitate technological exchange and further advancements in the application of differential privacy in high-dimensional data settings.
作者 龙春 秦泽秀 李丽莎 李婧 杨帆 魏金侠 付豫豪 LONG Chun;QIN ZeXiu;LI LiSha;LI Jing;YANG Fan;WEI JinXia;FU YuHao(Computer Network Information Center,Chinese Academy of Sciences,Beijing 100083,China;University of Chinese Academy of Sciences,Beijing 100049,China)
出处 《农业大数据学报》 2024年第2期170-184,共15页 Journal of Agricultural Big Data
基金 国家重点研发计划:金融数据全周期流转安全风险评估监测与溯源技术研究(2023YFC3304704) 中国科学院网络安全和信息化专项(CASWX2022GC-04) 中国科学院青年创新促进会项目(2022170)。
关键词 差分隐私 高维数据 扰动机制 隐私分配 differential privacy high-dimensional data perturbation mechanism privacy allocation
  • 相关文献

参考文献14

二级参考文献132

  • 1何玲,吴限.全国“信易贷”平台取得四方面积极成效[J].中国信用,2021(6):32-32. 被引量:1
  • 2袁康.金融科技的技术风险及其法律治理[J].法学评论,2021,39(1):115-130. 被引量:56
  • 3孙慧中,杨健宇,程祥,苏森.一种基于随机投影的本地差分隐私高维数值型数据收集算法[J].大数据,2020,6(1):3-11. 被引量:4
  • 4SWEENEY L. ^-anonymity: a model for protecting privacy[ J ]. Inter-national Journal on Uncertainty, Fuzziness and Knowledge-based Systems,2002,10(5) :557-570.
  • 5SWEENEY L. Achieving A>anonymity privacy protection using gener-alization and suppression[ J]. International Journal on Uncertainty,Fuzziness and Knowledge-based Systems, 2002,10(5) : 571-588.
  • 6Li Ning-hui, LI Tian-cheng, VENKATASUBRAMANIAN S. (-closeness :privacy beyond A:-anonymity and /-diversity [ C ] //Proc of the 23rd International Conference on Data Engineering. Washington DC: IEEE Computer Society ,2007 :106-115.
  • 7MACHANAVAJJHALA A,KIFER D, GEHRKE J, et al. /-diversity; privacy beyond A:-anonymity [ C ] //Proc of the 22nd International Conference on Data Engineering. Washington DC:IEEE Computer Society,2006 :24-35.
  • 8CORMODE G,PROCOPIUC M,SRIVASTAVA D. et aL Differentially private publication of sparse data [ J ]. ArxiV Preprint arXiv : 1103. 0825,2011.
  • 9SARATHY R,MURALIDHAR K. Some additional insights on applying differential privacy for numeric data [ C ]//Proc of International Conference on Privacy in Statistical Databases. Berlin : Springer-Ver-lag,2010:210-219.
  • 10DWORK C, NAOR M,PITASSI T,et al. Pan-private streaming algorithms [C ] //Proc of the 1st Symposium on Innovations in Computer Science. Beijing:Tsinghua University Press, 2010.

共引文献233

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部