期刊文献+

基于OPTICS聚类的差分隐私保护算法的改进 被引量:7

Improvement of differential privacy protection algorithm based on OPTICS clustering
下载PDF
导出
摘要 采用聚类算法预先处理个人隐私信息实现差分隐私保护,能够减少直接发布直方图数据带来的噪声累积现象,同时减小了直方图因合并方式不同带来的重构误差。针对DP-DBSCAN差分隐私算法存在对数据参数输入敏感问题,将基于密度聚类的OPTICS算法应用于差分隐私保护中,并提出改进的DP-OPTICS差分隐私保护算法,对稀疏型数据集进行压缩处理,对比采用同方差噪声和异方差噪声两种添加噪声方式,考虑攻击者能够攻破隐私信息的概率,确定隐私参数ε的上界,有效平衡了敏感信息的隐私性和数据的可用性之间的关系。将DP-OPTICS算法和基于OPTICS聚类的差分隐私保护算法、DP-DBSCAN算法进行对比,DP-OPTICS算法在时间消耗上介于其余二者之间,但是在取得相同参数的情况下,聚类的稳定性在三者中最好,因此改进后OP-OPTICS差分隐私保护算法总体上是可行的。 Clustering algorithm is used to preprocess personal privacy information in order to achieve differential privacy protection, which can reduce the reconstruction error caused by directly distributing histogram data, and the reconstruction error caused by different combining methods of histogram. Aiming at the problem of sensitivity to input data parameters in DP- DBSCAN ( Differential Privacy-Density-Based Spatial Clustering of Applications with Noise) differential privacy algorithm, the OPTICS (Ordering Points To Identify Clustering Structure) algorithm based on density clustering was applied to differential privacy protection. And an improved differential privacy protection algorithm, called DP-OPTICS (Differential Privacy- Ordering Points To Identify Clustering Structure) was introduced, the sparse dataset was compressed, the same variance noise and different variance noise were used as two noise-adding ways by comparison, considering the probability of privacy information's being broken by the attacker, the upper bound of privacy parameter 8 was determined, which effectively balanced the relationship between the privacy of sensitive information and the usability of data. The DP-OPTICS algorithm was compared with the differential privacy protection algorithm based on OPTICS clustering and DP-DBSCAN algorithm. The DP-OtrrlCS algorithm is between the other two in time consumption. However, in the case of having the same parameters, the stability of the DP-OPTICS algorithm is the best among them, so the improved OP-OPTICS differential privacy protection algorithm is generally feasible.
出处 《计算机应用》 CSCD 北大核心 2018年第1期73-78,共6页 journal of Computer Applications
基金 国家自然科学基金资助项目(61462009) 广西民族大学科研基金资助项目(2014MDYB029) 广西民族大学研究生科研创新资助项目(gxun-chxps201671) 广西民族大学中国-东盟研究中心(广西科学实验中心)2014年度开放课题项目(TD201404)~~
关键词 聚类算法 个人隐私 重构误差 差分隐私保护 OPTICS算法 clustering algorithm personal privacy reconstruction error differential privacy protection Ordering PointsTo Identify Clustering Structure (OPTICS) algorithm
  • 相关文献

参考文献4

二级参考文献60

  • 1Zeng H-J,He Q-C,Chen Z,et al.Learning To Cluster Web Search Results[A].In:Proceedings of the 27th Int.Conf.on Research and Development in Information Retrieval (SIGIR'04)[C].July 2004.210-217.
  • 2Ankerst M,Breunig M,Kriegel H-P,et al.OPTICS:Ordering Points to Identify the Clustering Structure[A].In:Proc 1999 ACM-SIGMOD Int.Conf.on Management of Data[C].Philadelphia,PA,June 1999.49-60.
  • 3Dubes R C and Jain A K.Algorithms for Clustering Data[M].Prentice Hall,1988.
  • 4Ester M,Kriegel H-P,Sander J,et al.A DensityBased Algorithm for Discovering Clusters in Large Spatial Databases with Noise[A].In:Proc 1996 Int.Conf.on Knowledge Discovery and Data Mining (KDD'96)[C].Portland,Oregon:AAAI Press,1996.226-231.
  • 5Blum A,Dwork C,McSherry F,et al.Practical Privacy:The SuLQ Framework[C] //24th ACM SIGMOD International Conference on Management of Data / Principles of Database Systems,Baltimore (PODS 2005).Baltimore,Maryland,USA,June 2005.
  • 6Dwork C.Differential Privacy[C] //33rd International Colloquium on Automata,Languages and Programming,part Ⅱ (ICALP 2006).Venice,Italy,Springer Verlag,July 2006.
  • 7Dwork C.Differential Privacy:A Survey of Results[C] //Theory and Applications of Models of Computation(TAMC2008).Xi'an,China,Springer Verlag,April 2008.
  • 8Dwork C.The Differential Privacy Frontier[C] //6th Theory of Cryptography Conference (TCC 2009).San Francisco,CA,Springer Verlag,March 2009.
  • 9Dwork C.Differential Privacy in New Settings[C] //Symposium on Discrete Algorithms (SODA),Society for Industrial and Applied Mathematics.Austin,TX,January 2010.
  • 10Dwork C.A Firm Foundation for Private Data Analysis[J].Communications of the ACM,2011,54 (1):86-95.

共引文献102

同被引文献60

引证文献7

二级引证文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部