An Enhanced Differential Privacy Data Release Algorithm
Cited by: 3
Abstract: To improve the classification accuracy of released data at the same level of privacy protection, this paper proposes GiniDiff, an enhanced differential privacy data release algorithm built on the DiffGen algorithm. The algorithm first fully generalizes the original dataset. In each iteration it selects a specialization scheme with the exponential mechanism and, in the manner of building a decision tree, assigns the specialized records to new equivalence classes. It then uses the Laplace mechanism to add noise to the equivalence-class counts and generates the dataset for release. Because the algorithm measures the utility of candidate specialization schemes by Gini-index gain, allocates the privacy budget reasonably, and computes budget consumption dynamically, the utility of the released dataset is effectively improved. Experimental results show that data released by the algorithm achieve higher classification accuracy than data released by DiffGen, and that the accuracy is close to the ideal level.
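The three building blocks named in the abstract (Gini-index gain as the utility score, the exponential mechanism for choosing a specialization, and the Laplace mechanism for noisy counts) can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function names, the `sensitivity` parameter (which the caller must supply, since the abstract does not state the sensitivity of the score), and the toy data are all assumptions made for illustration, and the budget allocation scheme is not covered.

```python
import math
import random
from collections import Counter


def gini(labels):
    """Gini index of a list of class labels: 1 - sum(p_i^2)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def gini_gain(parent_labels, child_label_groups):
    """Reduction in Gini index when the parent is split into the child groups."""
    n = len(parent_labels)
    if n == 0:
        return 0.0
    weighted = sum(len(g) / n * gini(g) for g in child_label_groups)
    return gini(parent_labels) - weighted


def exponential_mechanism(candidates, scores, epsilon, sensitivity, rng=random):
    """Pick a candidate with probability proportional to exp(eps * score / (2 * sensitivity)).

    `sensitivity` is an assumed input: the caller must supply a valid bound
    for the score function being used.
    """
    top = max(scores)  # shift by the maximum for numerical stability
    weights = [math.exp(epsilon * (s - top) / (2.0 * sensitivity)) for s in scores]
    return rng.choices(candidates, weights=weights, k=1)[0]


def laplace_noisy_count(true_count, epsilon, sensitivity=1.0, rng=random):
    """Add Laplace(0, sensitivity / epsilon) noise to a count.

    The difference of two i.i.d. exponential variables is Laplace distributed.
    """
    scale = sensitivity / epsilon
    noise = rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)
    return true_count + noise


# Toy example (hypothetical data): two candidate specializations of four records.
parent = ["yes", "yes", "no", "no"]
split_a = [["yes", "yes"], ["no", "no"]]  # separates the classes perfectly
split_b = [["yes", "no"], ["yes", "no"]]  # does not separate them at all
scores = [gini_gain(parent, split_a), gini_gain(parent, split_b)]

rng = random.Random(0)
chosen = exponential_mechanism(["A", "B"], scores, epsilon=1.0, sensitivity=1.0, rng=rng)
noisy = laplace_noisy_count(len(parent), epsilon=1.0, rng=rng)
```

In this toy setting the first split has a higher Gini gain than the second, so the exponential mechanism favors it, but with a small `epsilon` the second split can still be selected. That randomness is what the privacy guarantee relies on.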
Source: 《计算机工程》 (Computer Engineering), indexed in CAS, CSCD, and the Peking University Core Journals list; 2017, No. 4, pp. 160-165 (6 pages)
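Funding: National Natural Science Foundation of China (61370220); Henan Province University Science and Technology Innovation Team Support Program (15IRTSTHN010); Henan Province Science and Technology Research Program (142102210425); Key Basic Research Program of Science and Technology of the Education Department of Henan Province (13A520240, 14A520048); Henan University of Science and Technology Research Innovation Capability Cultivation Fund (2013ZCX022)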
Keywords: differential privacy; data release; decision tree; Gini-index gain; exponential mechanism; Laplace mechanism