Abstract
Privacy-preserving data mining (PPDM) has become increasingly important because it allows privacy-sensitive data to be shared for analytical purposes. A large number of privacy techniques have been developed, most of them based on the k-anonymity property, which has several shortcomings; consequently, other privacy models were introduced (l-diversity, p-sensitive k-anonymity, (α, k)-anonymity, t-closeness, etc.). Although these models differ in their methods and in the quality of their results, they all focus first on masking the data and only afterwards on protecting its quality. This paper proposes an enhanced privacy technique that combines several anonymity models to maintain both privacy and data utility. It considers the sensitivity of attribute values in queries through sensitivity weights that support utility-based anonymization: only queries whose sensitive attributes have a total weight exceeding a threshold are modified using generalization boundaries, while the remaining queries can be published directly. The threshold is computed from the weights assigned to the individual attributes, which reflect the utility of each attribute. Experimental results obtained with the UT Dallas anonymization toolbox on the real-world Adult data set from the UCI Machine Learning Repository show that the proposed technique preserves privacy while also maintaining the utility of the published data.
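As a rough illustration of the weighting idea summarized above (a minimal sketch, not the paper's implementation; the attribute names, weight values, threshold, and the `generalize` placeholder are all hypothetical), the following Python snippet sums the sensitivity weights of the attributes in a query and applies generalization only when the total exceeds the threshold:

```python
# Illustrative sketch only: weights, threshold, and generalize() are hypothetical,
# not the values or procedure used in the paper.

# Per-attribute sensitivity weights reflecting how sensitive each attribute is.
SENSITIVITY_WEIGHTS = {
    "age": 0.2,
    "zipcode": 0.3,
    "occupation": 0.4,
    "disease": 0.9,
}

THRESHOLD = 1.0  # hypothetical threshold derived from the attribute weights


def total_weight(query_attributes):
    """Sum the sensitivity weights of the attributes appearing in a query."""
    return sum(SENSITIVITY_WEIGHTS.get(attr, 0.0) for attr in query_attributes)


def generalize(record, attributes):
    """Placeholder for generalization within predefined boundaries,
    e.g. replacing an exact value with a coarser range or '*'."""
    return {attr: ("*" if attr in attributes else value)
            for attr, value in record.items()}


def publish(record, query_attributes):
    """Generalize only when the query's total sensitivity weight exceeds the
    threshold; otherwise the record can be published directly."""
    if total_weight(query_attributes) > THRESHOLD:
        return generalize(record, query_attributes)
    return record


if __name__ == "__main__":
    record = {"age": 34, "zipcode": "75080", "occupation": "nurse", "disease": "flu"}
    print(publish(record, ["age", "zipcode"]))      # total 0.5, below threshold: published as-is
    print(publish(record, ["zipcode", "disease"]))  # total 1.2, exceeds threshold: generalized
```

The point of the sketch is only the decision rule: records touched by low-weight (low-sensitivity) queries keep their original values, preserving utility, while high-weight queries trigger generalization within the allowed boundaries.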