期刊文献+

基于MapReduce的并行抽样路径K-匿名隐私保护算法 被引量:3

A parallel sampling path K-anonymity privacy protection based on MapReduce
下载PDF
导出
摘要 K-匿名算法及现存K-匿名改进算法大多使用牺牲时间效率降低发布数据信息损失量的方法实现数据的匿名化,但随着数据量的急剧增长,传统的数据匿名化方法已不适用于对较大数据的处理。针对K-匿名算法在单机执行过程中产生大量频繁项集和重复搜索数据表的缺点,将MapReduce模型引入到抽样泛化路径K-匿名算法中对其进行优化。该方法兼具MapReduce及抽样泛化算法的优点,高效分布式匿名化数据集,降低发布数据集信息损失量,提高数据的可用性。实验结果表明:当数据量较大时,该优化算法在时间效率及数据精度方面有显著提高。 K-anonymous algorithm and improved algorithm K-anonymous mostly use the method of sacrificing of time to lower data information loss to realize the data anonymity, but with the rapid growth of data quantity, the traditional methods of data anonymity is not suitable for processing of large data. Aimed at the shortage of time complexity and execution efficiency K-anonymity in stand-alone that it generates a lot of frequent sets and searches the data table repeatedly, this paper introduces the MapReduce technology to K-anonymity algorithm to optimize this algorithm. The algorithm with the advantage of MapReduce and sampling generic algorithm can compute distributed anonymous data set effectively and reduce the information loss of released data set, so it improves the availability of data. The experimental results show that the algorithm increases observably in time efficiency and data accuracy.
出处 《电子技术应用》 北大核心 2017年第9期132-136,共5页 Application of Electronic Technique
基金 宿迁市科技计划项目(Z201445 S201410 Z201448) 宿迁学院科研基金项目(2013KY13)
关键词 MAP REDUCE K-匿名 抽样 MapReduce K-anonymity sample
  • 相关文献

参考文献5

二级参考文献77

  • 1宁焕生,张瑜,刘芳丽,刘文明,渠慎丰.中国物联网信息服务系统研究[J].电子学报,2006,34(B12):2514-2517. 被引量:151
  • 2彭京,唐常杰,程温泉,石葆梅,乔少杰.一种基于层次距离计算的聚类算法[J].计算机学报,2007,30(5):786-795. 被引量:11
  • 3J Dean,S Ghemawat.MapReduce:Simplified data processing on large clusters[J].Communications of the ACM,2008,51(1):107-113.
  • 4J L Wagener.High performance fortran[J].Computer Standards & Interfaces,Elsevier,1996,18(4):371-377.
  • 5W Gropp,E Lusk,et al.Using MPI:Portable Parallel Programming with the Message Passing Interface[M].Cambridge:MIT Press,1999.1-350.
  • 6A Geist,A Beguelin,et al.PVM:Parallel Virtual Machine:A Users' Guide and Tutorial for Networked Parallel Computing[M].Cambridge:MIT Press,1995.1-299.
  • 7A Verma,N Zea,et al.Breaking the mapreduce stage barrier .Proc of IEEE International Conference on Cluster Computing .Los Alamitos:IEEE Computer Society,2010.235-244.
  • 8H C Yang,A Dasdan,et al.Map-Reduce-Merge:Simplified relational data processing .Proc of ACM SIGMOD International Conference on Management of Data .New York:ACM,2007.1029-1040.
  • 9S V Valvag,D Johansen.Oivos:Simple and efficient distributed data processing .Proc of IEEE International Conference on High Performance Computing and Communications .Piscataway:IEEE,2008.113-122.
  • 10Z Vrba,P Halvorsen,et al.Kahn process networks are a flexible alternative to mapreduce .Proc of IEEE International Conference on High Performance Computing and Communications .Piscataway:IEEE,2009.154-162.

共引文献343

同被引文献30

引证文献3

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部