期刊文献+

Parzen窗核密度估计的大规模数据模式分类隐私保护方法 被引量:1

A Pattern Classification Privacy Preservation Algorithm Based on Parzen Window Kernel Density Estimation for Large Data Set
原文传递
导出
摘要 针对大规模数据集上的模式分类任务,提出基于Parzen窗核密度估计的模式分类隐私保护算法。利用Parzen窗算法对原始大规模训练集服从的概率密度进行估计,根据估计的概率密度函数构造la个替换训练样本,其中l为原始样本的数目,a通过10折交叉验证方式确定。最后发布替换训练样本进行模式分类,以实现原始数据上的隐私保护。在Adult数据集上的仿真实验充分验证了算法的有效性。 In this paper, a pattern classification privacy preservation algorithm is proposed based on the Parzen window kernel density estimation on large scale dataset. Firstly, the probability density is estimated through the original large scale training set. Then the replacement training samples are constructed by the estimated probability. Finally, the replacement training samples are published for the pattern classification training. Thus the privacy on the original training set can be protected effectively. The simulation experiments on Adult datasets fully verify the effectiveness of the proposed algorithm.
出处 《科技导报》 CAS CSCD 北大核心 2014年第36期104-109,共6页 Science & Technology Review
基金 国家自然科学基金项目(61073041 61073043 61370083 61402126) 黑龙江省自然科学基金项目(F200901) 福建省自然科学基金项目(2011J1296) 高等学校博士学科点基金项目(20112304110011 20112304110012)
关键词 PARZEN窗 核密度估计 数据发布 隐私保护 Parzen window kernel density estimation data publish privacy preserving
  • 相关文献

参考文献11

  • 1Han J W, Kamber M. Data mining: Concepts and techniques[M]. San Francisco, CA: Morgan Kaufmann, 2001: 257-259.
  • 2周水庚,李丰,陶宇飞,肖小奎.面向数据库应用的隐私保护研究综述[J].计算机学报,2009,32(5):847-861. 被引量:219
  • 3周恩策,刘纯平,张玲燕,龚声蓉,刘全.基于时间窗的自适应核密度估计运动检测方法[J].通信学报,2011,32(3):106-114. 被引量:14
  • 4Yang J, Yu X, Xie Z Q. A novel virtual sample generation method based on Gaussian distribution[J]. Knowledge-Based Systems, 2011, 24 (6): 740-748.
  • 5Cortes C, Vapnik V. Support vector networks[J]. Machine Learning, 1995, 20(8): 273-297.
  • 6Quinlan J R. C4.5: Programs for Machine Learning[M]. San Mateo, CA: Morgan Kaufmann, 1993, 17-69.
  • 7Xiao X, Tao Y. Personalized privacy preservation[C]//Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data. Illinois, Chicago: ACM, 2006: 229-240.
  • 8Sweeney L. K-anonymity: A model for protecting privacy[J]. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 2002, 10(5): 557-570.
  • 9Machanavajjhala A, Kifer D, Gehrke J, et al. L-diversity: Privacy beyond K-anonymity[J]. ACM Transactions on Knowledge Discovery from Data, 2007(1): 3-15.
  • 10Agrawal R, Srikant R. Privacy-preserving data mining[J]. ACM Sigmod Record, 2000, 29(2): 439-450.

二级参考文献84

共引文献230

同被引文献10

引证文献1

二级引证文献23

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部