反频繁集挖掘可计算复杂性问题研究

Study of Computational Complexity on Inverse Frequent Set Mining

下载PDF

导出

摘要频繁集挖掘是总结二进制数据的重要技术,但如何找到一个二进制数据集与频繁集挖掘结果相一致却十分困难。文中从可计算复杂度的观点研究了频繁集的隐私保持。特别分析了反频繁挖掘问题的可计算复杂度。给出了决定是否存在与一个已知频繁集兼容的数据集是一个NP难度问题;当原始数据集d由6个集合组成时计算与已知频繁集兼容的数据集的数量是一个P类完全问题。 Frequent set mining is a well known technique to summartize binary data. However, it is difficult to find a binary data set that is compatible with frequent set mining results. The paper studies the frequent sets preserve privacy from the viewpoint of computational complexity, and specially analyzes the computational complexity of inverse frequent set mining. The paper forwards that deciding whether there is a data set compatible with the given frequent sets is NP- hard and computing the number of data sets compatible with the given frequent sets is P- hard even in the case when the original data set d consists of six sets.

作者吕品陈年生董武世

机构地区武汉工程大学计算机科学与工程学院湖北师范学院计算机系

出处《计算机技术与发展》 2006年第4期25-27,共3页 Computer Technology and Development

基金湖北省自然科学基金资助项目(2004ADA023)

关键词反频繁集挖掘隐私保持投影 inverse frequent set mining preserve privacy projection

分类号 TP301.5 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献10

1Mannila H.Local and global methods in data mining:Basic techniques and open problems[A].In:Widmayer P,Triguero F,Morales R,et al.Automata,Languages and Programming,volume 2380 of Lecture Notes in Computer Science[C].[s.l.]:Springer-Verlag,2002.57-68.
2Evfimievski A,Srikant R,Agrawal R,et al.Privacy preserving mining of association rules[A].In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining[C].Edmonton,Alberta,Canada:ACM Press,2002.217-228.
3Oliverira S R M,Zaī O R.Privacy prescrving frequent iternset mining[A].In:Clifton C,Estivill-Castro V.IEEE ICDM Workshop on Privacy,Security,and Data Mining.volume 14of Conferences in Research and Practice in Information Technology[C].Maebashi City,Japan:[s.n.],2002.43-54.
4Gouka K,Zaki M J.Efficiently mining maximal frequent itemsets[A].In:Cercone N,Lin T Y,Wu X.Proceedings of the 2001 IEEE International Conference on Data Mining[C].Washington,DC:IEEE Computer society,2001.163-170.
5Gunopulos D,Khardon R,Mannila H,et al.Discovering all most specific sentences[J].ACM Transactions on Database Systems,2003,28(2):140-174.
6Calders T,Goethals B.Minimal k-free representations of frequent sets[A].In:Lavrac N,Gamgerger D,Todorovski L,et al.Knowledge Discovery in Databases:PKDD 2003,volume 2838 of Lecture Notes in Artifical Intelligence[C].[s.l.]:Springer-Verlag,2003.
7Garey M R,Johnson D S.Computers and Intractability:A Guide to the Theory of NP-Completeness[Z].New YorkSan Francisco:W.H.Freeman and Company,1979.
8Knuth D E.Sorting and Searching,volume 3 of The Art of Computer Programming (2nd ed)[M].Reading,Massachusetts:Addison-Wesley Publishing CO.,1998.
9J ukan S.Extremal Combinatorics:With Applications in Computer Science[A].EATCS Texts in Theoretical Computer Science[C].Berlin:Springer-Verlag,2001.
10Gali Z.Efficient algorithms for finding maximum matchings in graphs[J].ACM Computing Surveys,1986,18(1):23-38.

1吕品,董武世.近似反频繁集挖掘可计算复杂度分析与研究[J].计算机工程与应用,2006,42(24):179-180.
2娄兰芳,潘庆先.基于集合运算的频繁集挖掘优化算法[J].山东大学学报（理学版）,2008,43(11):54-57. 被引量：1
3杨妮妮.基于集合和位运算的频繁集挖掘优化算法[J].科学技术与工程,2009,9(23):7173-7175. 被引量：1
4陈晓云.一种带约束条件的关联规则频繁集挖掘[J].计算机工程与应用,2003,39(2):205-208. 被引量：4
5温磊,李敏强.基于有向项集图的频繁集挖掘优化算法[J].计算机工程,2003,29(22):111-113.
6徐利军,谢康林,徐虹.基于数据流的频繁集挖掘[J].上海交通大学学报,2006,40(3):502-506. 被引量：5
7张月琴.基于0-1矩阵的频繁项集挖掘算法研究[J].计算机工程与设计,2009,30(20):4662-4664. 被引量：8
8王波,钱晓棠,张斌,张明卫.基于连接的频繁集聚类算法[J].辽宁工程技术大学学报（自然科学版）,2005,24(z2):150-152.
9黄剑,李明奇,郭文强.并行Fp-growth算法在搜索引擎中的应用[J].计算机科学,2015,42(S1):459-461 483. 被引量：2
10康雁,黄文奇.求解圆形Packing问题的一个启发式算法[J].计算机研究与发展,2002,39(4):410-414. 被引量：10

计算机技术与发展

2006年第4期

浏览历史

内容加载中请稍等...

反频繁集挖掘可计算复杂性问题研究

参考文献10

相关作者

相关机构

相关主题

浏览历史