期刊文献+

关联规则算法FP-growth的研究与分析 被引量:4

Research and Analysis on FP-growth Association Rules Algorithm
下载PDF
导出
摘要 FP-growth算法是关联规则挖掘中应用最为广泛的挖掘算法,与经典算法Apriori算法最大的区别是不需要挖掘候选集,所以在挖掘效率上有了很大的提升,但是在构建模式树FP-tree时是基于整个事务数据库的,当遇到大型数据库或挖掘约束条件严格时,算法执行过程中占用内存较大,对空间要求较高,且是递归调用,执行效率不高。在对FP-growth算法研究的基础上提出了一种改进算法,该算法改变FP-tree结构,将一棵FP-tree分为多条子树进行频繁模式的挖掘,减少了内存的占用,提高了算法的执行效率。 The FP-growth algorithm is the most widely used algorithm in association rule mining. Compared with classical algorithm Apriori algorithm, the foremost difference is that it has no need of candidate set, so the mining efficiency is greatly improved. But its model tree FP-tree is consmtcted based on the whole transaction database, so the algorithm occupies larger memory and more space in large database or strict mining constraints, and at the same time it is a recursive call, so the execution efficiency is very low. This paper puts forward a kind of improvement algorithm based on research of FP-growth algorithm. This algorithm changes FP-tree's structure, dividing FP-tree into multi subtrees for nfining frequent patterns, reducing the usage of memory, so it improves the efficiency and makes the data mining more easily.
作者 侯长满 余彪
机构地区 中国人民解放军
出处 《计算机与网络》 2016年第24期58-61,共4页 Computer & Network
关键词 数据挖掘 关联规则 频繁模式 FP—growth算法 data mining association rule frequent pattern FP-growth algorithm
  • 相关文献

参考文献7

二级参考文献44

  • 1李立希,李铧汶,杨春燕.可拓学在数据挖掘中的应用初探[J].中国工程科学,2004,6(7):53-59. 被引量:13
  • 2陆介平,杨明,孙志挥,鞠时光.快速挖掘全局最大频繁项目集[J].软件学报,2005,16(4):553-560. 被引量:27
  • 3谭光明,冯圣中,孙凝晖.RNA二级结构预测中动态规划的优化和有效并行[J].软件学报,2006,17(7):1501-1509. 被引量:12
  • 4陈文伟,黄金才.可拓知识与可拓数据挖掘[J].广西师范大学学报(自然科学版),2006,24(4):159-162. 被引量:6
  • 5[1]Han J, Kambe M. Data Mining: Concepts and Techniques. Morgaan Kaufmann Publishers, San Francisco, CA, 2001
  • 6[2]Agrawal R, Srikant R. Fast Algorithms for Mining Association Rules. Proc. 20th Int'l Conf. Very Large Databases, Santiago, Chile, 1994-09: 497-499
  • 7[3]Han J, Pei J, Yin Y. Mining Frequent Patterns Without Candidate Generation. In Proc. 2000 ACM-SIGMOD Int. Conf. Management of Data (SIGMOD'00), Dalas, TX, 2000-05: 1-12
  • 8[4]Pei J. Pattern Growth Methods for Frequent Pattern Mining [doctor thesis]. Simon Fraser University, 2002-06-13
  • 9[5]Burdick D, Calimlim M, Gehrke J. MAFIA: A Maximal Frequent Itemset Algorithm for Transactional Databases. In Int'l Conf. on Data Engineering, 2001-04
  • 10KENNETH L V, MOSES C M. Information retrieval in document spaces using clustering [ M ]. Master' s Thesis, Department of Informatics and Mathematical Modelling Technical University of Denmark, Auguest, 2005.

共引文献246

同被引文献27

引证文献4

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部