期刊文献+

基于关系数据库的关联规则挖掘算法DB-growth 被引量:3

Association rule mining algorithm DB-growth Algorithm bases on relational database
下载PDF
导出
摘要 从大数据中挖掘隐藏的、多维的有价值的关联规则具有广泛的应用价值。关联规则挖掘经典算法Apriori存在重复扫描数据库并产生大量候选项集的瓶颈问题,FP-growth算法虽不产生候选集,但FP-tree不支持大数据的存储与遍历,不能有效支持大数据挖掘;另外,Apriori以及FP-growth算法实施增量挖掘都需要重构关联规则,不适用于增长型事务数据挖掘。针对这些问题,设计基于关系数据库表SourceIndex的DB-growth算法,采用模式组合生成模式串的方式,更新数据库构建频繁集,有效地提高了关联规则的挖掘效率,同时对增量挖掘及深度挖掘也能得到较好的支持。 Excavating potential,multidimensional valuable association rules from big data has wide application.The main association rule mining algorithm Apriori has the bottlenecks of scaning repeately database and generating big number of candidate sets,Though the FP algorithm does not generate candidate sets,but FP-tree can't handle the problem of storage and traversal of big data;In addition,Apriori and FP-growth algorithm needs to reconstruct association rules while implementing increment mining,its not available for growth-oritened data mining.Facing those problems,designing DB-growth algorithm based on relational database table SourceIndex,applying string combinate to generate pattern,insert or update database to construct frequent sets,mining association rules by querying database,in addition,it supports increment mining and depth mining.
出处 《南昌大学学报(理科版)》 CAS 北大核心 2015年第1期25-30 38,共7页 Journal of Nanchang University(Natural Science)
基金 国家自然科学基金项目(62162049)
关键词 关联规则挖掘 APRIORI算法 FP-GROWTH算法 DB-growth算法 增量挖掘 深度挖掘 Association rule mining Apriori algorithm FP-growth algorithm DB-growth algorithm Increment mining Depth mining
  • 相关文献

参考文献13

  • 1Ramesh C. Agarwal,Charu C. Aggarwal,V.V.V. Prasad.A Tree Projection Algorithm for Generation of Frequent Item Sets[J]. Journal of Parallel and Distributed Computing . 2001 (3)
  • 2S. Parthasarathy,M. J. Zaki,M. Ogihara,W. Li.Parallel Data Mining for Association Rules on Shared-Memory Systems[J]. Knowledge and Information Systems . 2001 (1)
  • 3Jiawei Han,Jian Pei.Mining frequent patterns by pattern-growth[J]. ACM SIGKDD Explorations Newsletter . 2000 (2)
  • 4T D Do,S C Hui,A Fong.Mining Frequent Itemsets with Category-Based Constraints. Lecture Notes in Computer Science . 2003
  • 5Agrawal R,Srikant R.Fast algorithms for mining association rules[].Proceedings of the th International Conference on Very Large Data Bases.1994
  • 6C Wang,C Tjortjis.PRICES: An Efficient Algorithm for Mining Association Rules. Lecture Notes in Computer Science . 2004
  • 7Hannu Toivonen.Sampling large databases for association rules. Proceedings of the 22nd International Conference on Very Large Databases (VLDB‘96) . 1996
  • 8Y Li,R P Gopalan.Effective Sampling for Mining Association Rules. Lecture Notes in Computer Science . 2004
  • 9Zaki M J.Parallel and distributed association mining: a survey[].IEEE Concurrency Special Issue on Parellel Mechanisms for Data Mining.1999
  • 10Cheang DW,Han JW,Ng VT,et al.A fast distributed algorithm for mining association rules. Proceedings of IEEE 4 th International Conference Parallel and Distributed Information Systems . 1996

共引文献19

同被引文献18

引证文献3

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部