期刊文献+

一种针对大型事务数据库的关联规则挖掘算法 被引量:2

Algorithm of Association Rule Mining for Large Transaction Databases
下载PDF
导出
摘要 为进一步解决对大型数据库进行关联规则挖掘时产生的CPU时间开销大和I/O操作频繁问题,给出一种改进的关联规则挖掘算法(ARMAC).该算法引入有向无环图和tidlist结构用以提高频繁项目集的计算效率,并将数据库划分为内存可以满足要求的若干部分,解决了对大型数据库挖掘时磁盘操作频繁的问题,从而有效地适用于大型数据库的关联规则挖掘.该算法吸取连续关联规则挖掘(CARMA)算法的优势,只需扫描两次数据库便可完成挖掘过程.实验结果表明:该算法在大型事务数据库中具有更高的执行效率. To further reduce both the large overhead of CPU and frequent operation of I/O occurred in the process of the association rules mining on the large transaction database,this paper presents an improved algorithm of association rule mining(ARMAC).In this algorithm,a directed acyclic graph(DAG) and the tidlist configuration are taken to improve the computing efficiency of the frequent item sets,and the database is partitioned into several parts whose RAM can meet the corresponding demand,thus overcoming the problems of disk’s frequent operation on mining the large database,which is effectively applied to the association rule mining of large database.Taking advantages of the algorithm of continuous association rule mining(CARMA),this improved algorithm can implement the mining by only scanning the database twice.Experimental results show that this proposed algorithm is of higher execution efficiency in large transaction database.
出处 《空军雷达学院学报》 2011年第3期205-208,共4页 Journal of Air Force Radar Academy
关键词 数据挖掘 频繁项集 大型数据库 有向无环图 关联规则 data mining frequent item sets large database directed acyclic graph(DAG) association rules
  • 相关文献

参考文献10

  • 1SAVASERE A, OMIEC1NSKI E, NAVATHE S. An efficient algorithm for mining association rules in large databases [C]//Proc. of Intl. Conf. on Very Large Databases.1995: 432-444.
  • 2HIDBER C. Online association rule mining[C]//Proc, of ACM SIGMOD Intl. Conf. on Management of Data. 1999: 145-156.
  • 3LIN J, DUNFIAM M H. Mining association roles: Antiskew algorithrns[C]//Proc, of Intl. Conf. on Data Engineering (ICDE). 1998: 236-251.
  • 4SHENOY P, HARITSA J R, SUDARSHAN S, et al. Turbo- charging vertical mining of large databases [C]//Proc. of Intl. Conf. Management of Data. 2000: 22-23.
  • 5PUDI V, HARITSA J. Quantifying the utility of the past in mining large databases [J]. Information Systems, 2000, 25 (5):323-343.
  • 6丁艳辉,王洪国,高明,谷建军.一种基于矩阵的关联规则挖掘新算法[J].计算机科学,2006,33(4):188-189. 被引量:13
  • 7HU Ya-han, CHEN Liang. Mining association rules with multiple minimum supports a new mining algorithm and a support tuning mechanism [J]. Decision Support System, 2006, 42(1): 1-24.
  • 8郑泉,王建东.基于FP-树挖掘大数据库的方法及算法PCM[J].计算机工程与应用,2004,40(7):182-184. 被引量:6
  • 9HAN J, PEI J, Y1N Y. Mining frequent pattems without candidate generation [J]. Data Mining and Knowledge Discovery, 2004, 8(1):53-87.
  • 10AGRAWAL R, SRIKANT R. Fast algorithms for mining association rules in large databaes [C]//Proc. of 1994 International Conference on Very Large Databases. 1994: 487-499.

二级参考文献10

  • 1[1]Jiawei Han,Micheline Kamber. Data Mining:Concepts and Techniques.CopyrightC2001 by Morgan Kaufmann Publishers,Inc
  • 2[2]R Agrawal ,R Srikant. Fast algorithms for mining association rules[C].In:Proc 1994 Int Conf Very Large Data Bases(VLDB'94),Santiago,Chile, 1994-09
  • 3[3]J Han,J Pei,Y Yin. Mining frequent patterns without candidate generation[C].In:Pro 2000 ACM-SIGMOD Int Conf Management of Data(SIGMOD'00), Dallas ,TX ,2000-05:1~12
  • 4[4]R Agarwal,C Aggarwal,V V V Prasad. A tree projection algorithm for generation of frequent itemsets. In J Parallel and Distribute Computing, 2000
  • 5Agrawal R, Srikant R. Mining sequential patterns: [IBM Research Report]. 1995
  • 6Agrawal R, Imielinski T, Swami A. Mining association rules between sets in large databases, In: Proc. the ACM SIGMOD Conf,Management of Data, May 1993, 207-216
  • 7Agrawal R, Srikant R. Fast algorithm for mining association rules: [IBM Research Reprot], 1994
  • 8Agrawal R, Mannila H,Toivonen H, et al. Fast Discovery of Association Rules. In Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press,1996. 306-328
  • 9Huang Liusheng, Chen Huaping, Wang Xun , Cheng Guoliang ,A Fast Algorithm for Mining Association Rules, In J, Comput .Sei, & Technol, ,2000,15(6):619-624
  • 10Jiawei Han, Micheline Kamber. Data Mining :Concepts and Techniques[C]. Mongan Kaufmann publishers, 2000. 225-278

共引文献16

同被引文献12

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部