一种改进的基于关联图的关联规则挖掘算法被引量：1

A revised graph-based algorithm for mining association rules

下载PDF

导出

摘要关联规则是数据挖掘研究的一个重要课题 ,而最大频繁项集的生成是影响关联规则挖掘的关键问题 .在已有的频繁集发现算法中 ,DLG算法通过减少事务数据库的扫描次数 ,进而有效减少挖掘过程的I/O代价 .在阐述DLG算法的实现原理与执行过程的基础上 ,为进一步减少候选项集的数量 ,提出一种改进算法DLG .其主要思想是在关联图构造阶段 ,统计每一个频繁项目的入度 ,以此作为剪枝的依据 . Mining association rules is an important part of data mining field. An d generating the frequent itemsets is a key problem of mining association rules. Among the proposed algorithms of finding frequent itemsets, DLG is a efficient algorithm to controls I/O cost by reducing the number of database passes. The pr inciple and implemental process are discussed, and then a revised algorithm is p resented based on DLG in order to cut down the number of candidates further. The p rinciple of DLG is to count the in-degree of each frequent itemset on which a p runing is based in the phase of constructing graphs. Finally, the performance an alysis and comparison experiments are done and the result shows the algorithm is excellent.

作者罗楠李玉忱

机构地区山东大学计算机科学与技术学院

出处《山东大学学报（工学版）》 CAS 2004年第1期99-103,共5页 Journal of Shandong University（Engineering Science）

关键词关联规则关联图比特向量 association rules association bit vector

分类号 TP311.13 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献1

1蔡之华,吕维先,颜雪松.基于关联图的关联规则挖掘算法研究[J].小型微型计算机系统,2002,23(4):450-452. 被引量：15

二级参考文献4

1[1]J.S.park,M.S.Chen,P.S.Yu.An efficientive Hash-based algo rithm for mining association rules[C].Processings of ACM SIG MOD,1995 24 (2):175～186
2[2]Maurice Houtsma Arun Swami.Set-oriented mining of association rules [C].In int'l Conf.On Data Enginnering,Taibe,Taiwan.March 1995
3[3]R.Agrawal,Tomasz Imielinski,Arun Swami.Mining association rules between sets of items in large databases [C].In Proc.Washington,D.C.Of the ACM SIGMOD Conference on Man agement of Data.may 1993 207～216.
4[4]R.Agrawal,ramakrishnan Scrikant.Fast algorithms for mining association rules[C].In Proc.Of the 20th Iht' 1 coference on Very large databases,Santiago,Chile,Sept.,1994 487～499

共引文献14

1刘晓玲.一种利用逻辑运算挖掘关联规则的算法[J].济南职业学院学报,2007(1):58-59.
2牛小飞,石冰,卢军,吴科.挖掘关联规则的高效ABM算法[J].计算机工程,2004,30(11):118-120. 被引量：16
3袁宁,石冰,吴卫华.一种高效的挖掘量化关联规则的MQAR算法[J].计算机与现代化,2004(9):21-24.
4曹慧.向量矩阵挖掘关联规则的算法设计[J].计算机工程与科学,2004,26(11):69-70.
5刘晓玲,李玉忱.一种利用逻辑“与”运算挖掘频繁项集的算法[J].中国科技信息,2005(15A):122-123. 被引量：2
6何丽君,董蕊,袁克杰.常见关联规则算法分析与比较[J].大连民族学院学报,2005,7(5):39-42. 被引量：6
7刘晓玲,李玉忱.一种不产生候选项集的关联规则挖掘算法[J].山东师范大学学报（自然科学版）,2006,21(1):46-48. 被引量：2
8冯洁,陶宏才.一种频繁项集的快速挖掘算法[J].微计算机信息,2007(18):164-166. 被引量：7
9巫红霞,谢强.基于有向图的频繁集挖掘算法[J].湖州师范学院学报,2008,30(1):65-69.
10刘瑞祥,邹海.对挖掘关联规则中的Apriori算法的一种改进[J].计算机与现代化,2009(7):5-8. 被引量：6

同被引文献19

1HERNANDEZ-LEON R, PALANCAR J H, CARRASCO-OCHOA J A, et al. Algorithms for mining frequent itemsets in static and dynamic datasets [ J ]. Intelligent Data Analysis, 2010, 14(3) :419-435.
2HAN J, KAMBER M. Data mining: concepts and techniques[M]. 2nd ed. San Francisco, CA, USA: Morgan Kaufmann Publisher, 2006.
3PIATETSKY-SHAPIRO G. Data mining and knowledge discovery 1996 to 2005 : overcoming the hype and moving from "university" to "business" and "analytics" [ J ]. Data Mining Knowledge Discovery, 2007, 15 ( 1 ) : 99- 105.
4CHIANG D A, WANG Y F, WANG Y H, et al. Mining disjunctive consequent association rules [J]. Applied Soft Computing, 2011, 11(2): 2129-2133.
5AGRAWAL R, IMIELINSKI T, SWAMI A. Mining associations between sets of items in massive databases[C]//Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data. Washington D C, USA: ACM Press, 1993. 207-216.
6AGRAWAL R, SRIKANT R. Fast algorithms for mining association rules in large databases [ C ]//Proceedings of the 20th International Conference on Very Large Data Bases. Santiago de Chile, Chile: Morgan Kaufmann Publisher, 1994 : 487-499.
7SONG W, YANG B R, XU Z Y. Index-BitTableFI: an improved algorithm for mining frequent itemsets [ J ]. Knowledge-Based Systems, 2008, 21 (6): 507-513.
8VREEKEN J, LEEUWEN M, SIEBES A. Krimp: mining itemsets that compress [J].Data Mining Knowledge Discovery, 2011, 23 ( 1 ) : 169-214.
9HAN J, PEI J, YIN Y, et al. Mining frequent patterns without candidate generation: a frequent-pattern tree approach [ J ]. Data Mining and Knowledge Discovery, 2004, 8 ( 1 ) : 53-87.
10LIU G, LU H, LOU W, et al. Efficient mining of frequent patterns using ascending frequency ordered prefix- tree [ J ]. Data Mining and Knowledge Discovery, 2004, 9 (3) : 249-274.

引证文献1

1宋威,刘文博,李晋宏.基于动态裁剪频繁模式树的频繁项集并发挖掘算法[J].山东大学学报（工学版）,2011,41(4):49-55. 被引量：3

二级引证文献3

1周兴华,陆建峰,汤九斌.基于多线程技术的数据流频繁模式挖掘[J].计算机应用,2013,33(A01):69-72.
2江雨燕,李平.基于PFP-Growth算法的海量频繁项集挖掘[J].计算机技术与发展,2013,23(9):63-65. 被引量：2
3罗芳.一种基于裁剪FP-Tree的频繁项集挖掘算法[J].宜春学院学报,2015,37(12):22-25. 被引量：1

1张伟,陈芸,邹汉斌,周霆.基于倒排文件的布尔规则隐藏算法[J].计算机工程,2005,31(14):97-98. 被引量：1
2尹士闪,马增强,毛晚堆.基于频繁项目集链式存储方法的关联规则算法[J].计算机工程与设计,2012,33(3):1002-1007. 被引量：4
3李鲲鹏,兰巨龙,李印海.基于Bloom filter的高效正则表达式匹配算法[J].计算机应用研究,2012,29(3):950-954. 被引量：4
4黄红星.基于图的关联规则改进算法[J].计算机与数字工程,2009,37(12):38-41. 被引量：1
5赵文栋,张进,彭来献,田畅.一种基于Bloom过滤器的服务模糊匹配算法[J].计算机科学,2013,40(3):175-179. 被引量：3
6黄莎莎.基于特征聚类集成技术的组特征选择方法[J].微型机与应用,2014,33(11):79-82. 被引量：2
7谢伙生,孙金涛.基于比特向量组的数据流邻近序列模式挖掘算法研究[J].福州大学学报（自然科学版）,2012,40(5):567-571.
8朱睿,王斌,杨晓春,王国仁.大数据环境下支持概率数据范围查询索引的研究[J].计算机学报,2016,39(10):1929-1946. 被引量：2
9曾文彬,张虹.UML在系统分析与设计中的应用[J].计算机应用与软件,2007,24(7):93-95. 被引量：13
10叶青,杨家本,柴跃廷.基于粗集理论的知识处理方法在专家系统中的应用[J].信息与控制,2001,30(3):193-198. 被引量：3

山东大学学报（工学版）

2004年第1期

浏览历史

内容加载中请稍等...

一种改进的基于关联图的关联规则挖掘算法被引量：1

参考文献1

二级参考文献4

共引文献14

同被引文献19

引证文献1

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

一种改进的基于关联图的关联规则挖掘算法 被引量：1

参考文献1

二级参考文献4

共引文献14

同被引文献19

引证文献1

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

一种改进的基于关联图的关联规则挖掘算法被引量：1