一种高效的最大频繁项集挖掘算法DFMFI-Miner 被引量：1

An Efficient Algorithm DFMFI-Miner for Mining Maximal Frequent Itemsets

下载PDF

导出

摘要分析最大频繁项集和完全频繁项集的关系,提出了一个挖掘最大频繁项集的高效算法DFMFI M iner(The M iner Basedon D epth-F irst Search ing forM in ingMaximal Frequent Item sets),采用深度优先方法搜索项集空间,采用垂直位图及一定的压缩方法对表示事务数据库并进行约简,并采用多种有效剪枝策略和优化策略,提高了算法的效率。在多个数据集上进行了实验,实验结果表明该算法特别适于挖掘具有长频繁项集的数据集。 The relationship between maximal frequent itemsets and all frequent itemsets is discussed and an efficient algorithm DFMFI - Miner （The Miner Based on Depth - First Searching for Mining Maximal Frequent Itemsets） for mining maximal frequent itemsets is proposed. The algorithm uses the depth -first method to search in itemsets space and the vertical bitmap to represent and compress transaction database. It also uses some efficient pruning strategies to reduce the searching space and decrease the candidate itemsets in order to improve the efficiency. The algorithm is implemented in many datasets and the results of experiment show that the algorithm is especially effective for mining the datasets with long frequent itemsets.

作者陈慧萍王建东王煜

机构地区南京航空航天大学信息科学与技术学院河海大学计算机及信息工程学院

出处《计算机仿真》 CSCD 2006年第7期79-83,共5页 Computer Simulation

基金国家基础研究发展基金(973计划 G1999032701) 江苏省自然科学基金(BK2002091)资助

关键词数据挖掘深度优先搜索频繁项集最大频繁项集 Data mining Depth - first seaching Frequent itemsets Maximal frequent itemsets

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献10

1R Agrawal,T Imielinski and A.N.Swami.Mining association rules between sets of items in large databases[C].In P.Buneman and S.Jajodia,editors,Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data,ACM Press,1993,22(2):207-216.
2R Agrawal and R Srikant.Fast algorithms for mining association rules[C].In J.B.Bocca,M.Jarke,and C.Zaniolo,editors,Proceedings 20th International Conference on Very Large Data Bases,Morgan Kaufmann,1994.487-499.
3R Agrawal,H Mannila,R Srikant,H Toivonen and A I Verkamo.Fast discovery of association rules[M].In U.M.Fayyad,G.Piatetsky-Shapiro,P.Smyth,and R.Uthurusamy,editors,Advances in Knowledge Discovery and Data Mining,MIT Press,1996.307-328.
4Ashoka Savasere,Edward Omiecinski,B Shamkant.Navathe:An Efficient Algorithm for Mining Association Rules in Large Databases[M].VLDB 1995:432-444.
5J S Park,M S Chen and P S Yu.An effective hash-based algorithm for mining association rules[M].SIGMOD'95,San Jose,CA,May 1995.
6J Han,J Pei and Y Yin.Mining Frequent Patterns without Candidate Generation:A Frequent-Pattern Tree Approach Mining Frequent Patterns without Candidate Generation[J].Data Mining and Knowledge Discovery,2004,8:53-87.
7Zaki and Hsiao.CHARM:An Efficient Algorithm for Closed Itemset Mining[C].Proc.2002 SIAM Int.Conf.Data Mining (SDM'02),Arlington,VA,April 2002.457-473.
8J Wang,J Han and J Pei.CLOSET+:Searching for the Best Strategies for Mining Frequent Closed Itemsets[C].Proc.2003 ACM SIGKDD Int.Conf.on Knowledge Discovery and Data Mining (KDD'03),Washington,D.C.,Aug.2003.
9R J Bayardo.Efficiently mining long patterns from databases[M].SIGMOD 98,Seattle,Washington,1998.85-93.
10J Zaki.Scalable Algorithm for association mining[J].Knowledge and Data Engineering.2000,12(2):372-390.

同被引文献6

1张忠平,李岩,杨静.基于矩阵的频繁项集挖掘算法[J].计算机工程,2009,35(1):84-86. 被引量：19
2张笑达,徐立臻.一种改进的基于矩阵的频繁项集挖掘算法[J].计算机技术与发展,2010,20(4):93-96. 被引量：8
3方艾芬,李先通,蔄世明,岳鹏飞.基于关联规则挖掘的伴随车辆发现算法[J].计算机应用与软件,2012,29(2):94-96. 被引量：9
4吴湘华,张祖平.Apriori算法中频繁项集求法的改进[J].科技创新与应用,2013,3(15):58-58. 被引量：1
5郑志娴.基于云计算的Apriori算法设计[J].莆田学院学报,2014,21(5):61-64. 被引量：2
6曹波,韩燕波,王桂玲.基于车牌识别大数据的伴随车辆组发现方法[J].计算机应用,2015,35(11):3203-3207. 被引量：10

引证文献1

1陈瑶,桂峰,卢超,王华.基于频繁项集挖掘算法的伴随车应用与实现[J].计算机应用与软件,2017,34(4):60-64. 被引量：3

二级引证文献3

1刘惠惠,张祖平,龙哲.基于Spark的FP-Growth伴随车辆发现与应用[J].计算机工程与应用,2018,54(8):7-13. 被引量：3
2倪德,马传香.FP-growth算法及其优化在税务系统中的应用[J].计算机应用,2018,38(A02):140-143. 被引量：3
3孔明,魏东,冉义兵,毕国鹏.基于Fork/Join的事务日志伴随模式挖掘方法[J].小型微型计算机系统,2023,44(2):239-247.

1黄红星.挖掘完全频繁项集的蚁群算法[J].微电子学与计算机,2014,31(12):144-147. 被引量：4
2赵群礼.基于FP-Tree的最大频繁项目集综合更新算法[J].安徽教育学院学报,2006,24(3):42-47. 被引量：1
3汪金苗,张龙波,闫光辉,王凤英.一种不确定性数据中最大频繁项集挖掘方法[J].山东理工大学学报（自然科学版）,2013,27(5):17-21. 被引量：1
4蔡进,薛永生,张东站.基于分区分类法快速更新频繁项集[J].计算机工程与应用,2007,43(9):170-173.
5陈凯,冯全源.最大频繁项集的高效挖掘[J].微电子学与计算机,2005,22(8):22-25. 被引量：13
6吴媚,高玲.基于界标窗口的数据流频繁项集挖掘算法的改进[J].山东师范大学学报（自然科学版）,2014,29(3):21-25.
7王少鹏,闻英友,赵宏.滑动窗口下数据流完全加权最大频繁项集挖掘[J].东北大学学报（自然科学版）,2016,37(7):931-936. 被引量：2
8张炘,廖频,郭波.一种基于条件矩阵的最大频繁项集挖掘算法[J].计算机仿真,2010,27(11):73-77.

计算机仿真

2006年第7期

浏览历史

内容加载中请稍等...

一种高效的最大频繁项集挖掘算法DFMFI-Miner 被引量：1

参考文献10

同被引文献6

引证文献1

二级引证文献3

相关作者

相关机构

相关主题

浏览历史