期刊文献+

一种高效的最大频繁项集挖掘算法DFMFI-Miner 被引量:1

An Efficient Algorithm DFMFI-Miner for Mining Maximal Frequent Itemsets
下载PDF
导出
摘要 分析最大频繁项集和完全频繁项集的关系,提出了一个挖掘最大频繁项集的高效算法DFMFI M iner(The M iner Basedon D epth-F irst Search ing forM in ingMaximal Frequent Item sets),采用深度优先方法搜索项集空间,采用垂直位图及一定的压缩方法对表示事务数据库并进行约简,并采用多种有效剪枝策略和优化策略,提高了算法的效率。在多个数据集上进行了实验,实验结果表明该算法特别适于挖掘具有长频繁项集的数据集。 The relationship between maximal frequent itemsets and all frequent itemsets is discussed and an efficient algorithm DFMFI - Miner (The Miner Based on Depth - First Searching for Mining Maximal Frequent Itemsets) for mining maximal frequent itemsets is proposed. The algorithm uses the depth -first method to search in itemsets space and the vertical bitmap to represent and compress transaction database. It also uses some efficient pruning strategies to reduce the searching space and decrease the candidate itemsets in order to improve the efficiency. The algorithm is implemented in many datasets and the results of experiment show that the algorithm is especially effective for mining the datasets with long frequent itemsets.
出处 《计算机仿真》 CSCD 2006年第7期79-83,共5页 Computer Simulation
基金 国家基础研究发展基金(973计划 G1999032701) 江苏省自然科学基金(BK2002091)资助
关键词 数据挖掘 深度优先搜索 频繁项集 最大频繁项集 Data mining Depth - first seaching Frequent itemsets Maximal frequent itemsets
  • 相关文献

参考文献10

  • 1R Agrawal,T Imielinski and A.N.Swami.Mining association rules between sets of items in large databases[C].In P.Buneman and S.Jajodia,editors,Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data,ACM Press,1993,22(2):207-216.
  • 2R Agrawal and R Srikant.Fast algorithms for mining association rules[C].In J.B.Bocca,M.Jarke,and C.Zaniolo,editors,Proceedings 20th International Conference on Very Large Data Bases,Morgan Kaufmann,1994.487-499.
  • 3R Agrawal,H Mannila,R Srikant,H Toivonen and A I Verkamo.Fast discovery of association rules[M].In U.M.Fayyad,G.Piatetsky-Shapiro,P.Smyth,and R.Uthurusamy,editors,Advances in Knowledge Discovery and Data Mining,MIT Press,1996.307-328.
  • 4Ashoka Savasere,Edward Omiecinski,B Shamkant.Navathe:An Efficient Algorithm for Mining Association Rules in Large Databases[M].VLDB 1995:432-444.
  • 5J S Park,M S Chen and P S Yu.An effective hash-based algorithm for mining association rules[M].SIGMOD'95,San Jose,CA,May 1995.
  • 6J Han,J Pei and Y Yin.Mining Frequent Patterns without Candidate Generation:A Frequent-Pattern Tree Approach Mining Frequent Patterns without Candidate Generation[J].Data Mining and Knowledge Discovery,2004,8:53-87.
  • 7Zaki and Hsiao.CHARM:An Efficient Algorithm for Closed Itemset Mining[C].Proc.2002 SIAM Int.Conf.Data Mining (SDM'02),Arlington,VA,April 2002.457-473.
  • 8J Wang,J Han and J Pei.CLOSET+:Searching for the Best Strategies for Mining Frequent Closed Itemsets[C].Proc.2003 ACM SIGKDD Int.Conf.on Knowledge Discovery and Data Mining (KDD'03),Washington,D.C.,Aug.2003.
  • 9R J Bayardo.Efficiently mining long patterns from databases[M].SIGMOD 98,Seattle,Washington,1998.85-93.
  • 10J Zaki.Scalable Algorithm for association mining[J].Knowledge and Data Engineering.2000,12(2):372-390.

同被引文献6

引证文献1

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部