期刊文献+

基于FP-tree挖掘密集型数据最大频繁模式算法 被引量:2

An Algorithm for Mining Maximal Frequent Patterns of Dense Datasets Based on Fp-tree
下载PDF
导出
摘要 Fp-growth算法是当前挖掘频繁项目集算法中速度最快,应用最广,并且不需要候选集的一种挖掘关联规则的算法.但是,Fp-growth算法也存在着算法结构复杂和空间利用率低等缺点.在FP-tree结构的基础上提出了密集型数据最大频繁模式挖掘算法FP-DMax.算法FP-DMax只需要2次扫描数据库,在挖掘过程中不产生候选项集,大大提高了算法的时空效率.实验表明,算法FP-DMax在挖掘密集型数据最大频繁模式方面是高效的. Fp-growth algorithm is one of the currently fastest and most popular one for mining association rule without candidate generation. However, it has disadvantages such as complicated data structure and lower space utilization rate. This paper develops the algorithm FP-DMax for mining maximal frequent patterns of dense datasets based on the data structure FP-tree. The algorithm only scans the database twice and generates no candidate itemsets. The experiment shows that the algorithm FP-DMax is efficient on mining dense datasets.
出处 《湖南城市学院学报(自然科学版)》 CAS 2007年第1期76-78,共3页 Journal of Hunan City University:Natural Science
基金 湖南省自然科学基金资助项目(06JJ2050)
关键词 数据挖掘 关联规则 最大频繁模式 FP—tree Data mining association rule maximal frequent patterns FP-tree
  • 相关文献

参考文献5

  • 1Agrawal R,Imielinski T,Swami A.Mining association rules between sets of items in large database[C].Peter Buneman,Sushil Jajodia Proceedings of SIGMID'93,Washington D C:ACMPress,1993:207-216.
  • 2Han J,Kambe M.Data Mining:Concepts and Techniques[M].San Francisco:Morgaan Kaufmann Publishers,2001.
  • 3Han J,Pei J,Yin Y.Mining Frequent Patterns Without CandidateGeneration[C].In Proc 2000 ACM-SIGMOD Int Conf Management of Data (SIGMOD'00),Dalas,TX,2000,5:1-12
  • 4Pei J.Pattern Growth Methods for Frequent Pattern Mining[D].Simon Fraser University,2002-06-13
  • 5Burdick D,Calimlim M,Gehrke J.MAFIA:A Maximal FrequentItemset Algorithm for Transactional Databases[C].In Int'l Conf on Data Engineering,2001.

同被引文献14

引证文献2

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部