期刊文献+

关联规则中FP-tree的最大频繁模式非检验挖掘算法 被引量:4

Non-check mining algorithm of maximum frequent patterns in association rules based on FP-tree
下载PDF
导出
摘要 基于FP-tree的最大频繁模式挖掘算法是目前较为高效的频繁模式挖掘算法,针对这些算法需要递归生成条件FP-tree、做超集检验等问题,在分析DMFIA-1算法的基础上,提出了最大频繁模式的非检验挖掘算法NCMFP。该算法改进了FP-tree的结构,使挖掘过程中不需要生成条件频繁模式树也不需要超集检验。算法采用的预测剪枝策略减少了挖掘的次数,采用的求取公共交集的方式保证了挖掘结果的完整性。实验结果表明在支持度相对较小情况下,NCMFP的效率是同类算法的2~5倍。 The algorithms based on FP-tree,for mining maximal frequent patterns,have high performance but with many drawbacks.For example,they must recursively generate conditional FP-trees,have to do the process of superset checking.In order to overcome these drawbacks of the existing algorithms,an algorithm Non-Check Mining algorithm of Maximum Frequent Pattern(NCMFP)for mining maximal frequent patterns was put forward after the analysis of DMFIA-1 algorithm.In the algorithm,neither constructing conditional frequent pattern tree recursively nor superset checking was needed through modifying the structure of FP-tree.This algorithm reduced the number of mining through early prediction before mining.The application of a method to get the public intersection sets could obtain a complete result.The experiment shows that the efficiency of NCMFP is two to five times as much as that of the similar algorithms in the case of a relatively small support.
作者 惠亮 钱雪忠
出处 《计算机应用》 CSCD 北大核心 2010年第7期1922-1925,共4页 journal of Computer Applications
基金 江苏省自然科学基金资助项目(BK20003017)
关键词 关联规则 数据挖掘 频繁模式树 最大频繁项集 超集检验 association rule data mining Frequent Pattern Tree(FP-tree) maximum frequent itemsets superset checking
  • 相关文献

参考文献11

  • 1BAYARDO R.Efficiently mining long patterns from databases[C] // Proceedings of 1998 ACM SIGMOD International Conference on Management of Data.New York:ACM,1998:85-93.
  • 2路松峰,卢正鼎.快速开采最大频繁项目集[J].软件学报,2001,12(2):293-297. 被引量:113
  • 3宋余庆,朱玉全,孙志挥,陈耿.基于FP-Tree的最大频繁项目集挖掘及更新算法[J].软件学报,2003,14(9):1586-1592. 被引量:164
  • 4BURDICK D,CALIMLIM M,GEHRKE J.MAFIA:A maximal frequent itemsets algorithm for transactional databases[C] // Proceedings of the 17th International Conference on Data Engineering.Washington,DC:IEEE Computer Society,2001:443-452.
  • 5GOUDA K,ZAKI M J.Efficiently mining maximal frequent itemsets[C] // Proceedings of the IEEE International Conference on Data Mining.Washington,DC:IEEE Computer Society,2001:163-170.
  • 6ZHOU Q H,WESLEY C,LU B J.SmartMiner:A depth 1st algorithm guided by tail information for mining maximal frequent itemsets[C] // Proceedings of the IEEE International Conference on Data Mining.Washington,DC:IEEE Computer Society,2002:570-577.
  • 7GRAHNE G,ZHU J F.High performance mining of maximal frequent itemsets[C] // Proceedings of the 6th SIAM International Workshop on High Performance.New York:HPDM Press,2003:135-143.
  • 8刘乃丽,李玉忱,马磊.一种基于FP-tree的最大频繁项目集挖掘算法[J].计算机应用,2005,25(5):998-1000. 被引量:8
  • 9陈晨,鞠时光.基于改进FP-tree的最大频繁项集挖掘算法[J].计算机工程与设计,2008,29(24):6236-6239. 被引量:14
  • 10王现君,宋晶晶,姜保庆.在单向FP-tree上挖掘频繁闭项集[J].计算机工程与应用,2008,44(10):150-153. 被引量:4

二级参考文献45

  • 1马丽生,邓辉文,齐逸.一种新的最大频繁项目集挖掘算法[J].计算机应用,2006,26(11):2670-2673. 被引量:6
  • 2Han Jia-wei,Kamber M.Data ruing:concepts and techniques[M]. [S.l.]:Morgan Kaufmann Publishers,2001.225-279.
  • 3Agrawal R,Srikant R.Fast algorithms for mining association rules[C]// Proc of 1994 Int'l Conf on Very Large Data Bases.Santiago,Chili: VLDB Endowment, 1994. 487-499.
  • 4Park J S,Chen M S,Yu P S.An effective Hash-based algorithm for mining association rules[C]//Proc of 1995 ACM-SIGMOD Int'l Conf on Management of Datal.San Jose,CA:ACM Press,1995. 175-186.
  • 5Agrawal R,Srikant R.Mining sequential patterns[C]//ICDE'951. Taipei,Taiwan:IEEE Computer Society Press, 1995.3-14.
  • 6Brin S,Motwani R,Silverstein C.Beyond market basket: Generalizing association rules to correlations[C]//SIGMOD'97,1997:265-276.
  • 7Pasquier N,Bastide Y,Taouil R,et al.Discovering frequent closed itemsets for association rules[C]//ICDT'99,1999.398-416.
  • 8Zaki M,Hsiao C.CHARM:an effcient algorithm for closed itemset mining[C]//SDM' 02,2002 . 34-43.
  • 9Burdick D,Calimlim M,Gehrke J.MAFIA:amaximal frequent itemset algorithm for transactional databases[C]//ICDE'01,2001.443-452.
  • 10Pei J,Han J,Mao R.CLOSET:an efficient algorithm for mining frequent closed itemsets[C]//DMKD'00,2000.11-20.

共引文献231

同被引文献22

引证文献4

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部