期刊文献+

基于集合枚举树的关联规则生成算法 被引量:4

Association Rules Generating Algorithm Based on Set-Enumeration Tree
下载PDF
导出
摘要 在经典算法中由频繁项集生成关联规则需要生成频繁项集的所有非空子集作为候选后件集。李雄飞对此做出改进,提出逐层搜索后件的宽度优先算法。求下集极大元的Boundary算法也可用于求所有关联规则后件。论文提出一个深度优先算法GRSET(GenerateRulesbyusingSet-EnumerationTree),该算法利用集合枚举树,按照深度优先的方法逐一找出所有关联规则后件并得到相应的关联规则。通过实验对这三种算法进行比较,结果显示GRSET算法效率较高。 The classical algorithm of mining association rules gnerated by a frequent itemset has to generate all nonempty subsets of the frequent itemset as candidate set of consequences,Li Xiongfei aimed at this and proposed an improved algorithm.The algorithm finds all consequences layer by layer,so it is breadth-first.We also can use Boundary algorithm of finding all maximal elements of a lower segment to get all consequences of the association rules,ln this paper,we propose a new algorithm GRSET(Generate Rules by using Set-Enumeration Tree) which uses the structure of Set-Enumeration Tree and depth-first method to find all consequences of the association rules one by one and get all association rules corresponding to the consequences.Experiments show that GRSET algorithm is more efficient than the other two algorithms.
出处 《计算机工程与应用》 CSCD 北大核心 2006年第26期152-155,共4页 Computer Engineering and Applications
基金 国家自然科学基金资助项目(编号:60474022) 河南省骨干教师资助项目(编号:G2002026) 河南省自然科学计划资助项目(编号:200510475028)
关键词 数据挖掘 频繁项集 关联规则 深度优先算法 data mining,frequent itemset,association rules,depth-first algorithm
  • 相关文献

参考文献10

  • 1Jiawei Han,Micheline Kamber.Data Mining Concepts and Techniques[M].First edition,Beijing:Higher Education Press,2001:152~157
  • 2R Agrawal,R Srikant.Fast algorithms for mining association rules in large databases[C].In:Proc of the 20th Int Conf on Very Large Data Bases (VLDB'94),Santiago,Chile,1994:487~499
  • 3颜跃进,李舟军,陈火旺.频繁项集挖掘算法[J].计算机科学,2004,31(3):112-114. 被引量:20
  • 4Burdick D,Calimlim M,Gehrke J.MAFIA:A maximal frequent itemset algorithm for transactional databases[C].In:Proc Of the 17th Int'l Conf On Data Engineering,2001:443~452
  • 5Zhou QH,Wesley C,Lu BJ.SmartMiner-A depth 1st algorithm guided by tail information for mining maximal frequent itemsets[C].In:Proc of the IEEE Int'l Conf On Data Mining(ICDM2002),2002:570~577
  • 6Dao-I Lin,Zvi M Kedem.Pincer-Search:An Efficient Algorithm for Discovering the Maximum Frequent Set[J].IEEE transactions on knowledge and data engineering,2002; 14 (3):553~566
  • 7Agarwal RC,Aggarwal CC,Prasad VVV.Depth First generation of long patterns[C].In:Proc Of the 6th ACM SIGKDD Int'l Conf On Knowledge Discovery and Data Mining,2000:108~118
  • 8高俊,施伯乐.快速关联规则挖掘算法研究[J].计算机科学,2005,32(3):200-201. 被引量:10
  • 9刘大有,刘亚波,尹治东.关联规则最大频繁项目集的快速发现算法[J].吉林大学学报(理学版),2004,42(2):212-215. 被引量:10
  • 10Rymon R.Search Through Systematic Set Enumeration[C].In:Proc Of Third International Conference on Principles of Knowledge Representation and Reasoning,1992:539~550

二级参考文献38

  • 1Imielinski T, Virmani A. MSQL: Aquery languang for database mining. Data Mining and Knowledge Discovery, 1999,3: 373-408
  • 2Groth R. Data Mining: Building Competitive Advantage. Prentice Hall,1999
  • 3Goebel M,Gruenwald L. A survey of data mining and knowledge discovery software tools. SIGKDD Explorations, 1999,1:20-33
  • 4Grahne G. Efficient mining of constrained correlated sets. In:Proc. 2000 Intl. Conf. Data Engineering (ICDE'00), San Diego:2000. 512-521
  • 5Han J. Mining frequent patterns without candidate generation. In:Proc. ACM-SIGMOD Int. Conf. Dallas. 2000
  • 6Han J,Pei J. Freespan: Frequent pattern-projected sequential pattern mining: [Technical Report CMPT2000-06]. Simon Fraser University, 2000. 6-12
  • 7Han J. Data Mining: Concepts and Techniques. Burnaby: Simon Fraser University, 2000. 155-163
  • 8Liu J Q,Pan Y H,Wang K,Han J W. Mining Frequent Item Sets by Opportunistic Projection, KDD'02, Edmonton, Canada, July 2002
  • 9Han J, Pei J,Yin Y. Mining Frequent Patterns without Candidate Generation. In: Proc. 2000ACM-SIGMOD Int. Conf. on Management of Data (SIGMOD'00),Dallas, TX, May 2000
  • 10Pei J,Hah J,Lu H, et al. H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases,In:Proc. 2001 Int. Conf. on Data Mining(ICDM'01) ,San Jose,CA,Nov. 2001

共引文献37

同被引文献38

引证文献4

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部