期刊文献+

一种不产生候选项挖掘频繁项集的新算法 被引量:14

One Algorithm for Discovering Frequent Itemsets without Candidate Generations
下载PDF
导出
摘要 Apriori算法是关联规则挖掘算法中应用最为广泛的一种算法,它的主要目的是从大量的事务数据中通过候选项集挖掘出有趣的频繁项集,从而为用户提供有意义的关联关系。但随着数据库规模的扩大,apriori算法可能会产生如下两大棘手问题:大量候选项集的产生将造成巨大计算量的浪费;为剪掉无用候选项如何设置阈值。这些问题相对于众多普通用户来说都具有挑战性。该文提出的代码与运算是一种无须候选项挖掘频繁项集的算法,用户无须为设置阈值而煞费苦心。同时事务压缩算法的加入大大减少了算法中的计算量。 This paper gives an efficient algorithm for association rule mining,namely And Code(AC)algorithm.AC algo-rithm can discover all frequent itemsets from transaction database quickly without candidate generations.Compared with apriori algorithm,it avoids great amounts of candidates and some exact or experienced thresholds for these candidates.The steps of AC algorithm is:firstly it makes one corresponding code for every itemset according to coding rules after transaction reduction,Secondly it runs And algorithm for itemset codes so as to achieve all frequent itemset codes,Last step will transform these codes into corresponding itemsets,then these itemsets can be classed into frequent itemsets according to the support thresholds of frequent itemsets.
出处 《计算机工程与应用》 CSCD 北大核心 2004年第15期182-185,共4页 Computer Engineering and Applications
关键词 关联规则挖掘 apfiori 算法 代码与运算 事务压缩 association rules,apriori algorithm,And Code algorithm,transaction reduction
  • 相关文献

参考文献7

  • 1Jiawei Han,Micheline Kamber. Data Mining:Concepts and Techniques.2001:225~244
  • 2Agrawal R,Imielinski T,Swami A.Mining Association Rules between Sets in Large Databases[C].In:Proceedings of the 1991 ACMSIGMOD International Conference on Management of Data:SIGMOD'93,New York:ACM Press, 1991:207~216
  • 3朱绍文,王泉德,黄浩,彭清涛,陆玉昌.关联规则挖掘技术及发展动向[J].计算机工程,2000,26(9):4-6. 被引量:39
  • 4R Agrawal,R Srikant. Fast Algorithms for Mining Association Rules[J]. Business Intelligence, 1998:560~564
  • 5N Megiddo, R Srikant. Discovering Predictive Association Rules[C].In: Proc of the 4th Int'l Conference on Knowledge Discovery in Databases and Data Mining,New York,1998-08
  • 6J Parl,M Chen,P Yu. An Effective Hash based Algorithm for MiningAssociation Rules[C].In:Proceedings of the ACM SIGMOD International Conference on Management of Data, 1995; (2): 175~186
  • 7颜雪松,蔡之华.一种基于Apriori的高效关联规则挖掘算法的研究[J].计算机工程与应用,2002,38(10):209-211. 被引量:67

二级参考文献2

共引文献104

同被引文献67

引证文献14

二级引证文献137

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部