期刊文献+

基于关系数据库的频繁项集挖掘算法研究 被引量:3

Research on Frequent Itemsets Mining Algorithm Based on Relational Database
原文传递
导出
摘要 Apriori算法及其改进是目前应用最为广泛的频繁项集挖掘算法,但其在关系数据库中挖掘频繁项集时,产生大量候选项集,导致重复扫描数据库,从而导致其效率低下.本文在深入研究Apriori算法及其改进算法和关系数据库特征的基础上,提出了基于关系数据库的频繁项集挖掘算法,并详细描述了其实现和优化方法.本算法不产生候选项集,只需一次事务扫描,大幅提高算法执行效率,此外,本算法经过简单修改就能满足大部分的关联分析需求.在零售业中的应用实验证明:该算法在一定的条件下比经典的Apriori算法具有更高的效率. Apriori algorithm and its improvements is the most widely used algorithm for mining frequent itemsets, but 'producing a great quantity of candidate itemsets during min- ing frequent itemsets in relational database, and scans transaction database repeatedly. This paper makes profound researches on Apriori algorithm and the characteristics of relational database, and proposes a frequent itemsets mining algorithm based on relational database, and presents its concrete implementation and its optimization method. The algorithm doesn't produce, candidate itemsets, and only scans transaction database once, so promotes consider- ably efficiency. Moreover, the algorithm meets most requirements of correlation analysis. The result of experiments in the retail industry show that, the frequent itemsets mining algorithm based on relational database' has higher efficiency than the classical Apriori algorithm under certain conditions.
出处 《数学的实践与认识》 CSCD 北大核心 2013年第12期198-203,共6页 Mathematics in Practice and Theory
关键词 关系数据库 频繁项集 关联规则 APRIORI relational database frequent itemsets association rule Apriori
  • 相关文献

参考文献6

  • 1Jiawei Han,Micheline Kamber(著),范明,孟小峰(译).数据挖掘概念与技术[M].北京:机械工业出版社,2007.3.2.
  • 2Endu Duneja R I T S, Bhopal M P A K, Sachan R I T S. A survey on frequent itemset mining with association rules[J]. International Journal of Computer Applic- ations, 2012, 46(23): 18-24.
  • 3Park J S, Chen M S, Yu P S. An effective Hash based algorithm for mining association rules [C]// Proceedings of the 1995 ACM SIGM OD International Conference on Management of Data, San Jose, CA, 1995, 24(2): 175-186.
  • 4Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large databases [C]//Buneman P, Jajodia S, eds. Proc. of the ACM SIGM OD Conf. on Management of Data ( SIGMOD. 93) . New York: ACM Press, 1993, 22(2): 207-216.
  • 5Savasere A, Omiecianski E, Navathe S. An efficient algorithm for mining association rules in large databases, 1995, 21: 432.
  • 6Toivonen H. Sampling large databases for association rules[C]//Proc of the 22nd lnt'l Conf. on Very Large Data Bases. Mumbai, India, 1996, 22: 134-145.

共引文献63

同被引文献23

引证文献3

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部