期刊文献+

一种基于数据压缩的Apriori算法 被引量:6

Improved Apriori based on data compression
下载PDF
导出
摘要 随着物联网技术的飞速发展,数据采集手段迅速增加,对海量数据分析与处理的需求也愈加强烈。关联规则挖掘算法通过数据之间的关联分析,挖掘出数据之间的隐含关系,进而获得了大量应用。在众多的关联规则算法中,传统的Apriori算法虽然得到了大量应用,但是因为该算法产生大量的候选集,而且需要多次对数据库进行扫描,导致该算法的运行效率大大降低。为了克服Apriori算法的以上缺点,通过数据压缩的方法减少了数据库扫描次数的同时,对生成的候选集进行了多次验证,大大减少了无效候选集的数量。大量的数据挖掘实验证明提出的改进算法可以在正确挖掘数据集关联规则的同时,大大提高了算法的运行效率。 The Apriori algorithm is one of the most influential algorithms for mining association rules. It can work on the large dataset efficiently. However, the traditional Apriori algorithm has two bottlenecks. It generates a large number of candidate sets, and most of them are useless. It has to scan the database for many times. This paper presents an improved Apriori algorithm based on the data compression methodology. The improved algorithm can reduce the number of database scans and the number of candidate set by pre-judging at the same time. Complicated experiment demonstrates that a significant improvement has been achieved by the algorithm.
出处 《计算机工程与应用》 CSCD 2013年第14期117-120,共4页 Computer Engineering and Applications
基金 国家重大专项(No.2011ZX03005-002) 中国博士后科学基金项目(No.20100470568) 王宽诚教育基金
关键词 数据挖掘 关联规则 APRIORI算法 数据压缩 频繁集检测 data mining association rules Apriori data compression detection of frequent set
  • 相关文献

参考文献10

  • 1Shen Q,Liu Y, Zhao Z, et al.Distributed Hash table based ID management optimization for Intemet of things[C]//IWCMC, 2010: 686-690.
  • 2Agrawal R, Imielinski T, Swami A.Mining association rules between sets of items in large databases[C]//Proceedings of the ACM SIGMOD Conference on Management of Data, 1993:207-216.
  • 3Agrawal R, Srikant R.Fast algorithms for mining association rules in large database[C]//Proceedings of the 20th International Conference on Very Large Data Bases, 1994:487-499.
  • 4Park J S,Chen M S,Yu P S.An effective Hash-based algo- rithm for mining association rules[C]//Pruceedings of ACM SIGMOD International Conference on Management of Data, 1995 : 175-186.
  • 5Savasere A, Omiecinski E,Navathe S.An efficient algorithm for mining association rules in large databases[C]//Proceed- ings of the 21st International Conference on Very Large Database, 1995 : 432-443.
  • 6Brin S, Motwani R, Ullman J D, et al.Dynamic iternset count- ing and implication rules for market basket data[C]//ACM SIGMOD International Conference on the Management of Data, 1997 : 255-264.
  • 7Mannila H, Toivonen H, Verkamo A.Efficient algorithm for discovering association rules[C]//AAAI Workshop on Know- ledge Discovery in Databases,1994:181-192.
  • 8Han J, Pei J, Yin Y.Mining frequent patterns without candi- date generation[C]//Proc 2000 ACM-SIGMOD Int Conf Management of Data( SIGMOD ' 00), 2000 : 1-12.
  • 9Mannila H, Toivonen H, Verkamo l.Efficient algorithm for discovering association rules[C]//AAAI Workshop on Know- ledge Discovery in Databases, 1994: 181-192.
  • 10Wu Libing, Gong Kui, He Yanxiang, et al.A study of improv- ing Apriori algorithm[C]//2010 2nd International Workshop on Intelligent Systems and Applications(ISA),2010:22-23.

同被引文献82

引证文献6

二级引证文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部