期刊文献+

基于PFP的关联规则增量更新算法 被引量:6

Association rules incremental updating algorithm based on PFP
下载PDF
导出
摘要 对快速增长的数据进行挖掘的有效途径之一是采用增量式更新算法,其中最具代表性的是MRFUP算法。该算法的剪枝策略减少了关联规则的计算,但在处理增长快速的数据时效率过低,且频繁计算新增数据。文章以提高海量数据下关联规则增量更新效率为目标,通过扩展能够并行处理关联规则的PFP算法而提出一种基于PFP的关联规则增量更新算法MRPFP。该算法能充分利用云平台强大的存储和并行计算能力。该算法的实验结果表明,MRPFP处理海量数据的效率优于MRFUP算法,更适用于海量数据的关联规则挖掘。 One effective way for the rapidly growing data mining is the incremental updating algorithm,which is represented by the MRFUP algorithm.MRFUP algorithm has a good advantage in the maintenance of association rules with its pruning strategy,but it has low efficiency in the rapidly growing data processing and calculates the new data frequently.In this paper,aiming at improving the efficiency of association rules incremental updating of the massive data,an association rules incremental updating algorithm MRPFP is proposed by extending the parallel processing algorithm of association rules PFP.The algorithm can take advantage of powerful cloud storage and parallel computing capabilities.The experimental results show that MRPFP is more efficient in processing massive data than MRFUP and more suitable for the association rules mining of massive data.
出处 《合肥工业大学学报(自然科学版)》 CAS CSCD 北大核心 2015年第4期500-503,551,共5页 Journal of Hefei University of Technology:Natural Science
关键词 关联规则 Map/Reduce模式 增量更新 并行FP-Growth算法 association rule Map/Reduce programming pattern incremental updating parallel FP-Growth algorithm
  • 相关文献

参考文献21

  • 1Shah S,Chauhan N C, Bhanderi S D. Incremental mining of association rules: a survey [J]. International Journal of Computer Science and Information Technologies, 2012,3 (3) :4071-4074.
  • 2Cheung D W. Maintenance of discovered association rules in large database: an incremental updating technique[C]// Proc of 1996 Int Conf on Data Engineering. IEEE Computer Soc Press, 1996 : 106- 114.
  • 3Han Jiawei, Kamber M. Data mining concepts and tech- niques [ M ]. MorganKaufmann Publishers, 2002: 151-159.
  • 4Leung C K S,Khan Q I, Hoque T. CanTree: a tree struc- ture for efficient incremental mining of frequent patterns [C]//Proceedings of the Fifth IEEE International Confer- ence on Data Mining, 2005 : 274-281.
  • 5Pramudiono I, Kitsuregawa M. Parallel fp-growth on pc cluster[M]//Advances in knowedge Discovery and Data Mining. Berlin:Springer,2003:467-473.
  • 6Aouad L M, Le-Khac N A, Kechadi T M. Distributed fre- quent itemsets mining in heterogeneous platforms[J]. Jour- nal of Engineering, Computing and Archtecture, 2007, 1 (2):1-12.
  • 7Zhou Le, Zhong Ziyong, Chang Jin, et al. Balanced paral- lel fp-growth with MapReduce[C]//Conference on Infor- mation Computing and Telecommunications, IEEE, 2010 : 243-246.
  • 8Chen Ke, Zhang Lijun,Li Sansi, et al. Research on associa- tion rules parallel algorithm based on fp-growth[M]//In- formation Computing and Applications. Borlin: Springer, 2011:249-256.
  • 9黄德才,张良燕,龚卫华,刘端阳.一种改进的关联规则增量式更新算法[J].计算机工程,2008,34(10):38-39. 被引量:21
  • 10朱玉全,陈耿,宋余庆,孙志挥.Shared-nothing并行事务数据库系统中规则的挖掘与更新算法[J].小型微型计算机系统,2003,24(8):1499-1502. 被引量:3

二级参考文献57

  • 1朱红蕾,李明.一种高效维护关联规则的增量算法[J].计算机应用研究,2004,21(9):107-109. 被引量:9
  • 2朱玉全,宋余庆,陈耿.关联规则挖掘中增量式更新算法的研究[J].计算机工程与应用,2005,41(15):186-187. 被引量:8
  • 3付长贺,赵传立,唐恒永.一种改进的关联规则增量式更新算法[J].沈阳师范大学学报(自然科学版),2006,24(1):51-54. 被引量:2
  • 4张健沛,杨悦,刘卓.一种新的关联规则增量式挖掘算法[J].计算机工程,2006,32(23):43-44. 被引量:6
  • 5[1]Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large databases. In: Proceedings of ACM SIGMOD International Conference on Management of Date, Washington DC, 1993.207~216
  • 6[2]Agrawal R, Srikant R. Fast algorithm for mining association rules. In: Proceedings of the 20th International Conference on VLDB, Santiago, Chile, 1994. 487~499
  • 7[3]Han J, Kamber M. Data Mining: Concepts and Techniques. Beijing: Higher Education Press, 2001
  • 8[5]Agrawal R, Shafer J C. Parallel mining of association rules:Design, implementation, and experience. IBM Research Report RJ 10004,1996
  • 9[6]Savasere A, Omiecinski E, Navathe S. An efficient algorithm for mining association rules. In: Proceedings of the 21th International Conference on VLDB, Zurich, Switzerland, 1995. 432~444
  • 10[7]Hah J, Jian P et al. Mining frequent patterns without candidate generation. In: Proceedings of ACM SIGMOD International Conference on Management of Data, Dallas, TX, 2000.1~12

共引文献138

同被引文献36

  • 1李松生,赵燕伟,顾熙仁.改进的FUP算法在五金产品质量分析系统中的应用[J].吉林大学学报(工学版),2012,42(S1):251-254. 被引量:1
  • 2CHEUNG D W,HAN J,NG V T,et al.Maintenance of Discovered Association Rules in large Database:An Incremental Updating Technique[C]//In1Proc of the 12th Int Conf on Data Engineering,New Orleans,Louisiana,1996:106-114.
  • 3Agrawal R,Imielinski T,Swami A.Mining Association Rules Between Sets of Items in Large Databases[J].ACM SIGMOD Record,1993,22(2):207-216.
  • 4Sathish K.Efficient tree based distributed data mining algorithms for mining frequent patterns[J].International Journal of Computer Application,2010,11(10):233-242.
  • 5Rahul M.Comparative analysis of apriori algorithm and frequent pattern algorithm for frequent pattern mining in web log data[J].International Journal of Computer Science and Information Technologies,2012,3(4):4662-4665.
  • 6SRIKAN R, AGRAWAL R. Mining Quanlitative Rules in Large Relational Table [ C ]. ACM SIGMOD. Conference on Manage of Date. San Diego, Californian, USA: ACM Press, 1996: 1-12.
  • 7SATHISH K. Efficient Tree Based Distributed Data Mining Algorithms for Mining Frequent Patterns [ J 1-International Journal of Computer Application, 2010, 11 (10): 233-242.
  • 8MISHRA R, CHOUBEY A. Comparative Apriori Algorithm and Frequent Pattern Analysis of Algorithm forFrequent Pattern International Journal Mining in Web Log Data [J]. of Computer Science and Information Eehnologies, 2012, 3(4): 4662-4665.
  • 9冯玉才,冯剑琳.关联规则的增量式更新算法[J].软件学报,1998,9(4):301-306. 被引量:227
  • 10杨云,罗艳霞.FP-Growth算法的改进[J].计算机工程与设计,2010,31(7):1506-1509. 被引量:25

引证文献6

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部