期刊文献+

基于MapReduce的Apriori前后项约束关联规则改进算法 被引量:3

Improved Algorithm for Association Rules of Apriori Before and After Items Based on MapReduce
下载PDF
导出
摘要 针对经典的Apriori算法依赖内存,只适用于小规模数据集,在面对海量数据集时显得无能为力以及该算法没有考虑用户的需求情况等问题,提出了基于MapReduce的Apriori前后项约束关联规则改进算法.该方法首先对经典Apriori算法挖掘过程进行了改进,加入了用户的前后项约束规则,使得在挖掘过程中剪枝的程度更大并且获取到更加精准的规则.然后利用云计算的MapReduce编程技术,对改进的Apriori算法的各个步骤并行化.实验结果表明,改进的算法在处理不同的数据集时有一定的优势,然后经过MapReduce模型并行化后,提高了对海量数据的处理能力和效率,并且具有良好的扩展性. Aiming at the memory dependence of the classic Apriori algorithm,it is only suitable for small-scale datasets,it seems to be powerless in the face of massive datasets,and the algorithm does not consider the user’s needs.The improved algorithm of Apriori pre-term constraint association rules based on MapReduce is proposed.Firstly,the method of the classic Apriori algorithm mining process is improved,and the user’s pre-and post-item constraint rules are added,which makes the pruning degree more in the mining process and obtains more precise rules.Then,using the MapReduce programming technology of cloud computing,the steps of the improved Apriori algorithm are parallelized.The experimental results show that the improved algorithm has certain advantages in dealing with different data sets.After parallelization by MapReduce model,it improves the processing ability and efficiency of massive data and has good scalability.
作者 王伟 储泽楠 韩毅 吴朝霞 焦清局 WANG Wei;CHU Zenan;HAN Yi;WU Zhaoxia;JIAO Qingju(School of Computer Science and Information Engineering,Anyang Institute of Technology,Anyang 455000,China;Anyang Information System Application Engineering Technology Research Center,Anyang Institute of Technology,Anyang 455000,China;Henan Province High Precision Spindle Engineering Laboratory,Anyang Institute of Technology,Anyang 455000,China;National CNC System Engineering Technology Research Center,Huazhong University of Science and Technology,Wuhan 430000,China;Ministry of Education Oracle Information Processing Key Laboratory,Anyang Normal University,Anyang 455000,China)
出处 《信阳师范学院学报(自然科学版)》 CAS 北大核心 2020年第3期448-453,共6页 Journal of Xinyang Normal University(Natural Science Edition)
基金 国家自然科学基金项目(61806007) 河南省科技计划项目(182102210197) 河南省高等学校重点科研项目(2020ZDJH002)。
关键词 关联规则 APRIORI算法 项约束 MAPREDUCE 并行算法 HADOOP association rules apriori algorithm item constraint MapReduce parallel algorithm hadoop
  • 相关文献

参考文献2

二级参考文献11

共引文献13

同被引文献35

引证文献3

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部