基于GPU的并行化Apriori算法的设计与实现被引量：5

Design and Implementation of Apriori on GPU

下载PDF

导出

摘要大数据和高度并行的计算架构的时代已经来临,如何让传统的串行数据挖掘方法在当下获得更高的效率是一个值得探讨的问题。根据现代GPU大规模并行运算架构的特点(单结构多数据),对传统的串行Apriori算法进行并行化处理。使用最新的CUDA技术完成对传统串行Apriori算法中的支持度统计、候选集生成这两个计算的并行化实现,讨论了多种实现方法的差异,并提出改进方案。实验表明:改进后的并行算法使支持度统计在10000条事务的条件下效率提高16%,候选集生成在10000条事务的条件下效率提高25%。 Big data and parallel computation era have come,and it is a trend to convert serial data mine algorithm into parallel algorithm to take advantage of cheap machine. In this paper two main steps, namely support counting and candidate set generation in serial apriori algorithm, were rebuilt parallelly on CUDA architecture. Meanwhile the difference between various implements of parallel apriori was compared to find a better solution. Finally, the experiments indicate that the time of support counting and candidate set generation decreases 16% and 25% respectively on a data set containing 10000 items.

作者唐家维王晓峰

机构地区上海海事大学信息工程学院

出处《计算机科学》 CSCD 北大核心 2014年第10期238-243,共6页 Computer Science

基金国家海洋公益性行业专项(201305026)资助

关键词数据挖掘关联规则频繁模式并行算法 Data minint, Association rules, Frequent itemset mining, Parallel agorithm

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献9

1Agrawal R,Srikant R.Fast algorithms for mining association rules[C]∥Proceedings of the 20th International Conference on Very Large Data Bases (VLDB’94).1994:487-499.
2Agrawal R,Shafer J C.Parallel mining of association rules[J].IEEE Transactions on Knowledge and Data Engineering,1996,8(6):962-969.
3Shah K D,Mahajan S.Maximizing the Efficiency of ParallelApriori Algorithm[C]∥ International Conference on Advances in Recent Technologies in Communication and Computing.IEEE,2009:107-109.
4Li Ning,Zeng Li,He Qing,et al.Parallel Implementation ofApriori Algorithm Based on MapReduce[C]∥Software Engineering,Artificial Intelligence,Networking and Parallel & Distributed Computing (SNPD).2012:236-241.
5Shintani T,Kitsuregawa M.Hash based parallel algorithms for mining association rules[C]∥Fourth International Comperence on Parallel and Distributed Information Systems.IEEE,1996:19-30.
6Cui Qing-min,Guo Xiao-bo.Research on Parallel AssociationRules Mining on GPU[C]∥Proceedings of the 2nd International Conference on Green Communications and Networks.2013:215-222.
7Yang Yuan-sen,Yang Chung-ming,Hsieh T J.GPU parallelization of an object-oriented nonlinear dynamic structural analysis platform[J].Simulation Modelling Practice and Theory,2014,40:112-121.
8Shan Feng,Hart John C.Parallel computing on geostatistical data using CUDA[C]∥IDEALS.2014.
9Smirnov V.Parallel Integration Using OpenMP and GPU toSolve Engineering Problems[J].Applied Mechanics and Materials,2014,475:1190-1194.

同被引文献42

1吴恩华.图形处理器用于通用计算的技术、现状及其挑战[J].软件学报,2004,15(10):1493-1504. 被引量：141
2徐章艳,刘美玲,张师超,卢景丽,区玉明.Apriori算法的三种优化方法[J].计算机工程与应用,2004,40(36):190-192. 被引量：71
3徐计,王国胤,于洪.基于粒计算的大数据处理[J].计算机学报,2014,37(113):1-22.
4Dalai N. Triggs B. Histogram of oriented gradients for humanI>tection[C]//CVPR. 2005 :886-893.
5Belongie S, Malik J,Puzicha J. Shape Matching and object recog-nition Using Shape Contexts[J]. IEEE Trans, on Pattern Analy-sis and Machine Intelligence, 2002,24C4) : 509-522.
6Shi J , Thomasi C. Good feature to track[C] // IEEE Conferenceon Computer Vision and pattern Recognition. 1994:593-560.
7Lucas B. Kanade T. An iterative image registration techniquewith an application to stereo vision[C] //' Proceedings of the In-ternational Joint Conference on Artificial Intelligence, 1982:674-679.
8Strengert M,Kraus M,Ertl T. Pyramid Methods in GPU-BasedImage Processing[C]//Proceeding of Vision. Modeling.and Vi-sualization 2006. 2006:169-176.
9Dollar P, Appel R.Belongie S. F'ast Feature pyramids for objectdetection[J]. IEEE Transactions on Pattern Analysis and Ma-chine Intelligence,2014,36(8) : 1532-1545.
10Nvidia. NVIDIA CUDA A Programming Guide version 4. 0[EB/OIJ. http://www. nvidia. com/object/cuda-cn.

引证文献5

1张杰,柴志雷,喻津.基于GPU的图像特征并行计算方法[J].计算机科学,2015,42(10):297-300. 被引量：6
2张忠林,田苗凤,刘宗成.大数据环境下关联规则并行分层挖掘算法研究[J].计算机科学,2016,43(1):286-289. 被引量：27
3方刚,吴跃.基于复合粒度计算的频繁模式挖掘研究[J].计算机应用研究,2016,33(6):1620-1623. 被引量：3
4赵月,任永功,刘洋.基于MapReduce的改进的Apriori算法及其应用研究[J].计算机科学,2017,44(6):250-254. 被引量：10
5王蒙,方睿,邹书蓉.基于矩阵相乘的Apriori改进算法[J].计算机与数字工程,2018,46(10):1974-1979. 被引量：5

二级引证文献51

1杜华明,张明昌,刘爽,张瑜嘉.基于数据融合与挖掘的城市综合管廊运维管理探索[J].建筑电气,2022,41(11):64-70. 被引量：1
2王永贵,谢南,曲海成.基于存储改进的分区并行关联规则挖掘算法[J].计算机应用研究,2020,37(1):167-171. 被引量：6
3艾锐峰,欧阳军,程杰,周凯,孙云鹏.实时演进数据序列集的内在模式提取与行为预测[J].计算机系统应用,2018,27(12):75-82.
4陈墨,金磊,龚向阳,满毅.面向5G海量网管数据的故障溯源技术[J].北京邮电大学学报,2018,41(5):131-136. 被引量：9
5许川佩,王光.基于OpenCL的尺度不变特征变换算法的并行设计与实现[J].计算机应用,2016,36(7):1801-1806. 被引量：3
6亢华爱.面向机器学习的通信网络大数据相关性分析算法研究[J].激光杂志,2016,37(8):145-148. 被引量：4
7吴翔翔,范远超,叶恩光,刘镇.基于GPU的并行化运动目标检测方法的研究[J].电子设计工程,2016,24(22):134-137.
8张春生,图雅,李艳.基于精简二元矩阵的蒙医方剂关联规则挖掘[J].世界科学技术-中医药现代化,2017,19(2):365-369. 被引量：3
9张春,周静.动车组故障关联规则挖掘优化算法研究与应用[J].计算机与现代化,2017(9):74-78. 被引量：4
10李伟,朱赵元.一种基于并行矩阵目标明确的Apriori算法[J].浙江工业大学学报,2017,45(5):574-579. 被引量：5

1凌华科技发布aTCA-6150[J].测控技术,2010,29(4):104-104.
2李金霞.研华新一代的RISC（精简指令集运算架构）平台——一种高性能绿色能源处理核心技术[J].自动化信息,2012(3):73-75.
3夏明波,王晓川,孙永强,金士尧.序列模式挖掘算法研究[J].计算机技术与发展,2006,16(4):4-6. 被引量：13
4张伟丰,杨丽华.基于矩阵的多段支持度关联规则挖掘算法[J].湖北汽车工业学院学报,2014,28(2):72-76. 被引量：3
5邓勇,施文康.发现频繁情节的改进算法[J].上海交通大学学报,2005,39(3):405-408. 被引量：1
6石岩.助力化千里为咫尺——就pcANYWHERE谈远程管理软件[J].中国经济和信息化,1999,0(13):37-37.
7何辞,曹建军,郝放.基于桌面虚拟化的卫星网络高速上网技术研究[J].无线电工程,2016,46(9):33-36.
8晓辉.Wyse将把精简型计算机进行到底[J].网络安全技术与应用,2006(9):16-16.
9魏紫.使用GPU实现快速K近邻搜索算法[J].科技信息,2009(27):45-45.
10MSI微星 X-Slim X340 超薄机身、低功耗处理器[J].家庭电子,2009(5):35-35.

计算机科学

2014年第10期

浏览历史

内容加载中请稍等...

基于GPU的并行化Apriori算法的设计与实现被引量：5

参考文献9

同被引文献42

引证文献5

二级引证文献51

相关作者

相关机构

相关主题

浏览历史

基于GPU的并行化Apriori算法的设计与实现 被引量：5

参考文献9

同被引文献42

引证文献5

二级引证文献51

相关作者

相关机构

相关主题

浏览历史

基于GPU的并行化Apriori算法的设计与实现被引量：5