摘要
在海量数据的关联规则数据挖掘中,采用并行计算是非常必要的;针对当前的关联规则算法,运用并行算法的思想,结合云计算环境下的Hadoop架构,提出了Hadoop下的并行关联规则算法的设计,最后实验表明,该算法能处理节点失效,并且能实现节点负载均衡。
It is very necessary to use parallel computing in association rule data mining of massive data.According to current association rule algorithm,the design of parallel association rule algorithm under Hadoop was pointed out by using parallel algorithm and by combining Hadoop framework under cloud computing environment.The final experiment showed that this algorithm could deal with node failure and achieve node load balance.
出处
《重庆工商大学学报(自然科学版)》
2012年第11期36-39,60,共5页
Journal of Chongqing Technology and Business University:Natural Science Edition