摘要
针对当前数据规模不断增大,单机的数据挖掘运行效率低下的问题,本文采用Hadoop平台对聚类K-means算法进行研究以解决此类问题。首先对Hadoop平台的架构和搭建进行了详细描述;其次详细分析了K-means算法;最后给出了算法实现,并对算法进行了实验分析。
In view of the increasing scale of data and the inefficient operation of data mining in single machine, this paper uses Hadoop platform to cluster K-means algorithm to solve such problems. Firstly, the architecture and construction of the Hadoop platform are described in detail; secondly, the K-means algorithm is analyzed; finally, the algorithm implementation is given, and the algorithm is experimentally analyzed.
作者
汪一百
WANG Yi-bai(Changsha Medical University,Changsha 410219,Hunan)
出处
《电脑与电信》
2018年第4期18-20,共3页
Computer & Telecommunication
基金
湖南省教育厅科研项目
项目编号:16C0184