摘要
从云计算三个层次的服务模式出发,提出了一种基于云计算平台的分布式并行信息系统数据采集分析系统。首先,通过Hadoop云计算平台提供的分布式文件系统提升数据的存取速度,增强系统的容错性。在此基础上,利用MapReduce编程模型并行化数据流系综分类算法,提高数据的分类挖掘效率。最后,采用Web Service技术构建了SOA服务体系架构,从而整合了技术平台。测试结果表明,检测系统运行高效,并且检测精度高,具有一定的实用性和推广价值。
This paper proposes a cloud-based distributed parallel Information system for data acquisition and analysis system from three levels of service models of cloud computing.Firstly,this system optimizes data accessing speed and enhances system fault-tolerance via HDFS of Hadoop cloud computing platform.Furthermore,it parallels the ensemble classification algorithm of data streams with MapReduce programming model for improving processing efficiency in classification mining.Finally,it constructs SOA architecture with web service technology to integrate technology platform.The testing results show that this system runs effectively and accurately.It is worth using widely on practical application.The technology and methods adopted in the system are practical and worthy of using abroad.
出处
《微型电脑应用》
2014年第2期23-25,共3页
Microcomputer Applications