Abstract
Hadoop is a new-generation architecture and technology for the parallel, distributed processing of "big data". This paper discusses Hadoop's distributed system architecture and focuses on the implementation principles and operating mechanisms of the distributed file system HDFS, the distributed parallel computing framework MapReduce, and the surrounding Hadoop ecosystem.
Source
Academic New Vision of Sichuan Technology and Business University (《四川工商学院学术新视野》)
2017, No. 4, pp. 26-30 (5 pages)
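Funding
Sichuan Province Undergraduate Innovation Training Program project "Hadoop-Based Big Data Platform Architecture and Practice" (Project No. 13672)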