Abstract
Hadoop is an open-source distributed computing platform under the Apache Software Foundation. Built around the Hadoop Distributed File System (HDFS) and the MapReduce distributed computing framework, it provides users with a distributed cloud infrastructure whose low-level details are transparent to them. Based on an in-depth analysis and study of Hadoop, a Hadoop-based cloud computing platform was set up, distributed text file processing tasks were carried out on it, and the algorithm for processing the text content of files was improved and implemented.
Source
Tianjin Science & Technology (《天津科技》)
2016, No. 1, pp. 52-55 (4 pages)