Mapreduce模型及支撑系统概述

Overview graphs model and support system

下载PDF

导出

摘要 MapReduce是由并行编程模型及相关支撑系统组成的数据处理框架,通过定义接口和运行时支持库,通过定义良好的接口和运行时支持库,能够自动并行执行大规模计算任务,通过隐藏底层实现细节,降低实现并行编程的难度,Hadoop是目前MapReduce框架最流行的开源实现。文章首先介绍了MapReduce并行编程模型及其hadoop的运行原理、运行机制,深入研究了MapReduce计算任务在Hadoop系统中的运行过程。 MapReduce is composed of parallel programming model and its support system data processing framework,through the definition of interface support library and runtime support library,through the definition of a good interface and operation,capable of automatic parallel execution of large-scale computing tasks,by hiding the underlying implementation details,reduce the difficulty of parallel programming,Hadoop is currently the most popular MapReduce framework open source implementation.Firstly,this paper introduces the MapReduce parallel programming model and the operation principle and operation mechanism of Hadoop,and deeply studies the operation process of MapReduce computing task in Hadoop system.

作者李炜贺丽娟

机构地区陕西国防工业职业技术学院

出处《电子测试》 2017年第9期77-78,共2页 Electronic Test

关键词大数据 MAPREDUCE HADOOP HDFS big data graphs hadoop HDFS

分类号 TP311.13 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献1

1应毅,刘亚军.MapReduce并行计算技术发展综述[J].计算机系统应用,2014,23(4):1-6. 被引量：18

二级参考文献31

1李国杰.大数据研究的科学价值.中国计算机学会通讯,2012,8(9):8—15.
2Ghemawat S, Gobioff H, Leung ST. The Google file system. ACM SIGOPS Operating Systems Review. ACM. 2003, 37(5): 29-43.
3Dean J, Ghemawat S. MapReduce: Simplified data processing on large clusters. Communications of the ACM, 2008, 51(1): 107-113.
4Chang F, Dean J, Ghemawat S, et al. Bigtable: A distributed storage system for structured data. Proc. of the 7th USENIX Symp. on Operating Systems Design and Implementation. 2006. 205-218.
5TomWhite著.周敏奇,王晓玲,金澈清等译.Hadoop权威指南(第二版).北京:清华大学出版社,2011.
6Shvachko K, Kuang H, Radia S, et al. The Hadoop distributed file system. Mass Storage Systems and Technologies (MSST). 2010 IEEE 26th Symposium on. IEEE. 2010. 1-10.
7IDC发布最新《数字宇宙研究报告》.http://www.ecas.cn/xxkw/kbcd/201115-93655/ml/xxhjsyjcss/201212/t20121229_3730152.html.
8Bu Y, Howe B, Balazinska M, et al. HaLoop: Efficient iterative data processing on large clusters. Proc. of the VLDB Endowment, 2010, 3(1-2): 285-296.
9Ekanayake J, Li H, Zhang B, et al. Twister: A runtime for iterative mapreduce. Proc. of the 19th ACM [ntemational Symposium on High Performance Distributed Computing. ACM. 2010. 810-818.
10Zaharia M, Chowdhury M, Das T, et al. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. Proc. of the 9th USENIX Conference on Networked Systems Design and Implementation. USENIX Association. 2012.2-2.

共引文献17

1陈凤娟.基于MapReduce的关联规则挖掘[J].电脑与电信,2014(8):59-60.
2周国军.一种基于MapReduce的关联规则挖掘算法[J].玉林师范学院学报,2014,35(5):128-134. 被引量：1
3陈文竹,陈岳林,蔡晓东,华娜.基于并行框架的鲁棒自适应前景检测算法[J].计算机系统应用,2015,24(4):153-158.
4李金忠,汤鹏杰,夏洁武,谭云兰.迭代式MapReduce研究进展[J].计算机工程与应用,2015,51(12):123-132. 被引量：2
5何东之,张吉沣,赵鹏飞.不确定性传播算法的MapReduce并行化实现[J].山东大学学报（工学版）,2015,45(5):22-28. 被引量：1
6何广才,周根宝.基于MapReduce的改进蚁群算法在TSP中的应用[J].内蒙古农业大学学报（自然科学版）,2015,36(5):125-132. 被引量：5
7谭黔林,莫春娟.基于MapReduce的海量文件检索方法研究[J].河池学院学报,2016,36(2):101-105. 被引量：1
8徐宏博,赵文涛,孟令军.一种基于MapReduce的改进文本输入方式的并行分词方法研究[J].电脑知识与技术,2016,0(8):171-175.
9秦军,冯亮亮,孙蒙.基于异构Hadoop集群的负载均衡策略研究[J].计算机技术与发展,2017,27(6):110-113. 被引量：2
10朱坤,黄瑞章,张娜娜.一种基于MapReduce模型的高效频繁项集挖掘算法[J].计算机科学,2017,44(7):31-37. 被引量：9

1丁晶,李刚,谭毅培.基于Hadoop系统大数据平台在天津市地震局的应用[J].电子技术与软件工程,2017(18):159-161. 被引量：2
2张晓丽,滑亚慧.一种基于HDFS小文件存储优化方案[J].计算技术与自动化,2017,36(3):134-138. 被引量：3
3李金忠,彭蕾,刘欢,罗文浪.大规模图计算系统研究进展[J].小型微型计算机系统,2017,38(10):2394-2400. 被引量：2
4陈琳.浅论Hadoop平台在大数据中的应用[J].太原学院学报（自然科学版）,2017,35(3):56-59. 被引量：2
5谢明达.关于变电站厂站自动化的系统安全探究[J].华东科技（学术版）,2017,0(10):168-168.
6林珠,吴佩珊.面向交通大数据的智能处理平台建设研究[J].计算技术与自动化,2017,36(3):114-117. 被引量：3
7郑星星,何叶元.电梯制动器的结构形式及检验检测探究[J].丝路视野,2017,0(20):82-82. 被引量：1
8冯航.Linux操作系统文件管理概论[J].数码世界,2017,0(10):312-312.
9唐磊.基于Ambari的Hadoop集群部署实验的设计与实现[J].信息记录材料,2017,18(11):98-101. 被引量：1
10付建伟.变电站二次回路保护电器的选用研究[J].设备管理与维修,2017(11):21-22.

电子测试

2017年第9期

浏览历史

内容加载中请稍等...

Mapreduce模型及支撑系统概述

参考文献1

二级参考文献31

共引文献17

相关作者

相关机构

相关主题

浏览历史