期刊文献+

大数据处理模型Apache Spark研究 被引量:33

Research on Apache Spark for Big Data Processing
下载PDF
导出
摘要 Apache Spark是当前流行的大数据处理模型,具有快速、通用、简单等特点。Spark是针对Map Reduce在迭代式机器学习算法和交互式数据挖掘等应用方面的低效率,而提出的新的内存计算框架,既保留了Map Reduce的可扩展性、容错性、兼容性,又弥补了Map Reduce在这些应用上的不足。由于采用基于内存的集群计算,所以Spark在这些应用上比Map Reduce快100倍。介绍Spark的基本概念、组成部分、部署模式,分析Spark的核心内容与编程模型,给出相关的编程示例。 Apache Spark is a popular model for large scale data processing at present, which is fast, general and easy. Compared with the Map Reduce computing framework, Spark is efficient in iterative machine learning algorithms and interactive data mining applications while re-taining the compatibility, scalability and fault-tolerance of Map Reduce. With its in-memory computing, Spark is up to 100 x faster than Hadoop Map Reduce in memory. Presents the basic conception, component and the deploying mode of Spark, introduces the internal abstraction and the programming model, gives the programming examples.
作者 黎文阳
出处 《现代计算机(中旬刊)》 2015年第3期55-60,共6页 Modern Computer
关键词 SPARK HADOOP MAPREDUCE 大数据 数据分析 Spark Hadoop Map Reduce Big Data Data Analysis
  • 相关文献

参考文献8

  • 1Zaharia M, Chowdhury M, Das T, et al. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for in-Memory Cluster Computing [C]. Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation. USENIX Association, 2012:2-2.
  • 2Zaharia M, Chowdhury M, Franklin M J, et al. Spark: Cluster Computing with Working Sets[C]. Proceedings of the 2nd USENIX Con- ference on Hot Topics in Cloud Computing,2010:10-10.
  • 3Spark[EB/OL]. http://spark.apache.org.
  • 4Scala[EB/OL]. https://www.scala-lang.org.
  • 5Yu Y, Isard M, Fetterly D, et al. DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language[C]. OSDI. 2008, 8: 1. ld.
  • 6Hadoop MapReduce Tutorial[EB/OL]. http://hadoop.apache.org/docs/rl.2.1/mapred_tutorial.html.
  • 7Apache Mesos. http://mesos.apache.org.
  • 8Spark Programming Guides[EB/OL]. http://spark.apache.org/docs/1.1.0/quick-start.html.

同被引文献192

引证文献33

二级引证文献90

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部