摘要
Spark是主流的大数据并行计算框架。文章将通过几段Scala脚本,演示在Spark环境下通过Map-Reduce框架处理大数据。
Spark is the major framework of parallel computing of big data. This article will illustrate how to process big data through Map-Reduce framework under the background of Spark by several Scala scripts.
作者
邱丽娟
Qiu Lijuan(Xiamen Nanyang University, Xiamen 361102, China)
出处
《无线互联科技》
2017年第1期44-45,共2页
Wireless Internet Technology