Abstract
This paper describes the performance advantages of Spark processing technology within big data frameworks and analyzes the task processing flowchart of Spark in the BDAS ecosystem. It details the construction process and running state of a Spark cluster, and uses interactive programming in the Spark Shell to count the occurrences of each word in a text.
Source
《电脑知识与技术》
2016, Issue 5X, pp. 14-16 (3 pages)
Computer Knowledge and Technology
Funding
Jiangsu Province 2015 Domestic Senior Visiting Scholar Program for Higher Vocational College Teachers (2015FX063)