Abstract
The convergence of big data technology development with the open source movement has become a distinguishing feature of innovation in big data technology. Almost all of the key technologies currently used in the big data analysis and processing pipeline originate from the open source model. Well-known open source big data projects include Hadoop, a distributed computing and storage system; Spark, an in-memory cluster computing system; and a variety of non-relational (NoSQL) database products. This paper analyzes and interprets well-known open source big data projects such as Hadoop and Spark to provide readers with technical reference and support for building big data applications.
Source
Modern Science & Technology of Telecommunications (《现代电信科技》)
2014, No. 8, pp. 17-22 (6 pages)
Keywords
big data
open source
Hadoop
Spark
NoSQL