期刊文献+

架构大数据:挑战、现状与展望 被引量:614

Architecting Big Data:Challenges,Studies and Forecasts
下载PDF
导出
摘要 大数据分析相比于传统的数据仓库应用,具有数据量大、查询分析复杂等特点.为了设计适合大数据分析的数据仓库架构,文中列举了大数据分析平台需要具备的几个重要特性,对当前的主流实现平台——并行数据库、MapReduce及基于两者的混合架构进行了分析归纳,指出了各自的优势及不足,同时也对各个方向的研究现状及作者在大数据分析方面的努力进行了介绍,对未来研究做了展望. Compared with traditional data warehouse applications,big data analytics are huge and complex.To design a favorable architecture for big data analytics,this paper lists some key features for big data analytics,summarizes current main implementation platforms(parallel databases,MapReduce,and hybrid architectures based on them),and points their pros and cons.Some current researches are also investigated,our work are introduced and some challenging research problems in the future are discussed.
出处 《计算机学报》 EI CSCD 北大核心 2011年第10期1741-1752,共12页 Chinese Journal of Computers
基金 国家重大科技专项核高基项目(2010ZX01042-001-002) 国家自然科学基金(61070054 61170013) 中国人民大学科学研究基金(中央高校基本科研业务费专项资金 10XNI018) 中国人民大学研究生基金(11XNH120)资助~~
关键词 大数据 大规模可扩展 MAPREDUCE 并行数据库 深度分析 big data large scale MapReduce parallel database deep analytics
  • 相关文献

参考文献42

  • 1WinterCorp: 2005 TopTen Program Summary. http:// www. wintercorp, com/WhitePapers/WC TopTenWP. pdf.
  • 2TDWI Checklist Report: Big Data Analytics. http://tdwi. org/research/2010/08/Big-Data-Analytics, aspx.
  • 3Chaudhuri S, Dayal U. An overview of data warehousing and OLAP technology. SIGMOD Rec, 1997,26(1): 65-74.
  • 4Madden S, DeWitt D J, Stonebraker M. Database parallelism choices greatly impact scalability. DatabaseColumn Blog. http://www, databasecolumn, com/2007/10/database-parallelism-choices, html.
  • 5Dean J, Ghemawat S. MapReduce: Simplified data processing on large clusters//Proceedings of the 6th Symposium on Operating System Design and Implementation (OSDI ' 04). San Francisco, California, USA, 2004: 137-150.
  • 6DeWitt D J, Gerber R H, Graefe G, Heytens M L, Kumar K B, Muralikrishna M. GAMMA--A high performance dataflow database machine//Proceedings of the 12th International Conference on Very Large Data Bases (VLDB' 86). Kyoto, Japan, 1986:228-237.
  • 7Fushimi S, Kitsuregawa M, Tanaka H. An overview of the system software of a parallel relational database machine// Proceedings of the 12th International Conference on Very Large DataBases(VLDB'86). Kyoto, Japan, 1986:209-219.
  • 8Brewer E A. Towards robust distributed systems//Proceedings of the 19th Annual ACM Symposium on Principles of Distributed Computing (PODC' 00). Portland, Oregon, USA, 2000:7.
  • 9http: //www. dbms2, com/2008/08/26/known-applications of mapreduce/.
  • 10[OL].<http://hadoop.apache.org.>.

共引文献2

同被引文献4251

引证文献614

二级引证文献7091

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部