期刊文献+

数据库与MapReduce融合的大数据管理技术探索 被引量:4

Research on Big Data Management with Architecture of Fusion of Database and MapReduce
原文传递
导出
摘要 大数据管理是随着时代和技术发展而提出和演化的命题。随着大数据从传统的结构化数据向无结构化数据的转移,Key/value存储、NoSQL、MapReduce等技术成为数据库技术之外大数据管理的多样化手段。MapReduce以其开放性成为当前大数据的代表技术,在大数据应用中,如何让MapReduce与数据库高效协同,发挥各自的技术优势和平台优势,提供高性能、高可扩展性、高可用性的大数据服务平台成为重要的研究课题。本文讨论在大数据存储、管理与服务主题上的观点和技术路线,探索将MapReduce作为数据库新的应用与开发平台的可行性。 Big data management is a proposition that came up and is evolving along with time and technology development. As big data research focus moves from traditional structured data to unstructured data, Key/ value store, NoSQL, MapReduce etc. became diverse means for big data management besides database. MapReduce is the mainstream technique for big data because of its open feature. It is a critical issue to make MapReduce cooperate with database efficiently in order to provide a big data service platform of highperformance, high scalability and high availability. This paper discusses the technical opinions on big data storing, management and service topics, and explores the feasibility of use MapReduce as new application and development platform for databases.
作者 张延松
出处 《科研信息化技术与应用》 2013年第1期19-29,共11页 E-science Technology & Application
基金 国家科技重大专项"核心电子器件 高端通用芯片及基础软件产品"(2010ZX01042-001-002)
关键词 大数据 KEY value存储 MAPREDUCE 大数据仓库 Big data Key/value store MapReduce Big data warehouse
  • 相关文献

参考文献15

  • 1Douglas, Laney. 3D Data Management: ControllingData Volume, Velocity and Variety. Gartner. Retrieved 6February 2001.
  • 2Lith, Adam, Mattsson,Jakob. Investigating storagesolutions for large data - A comparison of well performingand scalable data storage solutions for real time extractionand batch insertion of data. 2010.
  • 3Yongqiang He, Rubao Lee, Yin Huai, Zheng Shao, NamitJain, Xiaodong Zhang, Zhiwei Xu. RCFile: A fast andspace-efficient data placement structure in MapReduce-based warehouse systems. ICDE 2011: 1199-1208.
  • 4Dittrich J,Quiane-Ruiz JA,Jindal A, Kargin Y, Setty V,Schad J. Hadoop-H-: Making a yellow elephant run like acheetah (without it even noticing)‘ PVLDB,2010,3(1—2):518-529.
  • 5Kamil Bajda-Pawlikowski, Daniel J. Abadi,AviSilberschatz, Erik Paulson. Efficient processing of datawarehousing queries in a split execution environment.SIGMOD Conference 2011: 1165—1176.
  • 6Dean J, Ghemawat S. MapReduce: Simplified dataprocessing on large clusters. In: Brewer E,Chen P, eds.Proc. of the OSDI.Califomia: USENIX Association,2004.,137-150.
  • 7Thusoo A, Sarma JS,Jain N, Shao Z, Chakka P,AnthonyS,Liu H,Wyckoff P,Murthy R. Hive a warehousingsolution over a MapReduce framework. PVLDB, 2009,2(2): 938-941.
  • 8Rubao Lee, Tian Luo, Yin Huai, Fusheng Wang,Yongqiang He, Xiaodong Zhang. YSmart: Yet AnotherSQL-to-MapReduce Translator. ICDCS 2011: 25—36.
  • 9Wang HJ,Qin XP, Zhang YS,Wang S,Wang ZW.LinearDB: A relational approach to make data warehousescale like MapReduce.In: Yu JX, Kim MH,Unland R,eds. Proc. of the DASFAA. Hong Kong: Springer-Verlag,2011,306-320.
  • 10张延松,焦敏,王占伟,王珊,周烜.海量数据分析的One-size-fits-all OLAP技术[J].计算机学报,2011,34(10):1936-1946. 被引量:31

二级参考文献13

  • 1O'Neil Patrick E, O'Neil Elizabeth J, Chen Xue-Dong, Revilak Stephen. The star schema benchmark and augmented fact table indexing//Proceedings of the TPCTC. Lyon, France, 2009:237 -252.
  • 2Han Wook-Shin, Ng Jack, Markl Volker, Kache Holger, Kandil Mokhtar. Progressive optimization in a shared-nothing parallel database//Proeeedings of the SIGMOD. Beijing, China, 2007:809 820.
  • 3Lima Alexandre A B, Furtado Camille, Valduriez Patrick, Mattoso Marta. Parallel OLAP query processing in database clusters with data replication. Distributed and Parallel Databases, 2009, 25(1-2): 97-123.
  • 4Furtado Pedro: Model and procedure for performance and availability wise parallel warehouses. Distributed and Parallel Databases, 2009, 25(1-2): 71- 96.
  • 5Yang Christopher, Yen Christine, Tan Ceryen, Madden Samuel. Osprey: Implementing MapReduce-style fault toler ance in a shared nothing distributed database//Proceedings of the ICDE. Long Beach, California, USA, 2010:657-668.
  • 6Chen Songting. Cheetah: A high performance, custom data warehouse on top of MapReduce//Proceedings of the VLDB. Singapore, 2010, 3(2): 1459-1468.
  • 7SAP NetWeaver: A Complete Platform for Large-Scale Busi ness Intelligence. Winter Corporation White Paper. May, 2005.
  • 8The Vertica Analytic Database: Rethinking Data Warehouse Architecture. Winter Corporation White Paper. May, 2005.
  • 9MacNicol R, French B. Syhase IQ muhiplex designed for an alytics//Proceedings of the VLDB. Toronto, Canada, 2004: 1227-1230.
  • 10Stonebraker Michael, Abadi Daniel J, Batkin Adam, Chen Xuedong et al. C Store: A column-oriented DBMS//Proceed ings of VLDB. Trondheim, Norway, 2005:553 -564.

共引文献30

同被引文献75

引证文献4

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部