期刊文献+

并行框架下基于位图索引的多表星型连接算法

Bitmap index based multi-table star schema join technology algorithm in parallel framework
下载PDF
导出
摘要 分析面向大数据平台的MapReduce分布式编程技术以及实现数据查询时的连接算法,针对SSB数据模型,提出基于分布式缓存的多表星型连接优化技术。利用谓词向量技术,将维表中间连接的数据依赖转化为表上的位图索引过滤,减少数据依赖产生的巨大网络开销;采用分布式缓存技术充分利用处理节点的内存,优化网络传输,减少查询代价。 The algorithm of the MapReduce distributed programming technology and the connection algorithm realizing data queries on the platform of big data were analyzed. Additionally, aiming at the star schema benchmark (SSB) data models, the distributed cache multi-table star scheme join optimization technology was proposed. The predicate vector technology was used to convert the reliance of data in the middle dimension table to the bitmap index filter to reduce the huge network overhead caused by the data reliance. The distributed caching technology was used to process the nodes memory, which optimized the network transmission and reduced the query cost.
出处 《计算机工程与设计》 CSCD 北大核心 2014年第9期3107-3112,共6页 Computer Engineering and Design
基金 2012年黑龙江省科技攻关基金项目(GC12A307)
关键词 并行框架 星型模式 分布式缓存 位图索引 连接 parallel framework star schema distributed cache bitmap index join
  • 相关文献

参考文献10

  • 1黄山,王波涛,王国仁,于戈,李佳佳.MapReduce优化技术综述[J].计算机科学与探索,2013,7(10):865-885. 被引量:30
  • 2Yang H,Dasdan A,Hsiao RL,et al.Map-reduce-merge:Simplified relational data processing on large clusters[C]//Proceedings of the ACM SIGMOD International Conference on Management of data.ACM,2007:1029-1040.
  • 3O' Neil P,O' Neil E,Chen X.Star schema benchmark-revision 3[R/OL].USA:University of Massachusetts Boston.http://www.cs.umbo edu/-poneil/StarSchemaB.PDF,2009.
  • 4Vernica R,Carey MJ,Li Chen.Efficient parallel set-similarity joins using MapReduce[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data.NY,USA:ACM,2010:495-506.
  • 5孙大烈,李建中.基于MapReduce的Skyline-join查询算法[J].哈尔滨工业大学学报,2012,44(1):103-106. 被引量:6
  • 6赵保学,李战怀,陈群,潘巍,姜涛,金健.基于共享的MapReduce多查询优化技术[J].计算机应用研究,2013,30(5):1405-1409. 被引量:7
  • 7Blanas S,Patel JM,Ercegovac V,et al.A comparison of join algorithms for log processing in MapReduce[C]//Proceedings of the ACM SIGMOD International Conference on Management of data.ACM,2010:975-986.
  • 8Okcan A,Riedewald M.Processing theta-joins using MapReduce[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data.NY,USA:ACM,2011:949-960.
  • 9Afrati FN,Ullman JD.Optimizing multiway joins in a MapReduce environment[J].IEEE Transactions on Knowledge and Data Engineering,2011,23 (9):1282-1298.
  • 10张延松,焦敏,王占伟,王珊,周烜.海量数据分析的One-size-fits-all OLAP技术[J].计算机学报,2011,34(10):1936-1946. 被引量:31

二级参考文献121

  • 1O'Neil Patrick E, O'Neil Elizabeth J, Chen Xue-Dong, Revilak Stephen. The star schema benchmark and augmented fact table indexing//Proceedings of the TPCTC. Lyon, France, 2009:237 -252.
  • 2Han Wook-Shin, Ng Jack, Markl Volker, Kache Holger, Kandil Mokhtar. Progressive optimization in a shared-nothing parallel database//Proeeedings of the SIGMOD. Beijing, China, 2007:809 820.
  • 3Lima Alexandre A B, Furtado Camille, Valduriez Patrick, Mattoso Marta. Parallel OLAP query processing in database clusters with data replication. Distributed and Parallel Databases, 2009, 25(1-2): 97-123.
  • 4Furtado Pedro: Model and procedure for performance and availability wise parallel warehouses. Distributed and Parallel Databases, 2009, 25(1-2): 71- 96.
  • 5Yang Christopher, Yen Christine, Tan Ceryen, Madden Samuel. Osprey: Implementing MapReduce-style fault toler ance in a shared nothing distributed database//Proceedings of the ICDE. Long Beach, California, USA, 2010:657-668.
  • 6Chen Songting. Cheetah: A high performance, custom data warehouse on top of MapReduce//Proceedings of the VLDB. Singapore, 2010, 3(2): 1459-1468.
  • 7SAP NetWeaver: A Complete Platform for Large-Scale Busi ness Intelligence. Winter Corporation White Paper. May, 2005.
  • 8The Vertica Analytic Database: Rethinking Data Warehouse Architecture. Winter Corporation White Paper. May, 2005.
  • 9MacNicol R, French B. Syhase IQ muhiplex designed for an alytics//Proceedings of the VLDB. Toronto, Canada, 2004: 1227-1230.
  • 10Stonebraker Michael, Abadi Daniel J, Batkin Adam, Chen Xuedong et al. C Store: A column-oriented DBMS//Proceed ings of VLDB. Trondheim, Norway, 2005:553 -564.

共引文献69

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部