数据仓库中一种有效的高维联机分析处理方法

A improved parallel processing approach for high-dimensional OLAP in data warehouse

下载PDF

导出

摘要在大量数据仓库系统中,对于一个d维的data cube,数据立方体(cube)可以生成2d个聚集cuboids,然而随着数据仓库维数的增长,计算这些预聚集数据已经成为一个瓶颈.在minimal cubing方法的基础上,提出一种具体层次语义特性的多维层次数据立方体——前缀索引立方体(prefix-index cubing)技术,将高维cube划分成若干个低维立方体cube,以实现高维cube的分布式存储和并行计算.理论分析与实验结果表明,相对于以往的minimal cubing等方法,前缀索引立方体方法的性能显著提高. In many data warehouses, it can generate 2d cuboids for the cube with d dimensions. However, as the size of data warehouses grows, the time it takes to perform this pre-computation becomes a significant performance bottleneck. In this paper, a multi-dimensional hierarchical cubing approach, called prefix-index cubing approach, is proposed based on an extension of the previous minimal cubing approach. This method partitions the high dimensional data cube into low dimensional cube segments. The proposed data allocation and processing model support distributed storage and parallel processing, as well as load balancing for disks and processors. The analytical and experimental results show that the proposed method is significantly more efficient than other existing cubing methods such as minimal cubing approach.

作者徐建平胡孔法

机构地区扬州大学信息工程学院

出处《扬州大学学报（自然科学版）》 CAS CSCD 2008年第1期46-50,共5页 Journal of Yangzhou University：Natural Science Edition

基金国家自然科学基金资助项目(60773103) 江苏省“青蓝工程”基金资助项目

关键词联机分析处理高维数据立方体前缀索引立方体维层次编码 online analytical processing （OLAP） dimension hierarchical encoding high-dimensional cube prefix-index cubing approach

分类号 TP311.13 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献8

1HAN J, PEI J, DONG G, et al. Efficient computation of iceberg cubes with complex measures[C]//AREF W G. Proc of SIGMOD'01. New York: ACM Press, 2001: 1-12.
2LAKSHMANAN L V S, PEI J, HAN J. Quotient cubes: how to summarize the semantics of a data cube[C]// BRESSAN S, CHAUDHRI A B, LEE M L. Proc of VLDB'02. San Fransisco:Morgan Kaufmann, 2002: 778- 789.
3XIN D, HAN J, LI X, et al. Star-cubing: computing iceberg cubes by top-down and bottom-up integration [C]// FREYTAG J C, LOCKEMANN P C, ABITEBOUL S, et al. Proe of VLDB' 03. San Fransiseo: Morgan Kaufmann, 2003: 476-487.
4LAKSHMANAN L V S, PEI J, ZHAO Y. QC-trees: an efficient summary structure for ,semantic OLAP [C]// HALEVY A Y, IVES Z G, DOAN A. Proc of SIGMOD'03. New York: ACM Press, 2003:64-75.
5SISMANIS Y, DELIGIANNAKIS A, KOTIDIS Y, et al. Hierarchical dwarfs for the rollup cube [C]//SONG I, DAVIS K C. Proe of DOLAP'03. New York: ACM Press, 2004: 540-551.
6XIN D, HAN J, SHAO Z, et al. C-cubing: efficient computation of closed cubes by aggregation-based checking [C]//LIU L, REUTER A, WHANG K Y, et al. Proc of 2006 Int Conf on Data Engineering (ICDE'06). Los Alamitos: IEEE Computer Society Press, 2006: 4.
7LI X, HAN J, GONZALEZ H. High-dimensional OLAP: a minimal cubing approach [C]//NASCIMENTO M A, OZSU M T, KOSSMANN D, et al. Proe of VLDB'04. San Fransisco: Morgan Kaufmann, 2004: 528-539.
8胡孔法,陈崚,顾颀,蔡俊杰,董逸生.数据仓库系统中一种高效的多维层次聚集算法[J].计算机集成制造系统,2007,13(1):196-201. 被引量：4

二级参考文献9

1WU M C.Query optimization for selections using bitmaps[C]//Proceedings of the ACM SIGMOD International Conference on Management of Data.New York,N.Y.,USA:ACM Press,1999:227-238.
2WU K,OTOO E J,SHOSHANI A.A performance comparison of bitmap indexes[C]//Proceedings of the 10th International Conference on CIKM.New York,N.Y.,USA:ACM Pess,2001:559-561.
3MISTRY H,ROY P,SUDARSHAN S.Materialized view selection and maintenance using multi-query optimization[C]//Proceedings of the ACM SIGMOD 2001.New York,N.Y.,USA:ACM Press,2001:307-318.
4MARKL V,RAMSAK F,BAYERN R.Improving OLAP performance by multidimensional hierarchical Clustering[C]//Proceedings of the International Conference on IDEAS 1999.Los Alamitos,Cal.,USA:IEEE Computer Society Press,1999:165-177.
5THEODORATOS D,TSOIS A.Heuristic optimization of OLAP queries in multidimensionally hierarchically clustered databases[C]//Proceedings of the 4th ACM International Workshop on DOLAP.New York,N.Y.,USA:ACM Press,2001:48-55.
6KARAYANNIDIS N,TSOIS A,SELLIS T,et al.Processing star queries on hierarchically-clustered fact tables[C]//Proceedings of the 28th International Conference on VLDB.San Fransisco,Cal.,USA:Morgan Kaufmann,2002:730-741.
7CHEN Y,DEHNE F,EAVIS T.Parallel ROLAP data cube construction on shared-nothing multiprocessors[J].Distributed and Paralled Databases,2004,15(3):219-236.
8LI X,HAN J,GONZALEZ H.High-dimensional OLAP:a minimal cubing approach[C]//Proceedings of the 30th International Conference on VLDB.San Fransisco,Cal.,USA:Morgan Kaufmann,2004:528-539.
9胡孔法,董逸生,徐立臻,杨科华.一种基于维层次编码的OLAP聚集查询算法[J].计算机研究与发展,2004,41(4):608-614. 被引量：14

共引文献3

1宋旭东,胡墨谦,刘晓冰.基于模型驱动体系架构的OLAP开发方法[J].计算机集成制造系统,2010,16(2):423-430. 被引量：4
2朱凯,万定生,程习锋.水利普查成果分析中数据立方体计算研究[J].计算机与数字工程,2014,42(9):1591-1594. 被引量：3
3唐珊珊,朱跃龙,朱凯.基于Map/Reduce的外壳片段立方体并行计算方法[J].计算机工程与应用,2015,51(22):124-129. 被引量：4

1胡孔法,陈崚,李斌,陈宏建,董逸生.数据仓库系统中高维联机分析处理聚集数据存储技术研究[J].计算机集成制造系统,2006,12(7):1095-1101. 被引量：5
2荀亚玲,张素兰.基于位图连接索引的复杂多维层次的连接和聚集算法[J].通讯和计算机（中英文版）,2007,4(2):10-14. 被引量：1
3杨科华,张伟.一种基于多层次链的Data Cube维层次编码[J].湖南大学学报（自然科学版）,2007,34(9):74-77.
4华镕.Factory Talk服务平台的产品(二)[J].自动化博览,2010,27(5):64-65. 被引量：1
5殷君伟,陈建明,薛百里,张健.一种基于排序划分的聚类初始化方法[J].微电子学与计算机,2013,30(6):80-83. 被引量：3
6徐红波,胡文,潘海为,高祥,刘润涛.高维空间范围查询并行算法研究[J].哈尔滨商业大学学报（自然科学版）,2013,29(1):73-75. 被引量：2
7胡孔法,董逸生,陈崚.数据仓库中一种基于维层次编码的位图索引方法[J].东南大学学报（自然科学版）,2005,35(2):171-177. 被引量：4
8陈骋,马晓岩,杨瑞娟,程伟.多功能电子系统电磁兼容任务调度方法[J].电子科技,2013,26(3):8-13. 被引量：3
9胡孔法,陈崚,赵茂先,达庆利,纪兆辉.DHMC:一种有效的高维Cube并行分布式存储结构[J].计算机研究与发展,2007,44(12):2098-2105. 被引量：4
10胡孔法,董逸生,徐立臻,杨科华.一种基于维层次编码的OLAP聚集查询算法[J].计算机研究与发展,2004,41(4):608-614. 被引量：14

扬州大学学报（自然科学版）

2008年第1期

浏览历史

内容加载中请稍等...

数据仓库中一种有效的高维联机分析处理方法

参考文献8

二级参考文献9

共引文献3

相关作者

相关机构

相关主题

浏览历史