期刊文献+

Efficient Incremental Maintenance for Distributive and Non-Distributive Aggregate Functions 被引量:2

Efficient Incremental Maintenance for Distributive and Non-Distributive Aggregate Functions
原文传递
导出
摘要 Data cube pre-computation is an important concept for supporting OLAP (Online Analytical Processing) and has been studied extensively. It is often not feasible to compute a complete data cube due to the huge storage requirement. Recently proposed quotient cube addressed this issue through a partitioning method that groups cube cells into equivalence partitions. Such an approach not only is useful for distributive aggregate functions such as SUM but also can be applied to the maintenance of holistic aggregate functions like MEDIAN which will require the storage of a set of tuples for each equivalence class. Unfortunately, as changes are made to the data sources, maintaining the quotient cube is non-trivial since the partitioning of the cube cells must also be updated. In this paper, the authors design incremental algorithms to update a quotient cube efficiently for both SUM and MEDIAN aggregate functions. For the aggregate function SUM, concepts are borrowed from the principle of Galois Lattice to develop CPU-efficient algorithms to update a quotient cube. For the aggregate function MEDIAN, the concept of a pseudo class is introduced to further reduce the size of the quotient cube, Coupled with a novel sliding window technique, an efficient algorithm is developed for maintaining a MEDIAN quotient cube that takes up reasonably small storage space. Performance study shows that the proposed algorithms are efficient and scalable over large databases. Data cube pre-computation is an important concept for supporting OLAP (Online Analytical Processing) and has been studied extensively. It is often not feasible to compute a complete data cube due to the huge storage requirement. Recently proposed quotient cube addressed this issue through a partitioning method that groups cube cells into equivalence partitions. Such an approach not only is useful for distributive aggregate functions such as SUM but also can be applied to the maintenance of holistic aggregate functions like MEDIAN which will require the storage of a set of tuples for each equivalence class. Unfortunately, as changes are made to the data sources, maintaining the quotient cube is non-trivial since the partitioning of the cube cells must also be updated. In this paper, the authors design incremental algorithms to update a quotient cube efficiently for both SUM and MEDIAN aggregate functions. For the aggregate function SUM, concepts are borrowed from the principle of Galois Lattice to develop CPU-efficient algorithms to update a quotient cube. For the aggregate function MEDIAN, the concept of a pseudo class is introduced to further reduce the size of the quotient cube, Coupled with a novel sliding window technique, an efficient algorithm is developed for maintaining a MEDIAN quotient cube that takes up reasonably small storage space. Performance study shows that the proposed algorithms are efficient and scalable over large databases.
作者 李翠平 王珊
机构地区 Information School
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2006年第1期52-65,共14页 计算机科学技术学报(英文版)
基金 This paper is supported by the National Natural Science Foundation of China (Grant Nos. 60473069, 60496325, 60273071).
关键词 quotient cube incremental maintenance OLAP quotient cube, incremental maintenance, OLAP
  • 相关文献

参考文献20

  • 1Agarwal S, Agrawal R, Deshpande P M et al. On the computation of multidimensional aggregates. In Proc. 1996 Int.Conf. Very Large Data Bases (VLDB'96), Bombay, India,Sept. 1996, pp.506-521.
  • 2Barbara D. Quasi-cubes: Exploiting approximation in multidimensional databases, SIGMOD Record, 1997, 26: 12-17.
  • 3Beyer K, Ramakrishnan R. Bottom-up computation of sparse and iceberg cubes. In Proc, 1999 ACM-SIGMOD Int, Conf. Management of Data ( SIGMOD'99), Philadelphia, PA, June 1999, pp.359-370,.
  • 4Davey A, Priestley H A, Introduction to Lattices and Order.Cambridge University Press, 1990.
  • 5Ganter B, Wille R, Franzke C. Formal Concept Analysis: Mathematical Foundations, Sprlnger-Verlag, 1999.
  • 6Godin R, Missaoui R, Alaoui H. Incremental concept formation algorithms based on Galois lattices. Computational Intelligence, 1991, 11: 246-267.
  • 7Gray J, Chaudhuri S, Bosworth A et al. Data cube; A relational aggregation operator generalizing group-by, cross-tab and sub-totals. Data Mining and Knowledge Discovery, 1997,1: 29-54.
  • 8Harinarayan V, Rajaraman A, Ullman J D, Implementing data cubes efficiently. In Proc. 1996 ACM-SIGMOD Int. Conf. Management of Data (SIGMOD'96), Montreal,Canada, June 1996, pp.205-216.
  • 9Labio W J, Yang Ji Cui Y et al. Performance issues in incremental warehouse maintenance. In Proc. the 26th Int. Conference on Very Large Data Bases (VLDB'00), Cairo, Egypt,2000, pp.461-472.
  • 10Lakshmanan L V S, Pei J, Han J W. Quotient cube: How to summarize the semantics of a data cube. In Proc. 2002 Int. Conf. Very Large Data Bases (VLDB'02), Hong Kong,China, 2002, pp.227-238.

同被引文献7

  • 1Cui-PingLi,Kum-HoeTung,ShanWang.Incremental Maintenance of Quotient Cube Based on Galois Lattice[J].Journal of Computer Science & Technology,2004,19(3):302-308. 被引量:2
  • 2Laks V S Lakshmanan, Jian Pei, Jiawei Han. Quotient Cube: How to Summarize the Semantics of a Data Cube [C]. Proceedings of VLDB2002, 2002:778-789.
  • 3Ki Yong Lee, Myoung-Ho Kim. Efficient Incremental Maintenance of Data Cubes[C]. Proceedings of VLDB2006, 2006:823-833.
  • 4Ki Yong Lee, Yon Dohn Chung, Myoung-Ho Kim. An efficient method for maintaining data cubes incrementally [J].Information Sciences : an International Journal, 2010, 180 ( 6 ) :928-948.
  • 5Dong Jin, Tatsuo Tsuji, Takayuki Tsuchida,et al. An Incremental Maintenance Scheme of Data Cubes[J]. Lecture Notes in Computer Science, 2008, 4947 : 172-187.
  • 6Hahn C, et al. Edited synoptic cloud reports from ships and land stations over the globe [EB/OL]. http://cdiac, oral. gov/ftp/ ndp026b.
  • 7栾华,杜小勇,王珊.Prefetching J^+-Tree:A Cache-Optimized Main Memory Database Index Structure[J].Journal of Computer Science & Technology,2009,24(4):687-707. 被引量:3

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部