期刊文献+

The Generic Annular Bucket Histogram for Estimating the Selectivity of Spatial Selection and Spatial Join

The Generic Annular Bucket Histogram for Estimating the Selectivity of Spatial Selection and Spatial Join
原文传递
导出
摘要 Selectivity estimation is crucial for query optimizers choosing an optimal spatial execution plan in a spatial database management system.This paper presents an Annular Bucket spatial histogram(AB histogram)that can estimate the selectivity in finer spatial selection and spatial join operations even when the spatial query has more operators or more joins.The AB histogram is represented as a set of bucket-range,bucket-count value pairs.The bucket-range often covers an annular region like a sin-gle-cell-sized photo frame.The bucket-count is the number of objects whose Minimum Bounding Rectangles(MBRs)fall between outer rectangle and inner rectangle of the bucket-range.Assuming that all MBRs in each a bucket distribute evenly,for every buck-et,we can obtain serial probabilities that satisfy a certain spatial selection or join conditions from the operations' semantics and the spatial relations between every bucket-range and query ranges.Thus,according to some probability theories,spatial selection or join selectivity can be estimated by the every bucket-count and its probabilities.This paper also shows a way to generate an updated AB histogram from an original AB histogram and those probabilities.Our tests show that the AB histogram not only supports the selectivity estimation of spatial selection or spatial join with "disjoint","intersect","within","contains",and "overlap" operators but also provides an approach to generate a reliable updated histogram whose spatial distribution is close to the distribution of ac-tual query result. Selectivity estimation is crucial for query optimizers choosing an optimal spatial execution plan in a spatial database management system. This paper presents an Annular Bucket spatial histogram (AB histogram) that can estimate the selectivity in finer spatial selection and spatial join operations even when the spatial query has more operators or more joins. The AB histogram is represented as a set of bucket-range, bucket-count value pairs. The bucket-range often covers an annular region like a single-cell-sized photo frame. The bucket-count is the number of objects whose Minimum Bounding Rectangles (MBRs) fall between outer rectangle and inner rectangle of the bucket-range. Assuming that all MBRs in each a bucket distribute evenly, for every bucket, we can obtain serial probabilities that satisfy a certain spatial selection or join conditions from the operations' semantics and the spatial relations between every bucket-range and query ranges. Thus, according to some probability theories, spatial selection or join selectivity can be estimated by the every bucket-count and its probabilities. This paper also shows a way to generate an updated AB histogram from an original AB histogram and those probabilities. Our tests show that the AB histogram not only supports the selectivity estimation of spatial selection or spatial join with "disjoint", "intersect", "within", "contains", and "overlap" operators but also provides an approach to generate a reliable updated histogram whose spatial distribution is close to the distribution of ac- tual query result.
出处 《Geo-Spatial Information Science》 2011年第4期262-273,共12页 地球空间信息科学学报(英文)
基金 Supported by the Innovation Project of IGSNRR (No. O9V90220ZZ) the Research Plan of LREIS (O88RA700KA),CAS
关键词 selectivity estimation AB histogram annular bucket spatial selection spatial join 概率直方图 选择性估计 环形区域 和空间 空间数据库管理系统 最小边界矩形 选择性估算 通用
  • 相关文献

参考文献22

  • 1Jiang Songtao, Lee B S, Zhen He (2007) Cost modeling of spatial operators using non-parametric regression [J].Information Sciences, 177: 607-631.
  • 2Bjorke J T, Nilsen S (2003) Wavelets applied to simplifi- cation of digital terrain models [J]. International Journal of Geographical Information Seience, 17:601-621.
  • 3Yang Bisheng, Purves R S, Weibel R (2008) Variable- resolution compression of vector data [J]. Geoinformatic, 12:357-376.
  • 4Van Oosterom P, Schenkelaar V (1995) The development of an interactive multiscale GIS [J]. International Jour- nal of Geographical Information Systems, 9:489-507.
  • 5Prasher S, Zhou Xiaofang (2003) Efficient update and retrieval of objects in a multiresolution geospatial data- base [C]. Proceedings of SSDBM 2003, Cambridge, MA, USA.
  • 6Cheng Changxiu, Niu Fangqu, Cai Jtm, et al. (2008) Ex- tensions of GAP-tree and its implementation based on a non-topological data model [J]. International Journal of Geographical Information Science, 22, 657-673.
  • 7Buttenfield B (1999) Progressive transmission of vector data on the Internet: A cartographic solution [C]. The 18th International Cartographic Association Conference, Stockholm, Sweden.
  • 8Bertolotto M, Egenhofer M J (2001) Progressive trans- mission of vector map data over the World Wide Web [J]. Geolnformatica, 5(4): 345-373.
  • 9Cheng Changxiu, Lu Feng, Cai Jun (2009) A quantitative scale-setting approach for building multi-scale spatial databases [J]. Computer & GeoScience, 35:2004-2209.
  • 10Aboulnaga A, Chaudhuri S (1999) Self-tuning histo- grams: building histograms without looking at data [C]. 1999 ACM SIGMOD, Philadelphia, PA, USA.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部