期刊文献+

一种基于压缩策略的高维空间子空间skyline查询算法 被引量:1

A Compression Method Based Algorithm for Subspace Skyline Computation in High Dimensional Space
下载PDF
导出
摘要 skyline操作就是找出数据集中不被其他数据点支配的点的集合,但是随着数据属性维度的不断增多,通常人们只对数据集的某几个属性感兴趣,高维空间子空间skyline计算就是发现数据集中在某几个特定维度上不被其他点支配的点的集合,skyline计算在数据量大时其时间花销是非常大的,快速的返回结果才是人们能接受的.基于此提出了一个RSky算法,在原有CSky算法的基础上,指出并改进了其存在的3处明显不足,并根据InvertS索引的特性提出了一个压缩扫描策略,通过设置每个维度的下限来控制要处理的桶,除去不必要处理的桶和不可能是skyline的点,从而减少了点与点之间的比较次数.实验结果表明了RSky算法的有效性. The skyline computaion is to find the points that are not dominated by any other points in the data set.However,with the growing number of dimensions of the data attributes,usually people are only interested in a few attributes of the data set.The subspace skyline computation for high dimensional space is to find a set of points that are not dominated by any other points on a few specific dimensions in the data set.When deal with the large amount of data,the skyline computation can consume a lot of time,but return result fast is which users can accept.In this paper,we proposed an improved algorithm called RSky which is based on the existing CSky to address the problem of skyline computation's high time complexity.In particular,we pointed out three shortcomings of CSky and proposed a compression method to reduce the points to be processed based on the characters of the InvertS index.The number of comparisons between point and point can be reduced by the lower bounds of each dimension.With the lower bounds,RSky can also remove the unnecessary buckets and the points which are impossible to be skyline points.The experimental results have show the efficiency of the proposed RSky algorithm.
出处 《计算机研究与发展》 EI CSCD 北大核心 2013年第S1期101-108,共8页 Journal of Computer Research and Development
基金 国家自然科学基金项目(61070005) 中央高校基本科研业务费项目(111gpy63) 广东省科技计划项目(2010B080701062)
关键词 子空间skyline查询 InvertS索引 下限 压缩扫描法 渐进 subspace skyline computation InvertS index lower limit reduced scanning method progressive
  • 相关文献

参考文献5

  • 1Ken C. K. Lee,Wang-Chien Lee,Baihua Zheng,Huajing Li,Yuan Tian.Z-SKY: an efficient skyline query processing framework based on Z-order[J]. The VLDB Journal . 2010 (3)
  • 2周红福,宫学庆,郑凯,周傲英.基于高维空间的在线高效子空间Skyline算法——CSky[J].计算机学报,2007,30(8):1409-1417. 被引量:8
  • 3Christopher Raphael,Guy Shani.The Skyline algorithm for POMDP value function pruning[J]. Annals of Mathematics and Artificial Intelligence . 2012 (1)
  • 4Tian Xia,Donghui Zhang,Zheng Fang,Cindy Chen,Jie Wang.Online subspace skyline query processing using the compressed skycube[J]. ACM Transactions on Database Systems (TODS) . 2012 (2)
  • 5印鉴,姚树宇,薛少锷,杨文新,刘玉葆.一种基于索引的高效k-支配Skyline算法[J].计算机学报,2010,33(7):1236-1245. 被引量:14

二级参考文献28

  • 1Stephan B(o)rzs(o)nyi,Donald Kossmann,Konrad Stocker.The skyline Operator//Proceedings of the 17th International Conference on Data Engineering.Heidelberg,Germany,2001:421-430.
  • 2Chan Chee-Yong,Jagadish H V,Tan Kian-Lee,Tung Anthony K H,Zhang Zhen-Jie.Finding k-dominant Skylines in high dimensional space//Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data.Chicago,Illinois,USA,2006:503-514.
  • 3Ronald Fagin,Amnon Lotem,Moni Naor.Optimal Aggregation Algorithms for Middleware.Journal of Computer and System Sciences,2001,66(4):614-656.
  • 4Jan Chomicki,Parke Godfrey,Jarek Gryz,Dongming Liang.Skyline with Presorting//Proceedings of the 19th International Conference on Data Engineering.Bangalore,India,2003:717-816.
  • 5Tan Kian-Lee,Eng Pin-Kwang,Ooi Beng Chin.Efficient progressive skyline computation//Proceedings of the 27th International Conference on Very Large Data Bases.Roma,Italy,2001:301-310.
  • 6Donald Kossmann,Frank Ramsak,Steffen Rost.Shooting stars in the sky:An online algorithm for skyline queries//Proceedings of the 28th International Conference on Very Large Data Bases.Hong Kong,China,2002:275-286.
  • 7Papadias Dimitris,Tao Yufei,Fu Greg,Seeger Bernhard.An optimal and progressive algorithm for skyline queries//Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data.San Diego,California,USA,2003:467-478.
  • 8Yiu Man Lung,Mamoulis Nikos.Efficient processing of topk dominating queries on MultiDimensional data//Proceedings of the 33rd International Conference on Very Large Data Bases.Vienna,Austria,2007:483-494.
  • 9Chan Chee-Yong,Jagadish H V,Tung Anthony K H,Zhang Zhen-Jie.On high dimensional Skylines//Proceedings of the 10th International Conference on Extending Database Technology.Munich,Germany,2006:478-495.
  • 10Borzsonyi S,Kossmann D,Stocker K.The skyline operator//Proceedings of the 17th International Conference on Data Engineering.Heidelberg,Germany,2001:421-430

共引文献20

同被引文献4

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部