数据流上连续动态skyline查询研究被引量：11

Continuous Dynamic Skyline Queries over Data Stream

下载PDF

导出

摘要 skyline查询能够从大规模数据集上计算满足多个标准的最优点.数据流上的skyline计算是数据流上最基本的查询操作之一,对于很多在线应用具有非常重要的意义,尤其在移动计算环境、网络监控、通信网络以及传感器网络等领域.不同于大部分传统的skyline研究,主要研究数据流上约束skyline和动态skyline计算问题.采用网格索引存储元组,提出了GBDS算法用于计算和维护动态skyline.通过为每个查询定义影响区域,使得在元组到达和失效时需要处理的元组个数最小化.理论分析和实验结果证明了提出方法的有效性. Skyline queries are capable of retrieving interesting points from a large data set according to multiple criteria.As an essential query,skyline computation over data stream is very important for many online applications,including mobile environment,network monitoring,communication,sensor network and stock market trading,etc.The problem of skyline computation has attracted considerable research attention.Different from most popular skyline processing methods,this paper focuses on constrained skyline and dynamic skyline processing over data stream.Instead of computing the skyline results on the whole data set,this kind of skyline query only needs to process parts of the data set,and there are maybe thousands of such queries in the system.To deal with the challenges of the random additions and deletions of the tuples over data stream,we employ a grid based index to store the tuples and put forward an algorithm to compute and maintain skyline set based on it.By making use of the advantage of grid index,we define influence area for every query to minimize the cells need to be processed when new tuples arrive and old tuples expire.Only tuples in the cells that belong to influence area will be processed.This way,the tuples which are not in the influence area will be ignored and the CPU time is saved.Theoretical analysis and experimental evidences show the efficiency of the proposed approaches.

作者张丽邹鹏贾焰田李

机构地区国防科学技术大学计算机学院

出处《计算机研究与发展》 EI CSCD 北大核心 2011年第1期77-85,共9页 Journal of Computer Research and Development

基金国家"八六三"高技术研究发展计划基金项目(2006AA01Z451 2007AA010502 2007AA01Z474)

关键词数据流滑动窗口约束skyline 动态skyline 网格索引 data stream sliding window constrained skyline dynamic skyline grid-based index

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献19

1Cui Bin, Lu Hua, Xu Quanqing, et al. Parallel distributed processing of constrained skyline queries by filtering[C] // Proc of the 24th Int Conf on Data Engineering. Los Alamitos, CA: IEEE Computer Society, 2008:546-555.
2Borzsonyi S, Kossmann D, Stocker K. The skyline operator [C]//Proc of the 17th Int Conf on Data Engineering. Los Alamitos, CA: IEEE Computer Society, 2001:421-430.
3Barndorff-Nielsen O, Sobel M. On the distribution of the number of admissible points in a vector random sample [J]. Theory of Probability and Its Application, 1966, 11 (2) : 249-269.
4Dellis E, Seeger B. Efficient computation of reverse skyline queries [C]//Proc of the 33rd Int Conf on Very Large Data Bases. New York: ACM, 2007:291-302.
5Papadias D, Tao Yufei, Fu Greg, et al. Progressive skyline computation in database systems [J]. ACM Trans on Database Systems, 2005, 30(1):41-82.
6Bentley J L, Kung H T, Schkolnick M, et al. On the average number of maxima in a set of vectors and applications [J]. JournaloftheACM, 1978, 25(4):536-543.
7Klan-Lee T, Pin-Kwang E, Ooi B C. Efficient progressive skyline computation [C]//Proc of the 27th Int Conf on Very Large Data Bases. San Francisco: Morgan Kaufmann, 2001: 301-310.
8Kossmann D, Ramsak F, Rost S. Shooting stars in the sky: An online algorithm for skyline queries [C]//Proc of the 28th Int Conf on Very Large Data Bases. San Francisco: Morgan Kaufmann, 2002: 275-286.
9Doulkeridis C, Kotidis Y, subspaee skyline computation et al. SKYPEER: efficient over distributed data [C] // Proc of 23th Int Conf on Data Engineering. Los Alamitos CA: IEEE Computer Society, 2007:416-425.
10Wang Shiyuan, Ooi B C, Tung A K H, et al. Efficient skyline query processing on peevto-peer networks [C] //Proc of the 23rd Int Conf on Data Engineering. Los Alamitos, CA: IEEE Computer Society, 2007:1126-1135.

二级参考文献35

1刘欣,余靖,刘国华.基于窗口查询的轮廓查询算法[J].燕山大学学报,2005,29(5):398-402. 被引量：9
2邓波,贾焰,杨树强.一种高效的分布式Skyline查询算法[J].计算机工程与科学,2007,29(9):97-100. 被引量：4
3Borzsonyi S, Kossmann D, Stocker K. The skyline opera tor//Proceedings of the International Conference on Data Engineering (ICDE). Heidelberg, Germany, 2001:421-430.
4Chomicki J, Godfrey P, Gryz J, Liang D. Skyline with presorting//Proeeedings of the International Conference on Data Engineering (ICDE). Bangalore, India, 2003:717-719.
5Tan K L, Eng P K, Ooi B C. Efficient progressive skyline computation//Proceedings of the International Conference on Very Large Data Bases (VLDB). Roma, Italy, 2001: 301- 310.
6Kossmann D, Ramsak F, Rost S. Shooting stars in the sky: An online algorithm for skyline queries//Proceedings of the International Conference on Very Large Data Bases (VLDB). Hong Kong, China, 2002:275-286.
7Papadias D, Tao Y. Progressive skyline computation in data- base systems. ACM Transactions on Database Systems (TODS), 2005, 30(1): 41-82.
8高云君,陈根才,陈岭,陈纯.一种存储最佳的分支界限轮廓查询算法//全国数据库学术会议(NDBC).广州,2006:526-533.
9Balke W T, Gtitzer U, Zheng J X. Efficient distributed skylining for web information systems//Proceedings of the International Conference on Extending Database Technology (EDBT). Heraklio,Greece, 2004:256- 273.
10Wu P, Zhang C, Feng Y. Skyline queries for scalable distri bution//Proceedings of the International Conference on Ex tending Database Technology (EDBT). Munich, Germany 2006:112- 130.

共引文献8

1甘亮,金鑫,贾焰,李爱平,盘仰柯.GDG:一种基于逆支配点集的top-k高效查询索引方法[J].计算机研究与发展,2010,47(10):1771-1784. 被引量：4
2张丽,邹鹏,贾焰.基于网格的数据流连续约束Skyline处理技术研究[J].计算机工程与科学,2011,33(8):173-180.
3杨永滔,王意洁.n-of-N数据流模型上高效概率Skyline计算[J].软件学报,2012,23(3):550-564. 被引量：3
4甘亮,于莉莉,李润恒,贾焰,金鑫.一种基于逆支配点集的数据流Top-k计算方法[J].计算机工程与科学,2012,34(6):59-64.
5李建伟,王康平,黄岚,王贵参.MapReduce模型下基于R树索引的Skyline查询算法[J].吉林大学学报（理学版）,2016,54(4):833-838.
6雷向东,黄荣敏,雷振阳,袁晓莉.Chord网络中的Skyline计算[J].小型微型计算机系统,2017,38(1):77-82.
7唐颖峰,陈世平.利用k-d树索引改进数据流skyline查询算法[J].小型微型计算机系统,2018,39(3):544-550. 被引量：5
8李松,王冠群,郝晓红,郝忠孝.面向推荐系统的多目标决策优化算法[J].西安交通大学学报,2022,56(8):104-112. 被引量：7

同被引文献127

1刘殷雷,刘玉葆,陈程.不确定性数据流上频繁项集挖掘的有效算法[J].计算机研究与发展,2011,48(S3):1-7. 被引量：14
2赵越,王意洁,王媛,李小勇.一种高效的不确定数据流并行Skyline查询处理方法[J].计算机研究与发展,2013,50(S2):132-139. 被引量：3
3谢志军,王雷,林亚平,陈红,刘永和.传感器网络中基于数据压缩的汇聚算法[J].软件学报,2006,17(4):860-867. 被引量：32
4谢洁锐,胡月明,刘才兴,刘兰.无线传感器网络的时间同步技术[J].计算机工程与设计,2007,28(1):76-77. 被引量：9
5孙圣力,黄震华,李金玖,郭建奎,朱扬勇.数据流上高效计算子空间Skyline的算法[J].计算机学报,2007,30(8):1418-1428. 被引量：9
6Borzsonyi S, Kossmann D, Stoeker K. The Skyline operator[C]# Proceedings of the 17th International Conference on Data Engineering. Washington, DC, USA: IEEE Computer Society, 2001: 421-430.
7Tan K L, Eng P K, Ooi B C. Efficient progressive Skyline computation[C]//Proceedings of the 27th International Con- ference on Very Large Data Bases (VLDB '01), Roma, Italy, 2001. San Francisco, CA, USA: Morgan Kaufmarm Pub- lishers Inc, 2001: 301-310.
8Kossmann D, Ramsak F, Rost S. Shooting stars in the sky: an online algorithm for Skyline queries[C]/lProceedings of the 28th International Conference on Very Large Data Bases (VLDB '02), Hong Kong, China, 2002: 275-286.
9Papadias D, Tao Yufei. Progressive Skyline computation in database systems[J]. ACM Transactions on Database Systems (TODS), 2005, 30(1) : 41-82.
10Huang Zhiyong, Lu Hua, Ooi B C, et al. Continuous Skyline queries for moving objects[J]. IEEE Transactions on Knowledge and Data Engineering (TKDE), 2006, 18(12): 1645-1658.

引证文献11

1曹金凤,董一鸿,王勇,钱江波,钟才明.不确定移动对象概率Skyline集的查询更新[J].计算机科学与探索,2012,6(5):443-455. 被引量：1
2黄伯虎,张海宾,王小兵,刘旭东.移动环境中的位置依赖连续轮廓查询[J].西安交通大学学报,2012,46(6):79-86. 被引量：2
3丘晓平,黄小兵.非确定性数据处理技术发展现状与挑战[J].现代计算机,2012,18(18):9-14.
4谢志军,金光,钱江波,唐建华.传感器网络中基于两级过滤的Skyline查询处理[J].系统仿真学报,2013,25(11):2611-2617.
5谢志军,唐建华,杨婧,金光.无线传感器网络中基于连通核的高效Skyline查询算法[J].传感技术学报,2013,26(10):1437-1445. 被引量：1
6李媛媛,曲雯毓,栗志扬,季长清,吴俊峰.基于时间序列的Global Skyline并行算法[J].系统工程与电子技术,2016,38(1):215-222.
7白梅,信俊昌,王国仁,王习特.数据流上动态轮廓查询处理技术的研究[J].计算机学报,2016,39(10):2007-2030. 被引量：8
8唐颖峰,陈世平.利用k-d树索引改进数据流skyline查询算法[J].小型微型计算机系统,2018,39(3):544-550. 被引量：5
9Li Song,Zhang Liping,Li Shuang,Hao Xiaohong.Spatial skyline query method based on Hilbert R-tree in multi-dimensional space[J].High Technology Letters,2019,25(3):262-270.
10霸建民,郭永红,彭龙,赵东阳,邵鹏志,杜宏博.基于ρ-支配轮廓及n-of-Nρ-支配轮廓的数据流中关键数据计算方法[J].兵工学报,2021,42(5):1004-1015. 被引量：2

二级引证文献18

1丁维龙,韩燕波,王菁,赵卓峰.时间滑动窗口上数据流极值聚集的空间优化[J].西安交通大学学报,2012,46(11):106-111. 被引量：1
2汤志俊,樊明锁,何贤芒,陈华辉,董一鸿.位置不确定移动对象的连续概率反Skyline查询[J].计算机科学,2013,40(7):147-152.
3邓泽,刘汪洋,陈丹.一种面向动态连续查询的查询索引[J].计算机应用与软件,2015,32(12):8-11. 被引量：5
4张蓬郁,王煜,江旻宇,邵嘉琳,张洪滨.基于K-D树和机器学习的时空数据检索-预测系统[J].软件,2018,39(8):215-218. 被引量：4
5林志贵,安旭磊,刘英平,李敏,杨子原.基于事件优先级的蛇形时隙存储算法[J].传感技术学报,2015,28(10):1531-1536. 被引量：2
6钟毓灵,王习特,白梅,朱斌,李冠宇.FODU:不确定数据集中快速离群点检测方法[J].计算机工程与应用,2019,55(19):105-114. 被引量：1
7李征宇,李贵,曹科研.针对隐藏Web数据库的Skyline查询方法研究[J].计算机科学与探索,2020,14(8):1307-1314. 被引量：3
8白梅,王京徽,王习特,朱斌,李冠宇.PSP:一种高效的偏序域上skyline查询处理方法[J].湖南大学学报（自然科学版）,2020,47(8):9-20. 被引量：2
9白梅,王习特,李冠宇,宁博,周新.基于最大覆盖的代表Skyline问题的优化算法研究[J].计算机学报,2020,43(12):2276-2297. 被引量：2
10Zhiyun ZHENG,Ke RUAN,Mengyao YU,Xingjin ZHANG,Ning WANG,Dun LI.k-dominant Skyline query algorithm for dynamic datasets[J].Frontiers of Computer Science,2021,15(1):213-221.

1张丽,邹鹏,贾焰.基于网格的数据流连续约束Skyline处理技术研究[J].计算机工程与科学,2011,33(8):173-180.
2邹先霞,贾维嘉,潘久辉.连接查询的分片传输算法[J].计算机工程与应用,2009,45(35):10-13.
3陶金花,文必龙,张敬波,高俊涛.一种基于元模型的关系数据库的查询方法[J].大庆石油学院学报,2004,28(2):69-71. 被引量：6
4张以文,吴金涛,郭星,赵姝.一种基于动态Skyline和遗传粒子群优化的云服务组合方法[J].小型微型计算机系统,2016,37(11):2552-2557. 被引量：5
5为提高搜索引擎效率百度用闪存替代硬盘[J].互联网天地,2008(9):3-3.
6李媛媛,曲雯毓,栗志扬,季长清,吴俊峰.基于时间序列的Global Skyline并行算法[J].系统工程与电子技术,2016,38(1):215-222.
7杨萍萍,赵雷.分布式数据的反Skyline查询算法[J].小型微型计算机系统,2014,35(2):255-260.
8尹浩,张长胜,张斌.求解服务选取问题的混合蚁群优化算法[J].东北大学学报（自然科学版）,2013,34(7):931-934. 被引量：1
9张彬,蒋涛,乐光学,李国徽.一种最优的相互skyline查询算法[J].华中科技大学学报（自然科学版）,2010,38(8):53-56. 被引量：2
10刘春晓,马君,孟祥福.基于隶属度的数据库模糊结果排序方法[J].辽宁工业大学学报（自然科学版）,2011,31(5):295-297. 被引量：1

计算机研究与发展

2011年第1期

浏览历史

内容加载中请稍等...

数据流上连续动态skyline查询研究被引量：11

参考文献19

二级参考文献35

共引文献8

同被引文献127

引证文献11

二级引证文献18

相关作者

相关机构

相关主题

浏览历史

数据流上连续动态skyline查询研究 被引量：11

参考文献19

二级参考文献35

共引文献8

同被引文献127

引证文献11

二级引证文献18

相关作者

相关机构

相关主题

浏览历史

数据流上连续动态skyline查询研究被引量：11