期刊文献+

基于Flink的k-支配skyline体并行求解算法 被引量:1

A k-dominant skyline body parallel solving algorithm based on Flink
下载PDF
导出
摘要 k-支配skyline算法弱化了数据点之间的支配关系,更适合高维数据。k-支配skyline体适应于多名用户使用k-支配skyline算法查询,而现有的求解算法在时间效率和代码扩展性方面都有待提高。因此,提出了面向多用户的k-支配skyline体求解优化算法MKSSOA,该算法对每名用户的候选集和中间集分别进行存储,同时在k-支配检查过程中利用2集合中数据点出现的先后次序将候选集中的非k-支配skyline点存储到对应用户的中间集中,以便下一名用户筛选使用,这样可以减少数据点之间的比较次数,避免重复计算,从而提升查询效率。同时,提出了面向多用户的k-支配skyline体并行求解算法MKSPSA,通过Apache Flink并行处理框架有效减少了数据点的比较时间。理论研究和实验结果显示,提出的算法具有较高的效率,能很好地处理多用户k-支配skyline问题。 The k-dominated skyline algorithm weakens the domination relationship between data points and is more suitable for high-dimensional data.k-dominated skyline bodies are suitable for multiple users to query with the k-dominated skyline algorithm,but the existing solution algorithms need to be improved in terms of time efficiency and code scalability.Therefore,this paper proposes an optimization algorithm for solving k-dominated skyline bodies.This algorithm stores the candidate set and the intermediate set for each user separately,and stores the non-k-dominated skyline points in the candidate set to the intermediate set of the corresponding user in the order of appearance of data points in the two sets during the k-domination checking process,so that the next user can filter and use them,which can reduce the number of comparisons between data points,avoids double counting,and improve query efficiency.A multi-user k-dominated skyline body parallel solving algorithm is also proposed,which effectively reduces the comparison time of data points through the Apache Flink parallel processing framework.The theoretical study and experimental data show that the proposed algorithm is highly efficient and can handle the multi-user k-dominated skyline problem well.
作者 孙国璋 黄山 艾力卡木·再比布拉 徐浩桐 段晓东 SUN Guo-zhang;HUANG Shan;ALKAM Zabibul;XU Hao-tong;DUAN Xiao-dong(College of Computer Science and Engineering,Dalian Minzu University,Dalian 116600;State Ethnic Affairs Commission Key Laboratory of Big Data Applied Technology(Dalian Minzu University),Dalian 116600;Dalian Key Laboratory of Digital Technology for National Culture(Dalian Minzu University),Dalian 116600,China)
出处 《计算机工程与科学》 CSCD 北大核心 2023年第1期17-27,共11页 Computer Engineering & Science
基金 国家重点研发计划(2018YFB1004402)。
关键词 k-支配 SKYLINE查询 多用户 Apache Flink 并行查询 k-dominant skyline query multi-user Apache Flink parallel query
  • 相关文献

参考文献6

二级参考文献42

  • 1刘欣,余靖,刘国华.基于窗口查询的轮廓查询算法[J].燕山大学学报,2005,29(5):398-402. 被引量:8
  • 2Stephan B(o)rzs(o)nyi,Donald Kossmann,Konrad Stocker.The skyline Operator//Proceedings of the 17th International Conference on Data Engineering.Heidelberg,Germany,2001:421-430.
  • 3Chan Chee-Yong,Jagadish H V,Tan Kian-Lee,Tung Anthony K H,Zhang Zhen-Jie.Finding k-dominant Skylines in high dimensional space//Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data.Chicago,Illinois,USA,2006:503-514.
  • 4Ronald Fagin,Amnon Lotem,Moni Naor.Optimal Aggregation Algorithms for Middleware.Journal of Computer and System Sciences,2001,66(4):614-656.
  • 5Jan Chomicki,Parke Godfrey,Jarek Gryz,Dongming Liang.Skyline with Presorting//Proceedings of the 19th International Conference on Data Engineering.Bangalore,India,2003:717-816.
  • 6Tan Kian-Lee,Eng Pin-Kwang,Ooi Beng Chin.Efficient progressive skyline computation//Proceedings of the 27th International Conference on Very Large Data Bases.Roma,Italy,2001:301-310.
  • 7Donald Kossmann,Frank Ramsak,Steffen Rost.Shooting stars in the sky:An online algorithm for skyline queries//Proceedings of the 28th International Conference on Very Large Data Bases.Hong Kong,China,2002:275-286.
  • 8Papadias Dimitris,Tao Yufei,Fu Greg,Seeger Bernhard.An optimal and progressive algorithm for skyline queries//Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data.San Diego,California,USA,2003:467-478.
  • 9Yiu Man Lung,Mamoulis Nikos.Efficient processing of topk dominating queries on MultiDimensional data//Proceedings of the 33rd International Conference on Very Large Data Bases.Vienna,Austria,2007:483-494.
  • 10Chan Chee-Yong,Jagadish H V,Tung Anthony K H,Zhang Zhen-Jie.On high dimensional Skylines//Proceedings of the 10th International Conference on Extending Database Technology.Munich,Germany,2006:478-495.

共引文献27

同被引文献4

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部