期刊文献+

分布式网络中的一种高效top-k求解方法研究 被引量:1

FT:Efficient top-k algorithm in distributed networks
下载PDF
导出
摘要 提出了一种新的算法,来解决在分布式的环境中top-k求解问题(求出全局数值最大的前k名)。之前的研究,例如TA、TPUT、HT算法,都会消耗大量的带宽。KLEE算法虽然能够大大地减少带宽的消耗,却不能给出精确解。而提出的算法FT由于添加了一个预处理阶段并且使用了histogram bloom技术,即能有效地减少带宽的消耗,又能给出精确解。实现了FT和相关的算法,并进行了全面的比较。比较是建立在真实的数据集和根据不同情况合成的数据集的基础上的。实验结果显示FT在带宽消耗上面,相对于其他算法有很大的改进和优势。 A new algorithm FT to answer top-k queries(find the k objects with the highest aggregate scores) in a distributed net-work is proposed.Prior research such as Threshlod Algorithm(TA),Three-Phase Uniform-Threshold(TPUT) algorithm and Hybrid-Threshold(HT) algorithm consume an excessive amount of bandwidth.KLEE algorithm can reduce network bandwidth consumption but it can't give exact top-k answers.The algorithm FT can both reduce network bandwidth and give the exact top-k answer based on the pretreatment in the first phase and the histogram bloom.This paper implements FT and related algorithms and conducts a comprehensive performance evaluation.Evaluation employs real-world and synthetic data sets.The experiment shows that FT can achieve major performance in terms of network bandwidth.
出处 《计算机工程与应用》 CSCD 北大核心 2010年第18期89-92,共4页 Computer Engineering and Applications
基金 中科院创新基金重点支持项目~~
关键词 top-k算法 分布式网络 HISTOGRAM blooms技术 top-k algorithm distributed networks histogram blooms
  • 相关文献

参考文献12

  • 1Babcock B,Olston C.Distributed top-k monitorin[C]//SIGMOD Conf, 2003.
  • 2Silberstein A,Braynard R,Ellis C S,et al.A sampling-based approach to optimizing top-k queries in sensor networks[C]//ICDE Conf, 2006.
  • 3Akbarinia R,Pacitti E,Valduriez P.Reducing network traffic in unstructured P2P systems using Top-k queries[J].Distributed and Parallel Databases, 2006,19(2).
  • 4Metwally A,Agrawal D,E1 Abbadi A.An integrated efficient solution for computing frequent and top-k elements in data streams[J]. J ACM Transactions on Database Systems(TODS),2006,31(3).
  • 5Zipf G K.Human behaviour and the principle of least effort:An introduction to human ecology[M].[S.l.]:Addison-Wesley, 1949.
  • 6Nepal S,Ramakrishna M V.Query processing issues in image(multimedia)databases[C]//Proc of Intl Conf on Data Engineering(ICDE), 1999:22-31.
  • 7Fagin R,Lotem A,Naor M.Optimal aggregation algorithms or middleware[C]//Symposium on Principles of Database Systems,2001.
  • 8Fagin R.Combining fuzzy information from multiple systems[C]//Proc of Intl Syrup on Principles of Database Systems(PODS),1996:216--226.
  • 9Cao P,Wang Z.Efficient top-K query calculation in distributed networks[C]//PODC, 2004.
  • 10Yu H, Li H,Wu P, et al.Efficient processing of distributed Top-k queries[C]//Proceedings of the 16th International Conference on Database and Expert Systems Applications,DEXA 2005,2005.

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部