Basket Bloom Filters for Membership Queries (分档布鲁姆过滤器的查询算法)

Cited by: 14
Abstract: A Bloom filter is a space-efficient data structure that represents a set compactly and supports membership queries with allowable errors. It is widely used in databases, networks, and distributed systems. Existing Bloom filters do not take the cost of a query invalidation (a false-positive answer) into account. This paper proposes a cost-sensitive variant, the Basket Bloom Filter (BBF), together with its query algorithm. The BBF partitions the elements of a set into baskets (subsets) according to their query invalidation cost. By examining the relationship between the minimum query invalidation rates of the baskets, it defines an objective function for the total query invalidation cost of the set, expressed in terms of the per-basket false-positive probabilities. A genetic algorithm guided by a gradient-like measure of the objective function then finds the optimal number of hash functions k_i for each basket, which completes the mapping from the set to the bit vector and the lookup procedure. Simulation results show that the BBF's query time is almost the same as that of the standard Bloom filter, while its total query invalidation cost, because set elements are treated differently, is only 27% of the standard algorithm's.
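Source: Chinese Journal of Computers (《计算机学报》), indexed in EI, CSCD, and the Peking University Core Journals list; 2007, No. 4, pp. 597-607 (11 pages)
Funding: Supported by the National Natural Science Foundation of China (60473031, 60273070, 60403031, 90604015), the National High Technology Research and Development Program of China (863 Program, 2005AA121560), and the Hunan Provincial Science and Technology Program (2006GK3101).
Keywords: Basket Bloom Filter; computer networks; distributed computing; distributed information system; membership query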
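
The basket idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation. It assumes that all baskets share one bit vector, that the caller knows which basket an element belongs to at query time, and that the total cost is the cost-weighted sum of per-basket false-positive rates computed with the usual Bloom filter approximation. The names `BasketBloomFilter` and `expected_total_cost` are hypothetical, and the double-hashing scheme is a stand-in for whatever hash functions the paper uses. The genetic search for the k_i values is not shown.

```python
import hashlib
import math


class BasketBloomFilter:
    """Sketch: one shared bit vector, a separate hash count k_i per basket."""

    def __init__(self, m_bits, hashes_per_basket):
        self.m = m_bits
        self.ks = list(hashes_per_basket)        # k_i for basket i (assumed given)
        self.bits = bytearray((m_bits + 7) // 8)

    def _positions(self, item, basket):
        # Double hashing (an assumption): position_j = (h1 + j * h2) mod m.
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        return [(h1 + j * h2) % self.m for j in range(self.ks[basket])]

    def add(self, item, basket):
        for p in self._positions(item, basket):
            self.bits[p >> 3] |= 1 << (p & 7)

    def query(self, item, basket):
        # True means "possibly in the set"; False means "definitely not".
        return all(self.bits[p >> 3] & (1 << (p & 7))
                   for p in self._positions(item, basket))


def expected_total_cost(m_bits, sizes, ks, costs):
    """Assumed cost model: sum over baskets of cost_i * false_positive_i."""
    # Approximate fraction of bits set after inserting every basket.
    fill = 1.0 - math.exp(-sum(n * k for n, k in zip(sizes, ks)) / m_bits)
    # A query in basket i is a false positive when all k_i probed bits are set.
    return sum(c * fill ** k for c, k in zip(costs, ks))


if __name__ == "__main__":
    bbf = BasketBloomFilter(10_000, [5, 9])
    bbf.add("low-cost-key", 0)
    bbf.add("high-cost-key", 1)
    print(bbf.query("low-cost-key", 0), bbf.query("high-cost-key", 1))
    # Compare a uniform k with a cost-aware split under the assumed model.
    print(expected_total_cost(10_000, [500, 500], [7, 7], [1.0, 10.0]))
    print(expected_total_cost(10_000, [500, 500], [5, 9], [1.0, 10.0]))
```

Under this assumed model, giving the costlier basket more hash functions and the cheaper basket fewer (5 and 9 instead of 7 and 7) leaves the bit-vector load unchanged but lowers the estimated total cost in the toy configuration above. This is the kind of trade-off the paper's optimization searches over.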
