期刊文献+

A Framework for Supporting Tree-Like Indexes on the Chord Overlay

A Framework for Supporting Tree-Like Indexes on the Chord Overlay
原文传递
导出
摘要 With the explosive growth of data, to support efficient data management including queries and updates, the database system is expected to provide tree-like indexes, such as R-tree, M-tree, B+-tree, according to different types of data. In the distributed environment, the indexes have to be scattered across the compute nodes to improve reliability and scalability. Indexes can speed up queries, but they incur maintenance cost when updates occur. In the distributed environment, each compute node maintains a subset of an index tree, so keeping the communication cost small is more crucial, or else it occupies lots of network bandwidth and the scalability and availability of the database system are affected. Further, to achieve the reliability and scalability for queries, several replicas of the index are needed, but keeping the replicas consistent is not straightforward. In this paper, we propose a framework supporting tree-like indexes, based on Chord overlay, which is a popular P2P structure. The framework dynamically tunes the number of replicas of index to balance the query cost and the update cost. Several techniques are designed to improve the efficiency of updates without the cost of performance of the queries. We implement M-tree and R-tree in our framework, and extensive experiments on real- life and synthetic datasets verify the efficiency and scalability of our framework. With the explosive growth of data, to support efficient data management including queries and updates, the database system is expected to provide tree-like indexes, such as R-tree, M-tree, B+-tree, according to different types of data. In the distributed environment, the indexes have to be scattered across the compute nodes to improve reliability and scalability. Indexes can speed up queries, but they incur maintenance cost when updates occur. In the distributed environment, each compute node maintains a subset of an index tree, so keeping the communication cost small is more crucial, or else it occupies lots of network bandwidth and the scalability and availability of the database system are affected. Further, to achieve the reliability and scalability for queries, several replicas of the index are needed, but keeping the replicas consistent is not straightforward. In this paper, we propose a framework supporting tree-like indexes, based on Chord overlay, which is a popular P2P structure. The framework dynamically tunes the number of replicas of index to balance the query cost and the update cost. Several techniques are designed to improve the efficiency of updates without the cost of performance of the queries. We implement M-tree and R-tree in our framework, and extensive experiments on real- life and synthetic datasets verify the efficiency and scalability of our framework.
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2013年第6期962-972,共11页 计算机科学技术学报(英文版)
基金 supported by the National Basic Research 973 Program of China under Grant No.2012CB316201 the National Natural Science Foundation of China under Grant Nos.60973021,61033007,61003060 the Fundamental Research Funds for the Central Universities of China under Grant No.N100704001
关键词 tree-like index CHORD distributed algorithm tree-like index, Chord, distributed algorithm
  • 相关文献

参考文献23

  • 1Chang F, Dean J, Ghemawat S, Hsieh W C, Wallach D A, Burrows M, Chandra T, Fikes A, Gruber R E. Bigtable: A distributed storage system for structured data. In Proc. the 7th OSDI, November 2006, pp.205-218.
  • 2Cooper B F, Ramakrishnan R, Srivastava U, Silberstein A, Bohannon P, Jacobsen H, Puz N, Weaver D, Yerneni R. Pnuts: Yahoo!'s hosted data serving platform. In Proe. the 34th VLDB, August 2008, pp.1277-1288.
  • 3DeCandia G, Hastorun D, Jampani M, Kakulapati G, Laksh- man A, Pilchin A, Sivasubramanian S, Vosshall P, Vogels W. Dynamo: Amazons highly available key-value store. In Proc. the 21st SOSP, October 2007, pp.205-220.
  • 4Dean J, Ghemawat S. MapReduce: Simplified data process- ing on large clusters. In Proc. the 6th OSDI, December 2004, pp.137-150.
  • 5Stoiea I, Morris R, Karger D, Kaashoek F, Balakrishnan H. Chord: A scalable peer-to-peer lookup service for Internet applications. In Proe. SIGCOMM, August 2001, pp.149-160.
  • 6Ratnasamy S, Francis P, Handley M, Karp R, Shenker S. A scalable contentaddressable network. In Proc. SIGCOMM, Aug. 2001, pp.161-172.
  • 7Rowstron A, Drusehel P. Pastry: Scalable, distributed object location and routing for large-scale peer-to-peer systems. In Proc. IFIP/ACM International Conference on Distributed Systems Platforms, November 2001, pp.329-350.
  • 8Tanin E, Harwood A, Samet H. Using a distributed quadtree index in peer-to-peer networks. VLDB Journal, 2007, 16(2): 165-178.
  • 9Wang J, Wu S, Gao H, Li J, Ooi B C. Indexing multi- dimensional data in a cloud system. In Proc. SIGMOD, June 2010, pp.591-602.
  • 10Wu S, Jiang D, Ooi B C, Wu K L. Efficient B-tree based in- dexing for cloud data processing. In Proc. the 36th VLDB, September 2010, pp.1207-1218.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部