A Framework for Supporting Tree-Like Indexes on the Chord Overlay

A Framework for Supporting Tree-Like Indexes on the Chord Overlay

导出

摘要 With the explosive growth of data, to support efficient data management including queries and updates, the database system is expected to provide tree-like indexes, such as R-tree, M-tree, B＋-tree, according to different types of data. In the distributed environment, the indexes have to be scattered across the compute nodes to improve reliability and scalability. Indexes can speed up queries, but they incur maintenance cost when updates occur. In the distributed environment, each compute node maintains a subset of an index tree, so keeping the communication cost small is more crucial, or else it occupies lots of network bandwidth and the scalability and availability of the database system are affected. Further, to achieve the reliability and scalability for queries, several replicas of the index are needed, but keeping the replicas consistent is not straightforward. In this paper, we propose a framework supporting tree-like indexes, based on Chord overlay, which is a popular P2P structure. The framework dynamically tunes the number of replicas of index to balance the query cost and the update cost. Several techniques are designed to improve the efficiency of updates without the cost of performance of the queries. We implement M-tree and R-tree in our framework, and extensive experiments on real- life and synthetic datasets verify the efficiency and scalability of our framework. With the explosive growth of data, to support efficient data management including queries and updates, the database system is expected to provide tree-like indexes, such as R-tree, M-tree, B＋-tree, according to different types of data. In the distributed environment, the indexes have to be scattered across the compute nodes to improve reliability and scalability. Indexes can speed up queries, but they incur maintenance cost when updates occur. In the distributed environment, each compute node maintains a subset of an index tree, so keeping the communication cost small is more crucial, or else it occupies lots of network bandwidth and the scalability and availability of the database system are affected. Further, to achieve the reliability and scalability for queries, several replicas of the index are needed, but keeping the replicas consistent is not straightforward. In this paper, we propose a framework supporting tree-like indexes, based on Chord overlay, which is a popular P2P structure. The framework dynamically tunes the number of replicas of index to balance the query cost and the update cost. Several techniques are designed to improve the efficiency of updates without the cost of performance of the queries. We implement M-tree and R-tree in our framework, and extensive experiments on real- life and synthetic datasets verify the efficiency and scalability of our framework.

作者朱命冬申德荣寇月聂铁铮于戈

机构地区 CCF ACM College of Information Science and EngineeringNortheastern University IEEE

出处《Journal of Computer Science & Technology》 SCIE EI CSCD 2013年第6期962-972,共11页 计算机科学技术学报（英文版）

基金 supported by the National Basic Research 973 Program of China under Grant No.2012CB316201 the National Natural Science Foundation of China under Grant Nos.60973021,61033007,61003060 the Fundamental Research Funds for the Central Universities of China under Grant No.N100704001

关键词 tree-like index CHORD distributed algorithm tree-like index, Chord, distributed algorithm

分类号 TP311.13 [自动化与计算机技术—计算机软件与理论] TU375.4 [建筑科学—结构工程]

引文网络
相关文献

参考文献23

1Chang F, Dean J, Ghemawat S, Hsieh W C, Wallach D A, Burrows M, Chandra T, Fikes A, Gruber R E. Bigtable: A distributed storage system for structured data. In Proc. the 7th OSDI, November 2006, pp.205-218.
2Cooper B F, Ramakrishnan R, Srivastava U, Silberstein A, Bohannon P, Jacobsen H, Puz N, Weaver D, Yerneni R. Pnuts: Yahoo!'s hosted data serving platform. In Proe. the 34th VLDB, August 2008, pp.1277-1288.
3DeCandia G, Hastorun D, Jampani M, Kakulapati G, Laksh- man A, Pilchin A, Sivasubramanian S, Vosshall P, Vogels W. Dynamo: Amazons highly available key-value store. In Proc. the 21st SOSP, October 2007, pp.205-220.
4Dean J, Ghemawat S. MapReduce: Simplified data process- ing on large clusters. In Proc. the 6th OSDI, December 2004, pp.137-150.
5Stoiea I, Morris R, Karger D, Kaashoek F, Balakrishnan H. Chord: A scalable peer-to-peer lookup service for Internet applications. In Proe. SIGCOMM, August 2001, pp.149-160.
6Ratnasamy S, Francis P, Handley M, Karp R, Shenker S. A scalable contentaddressable network. In Proc. SIGCOMM, Aug. 2001, pp.161-172.
7Rowstron A, Drusehel P. Pastry: Scalable, distributed object location and routing for large-scale peer-to-peer systems. In Proc. IFIP/ACM International Conference on Distributed Systems Platforms, November 2001, pp.329-350.
8Tanin E, Harwood A, Samet H. Using a distributed quadtree index in peer-to-peer networks. VLDB Journal, 2007, 16(2): 165-178.
9Wang J, Wu S, Gao H, Li J, Ooi B C. Indexing multi- dimensional data in a cloud system. In Proc. SIGMOD, June 2010, pp.591-602.
10Wu S, Jiang D, Ooi B C, Wu K L. Efficient B-tree based in- dexing for cloud data processing. In Proc. the 36th VLDB, September 2010, pp.1207-1218.

1开发利必达75E的竞争优势[J].今日印刷,2011(8).
2飞利浦推出新款32位MCU[J].中国集成电路,2005(1):34-34.
3飞利浦为32位微处理器设定新的性价比[J].电子产品与技术,2004(11):77-78.
4卡巴斯基实验室发布新版Windows服务器安全[J].信息安全与通信保密,2016,14(5):83-83. 被引量：1
5飞利浦推出LPC2130系列32位MCU[J].电子测试（新电子）,2005(1):87-88.
6ROSA CHEN.适用未来，最大化您的安防投资价值[J].A&S（安全&自动化）,2009(5):120-124.
7Brian Casey.集成化为运动控制提供新选择[J].电子产品世界,2004,11(04B):57-58.
8皇家飞利浦32位微处理器[J].电子产品世界,2005,12(01B):33-33.
9三星将大批量生产30纳米DDR3DRAM内存芯片[J].中国集成电路,2010(3):5-5.
10卢敏.变革您的数据中心[J].软件世界,2008(7):73-73.

Journal of Computer Science & Technology

2013年第6期

浏览历史

内容加载中请稍等...

A Framework for Supporting Tree-Like Indexes on the Chord Overlay

参考文献23

相关作者

相关机构

相关主题

浏览历史