期刊文献+

基于并发跳表的云数据处理双层索引架构研究 被引量:5

Concurrent Skiplist Based Double-Layer Index Framework for Cloud Data Processing
下载PDF
导出
摘要 云数据处理在云计算基础设施中占有极其关键的地位.然而,当前的云存储系统绝大部分都采用基于分布式Hash的健-值对模式来组织数据,在范围查询方面支持不理想、且动态实时性差,有必要构建云环境下辅助动态索引.通过总结、分析云环境中辅助双层索引机制,提出一种基于并发跳表的云数据处理双层索引架构.该架构采用两层体系结构,突破单台机器内存和硬盘的限制,从而扩展系统整体的索引范围.通过动态分裂算法解决局部服务器中的热点问题,保证索引结构整体的负载均衡.通过并发跳表来提高全局索引的承载性能,改善了全局索引的并发性,提高整体索引的吞吐率.实验结果表明,基于并发跳表的云数据处理双层索引架构能够有效支持单键查询和范围查询,具有较强的可扩展性和并发性,是一种高效的云存储辅助索引. Cloud data processing plays an essential infrastructure in cloud systems.Without efficient structures,cloud systems cannot support the necessary high throughput and provide services for millions of users.However,most existing cloud storage systems generally adopt a distributed Hash table(DHT)approach to index data,which lacks to support range-query and dynamic real-time character.It is necessary to generate a scalable,dynamical and multi-query functional index structure in cloud environment.Based on the summary and analysis of the double-layer index systems for cloud storage,this paper provides a novel concurrent skiplist based double-layer index(referred as CSDindex)for cloud data processing.Two-layer architecture,which can breakthrough single machine memory and hard drive limitation,is used to extend indexing scope.Online migration algorithm of skiplist's nodes between local servers is used to make dynamic load-balancing.The details of the design and the implement of the concurrent skiplist are discussed in this paper.Optimistic concurrency control(OCC)technique is introduced to enhance the concurrency.Through concurrent skiplist CSDindex improves the load bearing capacity of the global index and enhances the overall throughput of the index.Experimental results show the efficiency of the concurrent skiplist based double-layer index and it has viability as an alternative approach for cloud-suitable data structures.
出处 《计算机研究与发展》 EI CSCD 北大核心 2015年第7期1531-1545,共15页 Journal of Computer Research and Development
基金 国家自然科学基金项目(61363021 61363084) 云南省软件工程重点实验室开放基金项目(2011SE01 2012SE304) 云南省青年基金项目(2012FD004) 云南省教育厅科学研究基金项目(2014Y013)
关键词 云计算 双层索引 并发跳表 范围查询 乐观并发控制 cloud computing double-layer index concurrent skiplist range query optimistic concurrency control
  • 相关文献

参考文献31

  • 1Amazon Inc. Amazcn elastic compute cloud (Amazon EC2) [EB/OL]. [2014-03-25]. http://aws, amazon, com/ec2.
  • 2IBM. IBM introduces ready-to-use cloud computing [EB/OL]. [2014- 03-25]. http://www03, ibm. com/press/ us/en/pressrelease/22613, wss.
  • 3Fay C, Jeffrey D, Sanjay G, et al. Bigtable: A distributed storage system for structured data [J]. ACM Trans on Computer Systems(TOCS), 2008, 26(2) : No. 4.
  • 4Microsoft Inc. Windows azure cloud [EB/OL]. [2014-03- 25]. http://www, windowsazure, com.
  • 5Sanjay G, Howard G, Leung S. The google file system [C] //Proc of the 19th ACM Symp on Operating Systems Principles(SOSP'03). New York: ACM, 2003:29-43.
  • 6The Apache Software Foundation. Hadoop [EB/OL]. [2014- 03- 25]. http://hadoop, apache, org/.
  • 7Giuseppe D, Deniz H, Madan J, et al. Dynamo: Amazon's highly available key value store [C] //Proe of the 21st ACM SIGOPS Syrup on Operating Systems Principles(SOSP'07). New York: ACM, 2007:205-220.
  • 8The Apache Software Foundation. Cassandra. [EB/OL]. [2014- 03- 25]. http://cassandra, apache, org/.
  • 9Yang H C, Parker D S. Traverse: Simplified indexing on large map-reduce-merge clusters [C] //Proc of Database Systems for Advanced Applications. Berlin: Springer, 2009: 308-322.
  • 10Dittrich J, Quianfi-Ruiz J A, Jindal A, et al. Hadoop+ + Making a yellow elephant run like a cheetah (without it even noticing)[J]. Proceedings of the VLDB Endowment, 2010, 3(1/2) :515-529.

二级参考文献36

  • 1Armbrust Michael, Fox Armando, Griffith Rean et al. A view of cloud computing. Communications of the ACM, 2010, 53(4): 50-58.
  • 2Yang H-C, Dasdan A, Hsiao R L, Parker D S. Map-reduce merge: Simplified relational data processing on large clus- ters//Proceedings of the SIGMOD 2007. Beijing, China, 2007:1029-1040.
  • 3Chowdhury N M Mosharaf Kabir, Boutaba Raouf. A survey of network virtualization. Computer Networks, 2010, 54 (5) : 862-876.
  • 4Seshadri P, Pirahesh H, Leung T Y C. Complex query decorrelation//Proceedings of the ICDE. New Orleans, LA, 1996 : 450-458.
  • 5Canahuate Guadalupe, Apaydin Tan, Sacan Ahmet, Ferha- tosmanoglu Hakan. Secondary bitmap indexes with vertical and horizontal partitioning//Proeeedings of the EDBT. Saint Petersburg, Russia, 2009:600-611.
  • 6Sadoghi Mohammad, Jacobsen Hans-Arno. Be-tree: An in- dex structure to efficiently match boolean expressions over high-dimensional discrete spaee//Proceedings of the S1G- MOD Conference. Athens, Greece, 2011:637-648.
  • 7Chang Fay, Dean Jerey, Ghemawat Sanjay et al. Bigtable: A distributed storage system for structured data//Proceedings of the OSDI. Seattle, Washington, USA, 2006:205-218.
  • 8Apache HBase Project. http: //hbase. apache, org/.
  • 9HBase Transactional Index. https: //github. eom/hbase- trx/hbase-transactional-tableindexed.
  • 10Aguilera Marcos Kawazoe, Golab Wojciech M, Shah Mehul A. A practical scalable distributed B-tree//Proceedings of the VLDB. Auckland, New Zealand, 2008:598-609.

共引文献41

同被引文献32

引证文献5

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部