
Distributed secondary index based on LSM Tree
Abstract: In recent years, the Log-Structured Merge (LSM) Tree has been widely adopted in NoSQL systems. The main reason is that the LSM Tree architecture defers updates and writes in batches, converting random writes into batch writes; this reduces the cost of disk-arm movement and greatly improves database write performance. Read performance, however, suffers as a result. The essential differences between the LSM Tree and the B Tree make it unsuitable for NoSQL systems to adopt the B Tree directly as a secondary index structure. This paper implements a distributed secondary index structure on the LSM Tree, proposes an index bulk-loading strategy for this read/write-separated architecture, and caches the LSM Tree query plan tree to avoid repeated query parsing, so that index read performance improves accordingly.
Source: Journal of East China Normal University (Natural Science), 2016, No. 5, pp. 36-44, 66 (10 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.
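Funding: National High Technology Research and Development Program of China (863 Program) (2015AA015307); National Natural Science Foundation of China (U1401256, 61402180, 61402177); CCF-Tencent Joint Research Fund (AGR20150114); Natural Science Foundation of Shanghai (14ZR1412600).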
Keywords: secondary index; LSM Tree (log-structured merge tree); NoSQL
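
To make the abstract's write/read trade-off concrete, the following is a minimal single-process sketch, not the paper's implementation. It assumes a toy LSM tree (`TinyLSM`) that buffers writes in memory and flushes them as sorted runs, plus a secondary index kept in a second LSM tree and keyed by `value + "\x00" + primary key`. All class names, the `memtable_limit` parameter, and the lazy validation of stale index entries are illustrative assumptions; distribution, bulk loading, and query plan caching are not modeled.

```python
import bisect


class TinyLSM:
    """Toy LSM tree: an in-memory buffer plus immutable sorted runs (hypothetical)."""

    def __init__(self, memtable_limit=4):
        self.memtable_limit = memtable_limit
        self.memtable = {}  # recent writes, held in memory
        self.runs = []      # flushed runs, oldest first; each is a sorted list of (key, value)

    def put(self, key, value):
        # A write only touches memory; the "disk" write is deferred to flush().
        self.memtable[key] = value
        if len(self.memtable) >= self.memtable_limit:
            self.flush()

    def flush(self):
        # One sorted batch stands in for a single sequential write.
        if self.memtable:
            self.runs.append(sorted(self.memtable.items()))
            self.memtable = {}

    def get(self, key):
        # A read may consult the buffer and every run: the read penalty.
        if key in self.memtable:
            return self.memtable[key]
        for run in reversed(self.runs):  # the newest run wins
            i = bisect.bisect_left(run, (key,))
            if i < len(run) and run[i][0] == key:
                return run[i][1]
        return None

    def scan(self, lo, hi):
        """Return {key: value} for lo <= key < hi, newest version winning."""
        out = {}
        for run in self.runs:  # oldest first, so later runs overwrite
            i = bisect.bisect_left(run, (lo,))
            while i < len(run) and run[i][0] < hi:
                out[run[i][0]] = run[i][1]
                i += 1
        for key, value in self.memtable.items():
            if lo <= key < hi:
                out[key] = value
        return out


class IndexedTable:
    """Base table plus one secondary index, both stored as TinyLSM (hypothetical)."""

    def __init__(self, column):
        self.column = column
        self.base = TinyLSM()
        self.index = TinyLSM()

    def put(self, pk, row):
        # Assumes string keys and column values that never contain "\x00".
        self.base.put(pk, row)
        self.index.put(f"{row[self.column]}\x00{pk}", pk)

    def find_by(self, value):
        hits = self.index.scan(f"{value}\x00", f"{value}\x01")
        # Updates leave stale index entries behind; re-check each hit
        # against the base table instead of deleting eagerly.
        return sorted(pk for pk in hits.values()
                      if self.base.get(pk)[self.column] == value)
```

In this sketch, a call such as `IndexedTable("city").find_by("Shanghai")` resolves to a range scan over the index followed by point reads on the base table. Validating hits at read time keeps the write path append-only, at the price of extra base-table lookups per query; other designs clean up the old index entry at write time instead.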

