期刊文献+

一种基于HBase的数据持久性和可用性研究 被引量:11

Research of Data Durable and Available Base on HBase
下载PDF
导出
摘要 HBase(Hadoop DataBase)是Apache Hadoop项目下的一款非关系型数据库,它是一个基于列簇的开源数据存储系统,关于HBase的研究和应用越来越受到关注.由于HBase会在内存缓存数据后写文件系统,所以缓存的大小成为影响系统性能的一个重要因素.本文提出一种基于备份日志的持久性、可用性方案Remote Log Process,使得HBase能够在不同的缓存规模获得更好的写性能.实验证明,在保证数据的持久性和可用性前提下,RLP能够在不同的缓存大小下获得稳定的性能,并且在缓存不超过默认设置时明显提高写操作时间性能. HBase, a NoSql database under Apache Hadoop, is an open source data storage system based on column family. Researches and applications based on HBase is more and more popular. But the size of memory buffer become a key factor to influence system performance as HBase will buffer data in memory before store them on file system. In this paper, we provide a new method based on copied log named Remote Log Process to make HBase perform better on write operation with different buffer size while keeping data durable and available. Experiments result indicates RLP can get a steady performance with different buffer size under the condition to guarantee durable and available of input data, while perform much better than pristine systems if the buffer isn't larger then default value.
出处 《计算机系统应用》 2013年第10期175-180,共6页 Computer Systems & Applications
基金 江苏省产学研前瞻性联合研究(BY2009128) 江苏省自然科学基金(BK2012194) 国家自然科学基金(61272131)
关键词 HBASE 持久性 可用性 预写日志 写操作效率 HBase durable available write ahead log write performance
  • 相关文献

参考文献7

  • 1Kubiatowicz J, Bindel D, Chen Y, Czerwinski S, Eaton P, GeelsD, Gummadi R, Rhea S,Weatherspoon H,Wells C,Zhao B.OceanStore: an architecture for global-scale persistent storage.SIGARCH Comput. Archit. News 28,5 (Dec.2000),190-201.
  • 2HBase. http://hbase.apache.org/.
  • 3Chang F,Dean J, Ghemawat S, Hsieh WC. Bigtable: A distri-buted storage system for structured data. ACM Trans, onComputer Systems(TOCS) 2008,26(2).
  • 4Tom White,曾大聃,周傲英,周敏译.Hadoop权威指南.北京:清华大学出版社,2010:366~429.
  • 5Lars George.HBase:The Definitive Guide(影印版).南京:东南大学出版社,2012:315-384.
  • 6Ousterhout J,Agrawal P, Erickson D, Kozyrakis C,Leverich J,Mazi^res D,Mitra S,Narayanan A, Parulkar G, RosenblumM,Rumble SM, Stratmann E, Stutsman R. The case forRAMClouds: Scalable high-performance storage entirely inDRAM. SIGOPS Operating Systems Review, December2009,43(4): 92-105.
  • 7Dai D,Li X,Wang C, Sun MM, Zhou XH. Sedna: A memorybased key-value storage system for real time processing incloud. The 2012 International Conference on ClusterComputing Workshops(IASDS 2012) in Conjunction withIEEE Cluster* 12, September 2012: 24-28.

同被引文献63

  • 1崔杰,李陶深,兰红星.基于Hadoop的海量数据存储平台设计与开发[J].计算机研究与发展,2012,49(S1):12-18. 被引量:141
  • 2李斯伟.基于IP的SAN存储技术研究[J].电讯技术,2004,44(3):132-135. 被引量:4
  • 3Dimiduk N.HBase实战[M].谢磊,译.北京:人民邮电出版社,2013.
  • 4莫扎特.大数据和NoSQL:关系型数据库的不足之处[EB/OL].(2014-08-11)[2016-03-10].http://www.36dsj.tom/archives/11078.
  • 5蒋焱峰.HBase管理指南[M].北京:人民邮电出版社,2013.
  • 6郝树魁.分布式存储系统HBase原理解析[EB/OL].(2010-12-16)[2016-03-10].http://www.paper.edu.cn/releasepa-per/content/201012-591.
  • 7Fu Zhicheng,Liu Chen. A general research on database migra- tion from RDBMS to HBase[ EB/OL ]. (2015-03-17) [ 2016- 03-07 ]. http://www, paper, edu. cn/releasepaper/content/ 201503-145.
  • 8百度百科.关系型范式[EB/OL].[2016-03-07].http://baike.SO.com/doc/4367825-4573590.html.
  • 9陈荣鑫,付永钢,陈维斌.基于Pentaho的商业智能系统[J].计算机工程与设计,2008,29(9):2407-2409. 被引量:16
  • 10卢冬海,何先波.浅析NoSQL数据库[J].中国西部科技,2011,10(2):15-16. 被引量:17

引证文献11

二级引证文献51

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部