期刊文献+

基于纠删码的分布式存储系统中针对多数据失效的树型并行修复技术(英文) 被引量:5

Tree-Structured Parallel Regeneration for Multiple Data Losses in Distributed Storage Systems Based on Erasure Codes
下载PDF
导出
摘要 To reduce the time required to complete the regeneration process of erasure codes, we propose a Tree-structured Parallel Regeneration (TPR) scheme for multiple data losses in distributed storage systems. Under the scheme, two algorithms are proposed for the construction of multiple regeneration trees, namely the edge-disjoint algorithm and edge-sharing algorithm. The edge-disjoint algorithm constructs multiple independent trees, and is simple and appropriate for environments where newcomers and their providers are distributed over a large area and have few intersections. The edge-sharing algorithm constructs multiple trees that compete to utilize the bandwidth, and make a better utilization of the bandwidth, although it needs to measure the available band-width and deal with the bandwidth changes; it is therefore difficult to implement in practical systems. The parallel regeneration for multiple data losses of TPR primarily includes two optimizations: firstly, transferring the data through the bandwidth optimized-paths in a pipe-line manner; secondly, executing data regeneration over multiple trees in parallel. To evaluate the proposal, we implement an event-based simulator and make a detailed comparison with some popular regeneration methods. The quantitative comparison results show that the use of TPR employing either the edge-disjoint algorithm or edge-sharing algorithm reduces the regeneration time significantly. To reduce the time required to complete the regeneration process of erasure codes, we propose a Tree-structured Parallel Regeneration (TPR) scheme for multiple data losses in distributed storage systems. Under the scheme, two algorithms are proposed for the construction of multiple regeneration trees, namely the edge-disjoint algorithm and edge- sharing algorithm. The edge-disjoint algorithm constructs multiple independent trees, and is simple and appropriate for environments where newcomers and their providers are distributed over a large area and have few intersections. The edge-sharing algorithm constructs multiple trees that compete to utilize the bandwidth, and make a better utilization of the bandwidth, alth- ough it needs to measure the available band- width and deal with the bandwidth changes; it is therefore difficult to implement in practical systems. The parallel regeneration for multiple data losses of TPR primarily includes two op- timizations: firstly, transferring the data thr ough the bandwidth optimized-paths in a pipe line manner; secondly, executing data regen eration over multiple trees in parallel. To eva- luate the proposal, we implement an event based simulator and make a detailed compari son with some popular regeneration methods. The quantitative comparison results show that the use of TPR employing either the edge-dis joint algorithm or edge-sharing algorithm re duces the regeneration time significantly.
出处 《China Communications》 SCIE CSCD 2013年第4期113-125,共13页 中国通信(英文版)
基金 supported by the National Grand Fundamental Research of China (973 Program) under Grant No. 2011CB302601 the National High Technology Research and Development of China (863 Program) under GrantNo. 2013AA01A213 the National Natural Science Foundation of China under Grant No. 60873215 the Natural Science Foundation for Distinguished Young Scholars of Hunan Province under Grant No. S2010J5050 Specialized Research Fund for the Doctoral Program of Higher Education under Grant No. 20124307110015
关键词 分布式存储系统 再生过程 数据丢失 并行 纠删码 带宽利用率 算法构造 树形 distributed storage system erasure code replication regeneration tree
  • 相关文献

参考文献13

  • 1WEATHERSPOON H, KUBIATOWICZ J. Erasure Coding vs. Replication: A Quantitative Comparison[C]// Proceedings of the 1st International Workshop on Peer-to-Peer Systems (IPTPS'02): March 7-8, 2002. Cambridge, MA, USA. 2002: 328-337.
  • 2RODRIGUES R, LISKOV B. High Availability in DHTs: Erasure Coding vs. Replication[C]// Proceedings of 4th International Workshop on Peer-to-Peer Systems (IPTPS'05): February 24- 25, 2005. Ithaca, NY, USA. Springer, 2005: 226- 239.
  • 3WU Yunnan, DIMAKISY A, RAMCHANDRANY K. Deterministic Regenerating Codes for Distributed Storage[C]// Proceedings of Allerton Conference on Communication, Control and Computing: September 26-28, 2007. Monti- cello, Illinois, USA. Springer, 2007: 2l5-223.
  • 4ACEDANSKI S, DEB S, MEDARD M, et al. How Good is Random Linear Coding Based Distributed Networked Storage[C]// Proceedings of Workshop on Network Coding, Theory and Applications: April 7, 200S. Riva del Garda, Italy. 2005: 1-6.
  • 5LI Jun, YANG Shuang, WANG Xin, et al. TreeStructured Data Regeneration in Distributed Storage Systems with Regenerating Codes[C]// Proceedings of International Conference on Computer Communications (INFOCOM) 2010: March IS-19, 2010. San Diego, California, USA. IEEE Computer Society, 2010: 1-9.
  • 6HU Yuchong, XU Yinlong, WANG Xiaozhao, et al. Cooperative Recovery of Distributed Storage Systems from Multiple Losses with Network Coding[J]. IEEE Journal on Selected Areas in Communications, 2010, 28(2): 268-276.
  • 7LEE S L, SHARMA P, BANERJEE S, et al. Measuring Bandwidth Between PlanetLab Nodes in PAM[C]//Proceedings of 6th International Conference on Passive and Active Network Measurement: March 31-Aprill, 2005. Boston, MA, USA. Springer-Verlag Berlin, 2005: 292-305.
  • 8HUANG Zhen, LIN Yisong, PENG Yuxing. Robust Redundancy Scheme for the Repair Process: Hierarchical Codes in the BandwidthLlimited Systems[J]. Journal of Grid Computing, 2012, 2012(10): 579-597.
  • 9王意洁,孙伟东,周松,裴晓强,李小勇.云计算环境下的分布存储关键技术[J].软件学报,2012,23(4):962-986. 被引量:280
  • 10HUANG Cheng, SIMITCI H, XU Yikang, et al. Erasure Coding in Windows Azure Storage[C]// Proceedings of the 2012 USENIX Conference on Annual Technical Conference, Boston, MA, USA, USENIX Association, 2012:2-2.

二级参考文献1

共引文献279

同被引文献9

引证文献5

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部