
A Dynamic Deduplication Method with Application-Aware in Cloud Environment (云环境下应用感知的动态重复数据删除机制)

Cited by: 3
Abstract Traditional online-only or offline-only deduplication achieves low deduplication efficiency in cloud storage systems. To address this, a hybrid deduplication mechanism (Hy-Dedup) is adopted that combines the online and offline modes. In the online stage, the method uses fingerprint caching and clusters the fingerprint indices into groups according to workload type. It estimates the temporal locality of duplicate data in the data stream and evaluates the spatial locality by setting different deduplication thresholds, which reduces disk fragmentation and raises the cache hit rate. In the offline stage, a latency-sensitive method performs exact deduplication on the duplicate blocks that missed the cache during the online stage because they lacked locality. This hybrid approach significantly reduces the amount of duplicate data written to cloud storage while maintaining the system's I/O performance and throughput. Experimental results show that, compared with iDedup, Hy-Dedup improves the online deduplication ratio by up to 35.9% and reduces the disk space requirement by 41.36%. The proposed method achieves high-accuracy deduplication in cloud storage systems, improves deduplication efficiency, and saves storage space.
Authors HE Qinlu (贺秦禄); BIAN Genqing (边根庆); SHAO Bilin (邵必林); JIA Leigang (贾雷刚). School of Information and Control Engineering, Xi'an University of Architecture and Technology, Xi'an 710055, China; School of Management, Xi'an University of Architecture and Technology, Xi'an 710055, China
Source Journal of Xi'an Jiaotong University (《西安交通大学学报》; indexed in EI, CAS, CSCD, and the Peking University Core Journals list), 2018, No. 10, pp. 24-30 (7 pages)
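Funding National Natural Science Foundation of China (61672416); Natural Science Basic Research Plan of Shaanxi Province (2018JM6105); Talent Fund of Xi'an University of Architecture and Technology (RC1707)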
Keywords online deduplication; offline deduplication; cache; cloud storage
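
The two-stage idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the class name `HybridDedup`, the SHA-256 fingerprint, the LRU cache policy, the per-workload threshold values, and the in-memory block store are all assumptions made for illustration. The online path deduplicates a run of consecutive cache hits only when the run reaches the threshold for its workload type; shorter runs are written as-is and left for the offline pass, which deduplicates exactly over the whole store. The latency-sensitive scheduling of the offline pass is not modeled here.

```python
import hashlib
from collections import OrderedDict


def fingerprint(block: bytes) -> str:
    """Content fingerprint of a block (SHA-256 is an assumption)."""
    return hashlib.sha256(block).hexdigest()


class HybridDedup:
    """Toy hybrid deduplicator: online cache lookup plus an offline exact pass."""

    def __init__(self, thresholds, default_threshold=2, cache_size=1024):
        self.thresholds = thresholds            # workload type -> minimum duplicate run length
        self.default_threshold = default_threshold
        self.cache_size = cache_size
        self.caches = {}                        # workload type -> LRU cache: fingerprint -> block id
        self.store = {}                         # block id -> bytes (stands in for the disk)
        self.logical = []                       # block ids in write order
        self.next_id = 0

    def _store(self, block):
        block_id = self.next_id
        self.next_id += 1
        self.store[block_id] = block
        return block_id

    def _flush_run(self, workload, run):
        """Deduplicate a run of cache hits only if it is long enough."""
        threshold = self.thresholds.get(workload, self.default_threshold)
        for block, cached_id in run:
            if len(run) >= threshold:
                self.logical.append(cached_id)            # reference the existing block
            else:
                self.logical.append(self._store(block))   # write now, dedup offline

    def write(self, workload, blocks):
        """Online stage: look fingerprints up in the cache group of this workload."""
        cache = self.caches.setdefault(workload, OrderedDict())
        run = []
        for block in blocks:
            fp = fingerprint(block)
            if fp in cache:
                cache.move_to_end(fp)                     # refresh LRU position
                run.append((block, cache[fp]))
                continue
            self._flush_run(workload, run)                # a miss ends the current run
            run = []
            block_id = self._store(block)
            self.logical.append(block_id)
            cache[fp] = block_id
            if len(cache) > self.cache_size:
                cache.popitem(last=False)                 # evict least recently used
        self._flush_run(workload, run)

    def offline_pass(self):
        """Offline stage: exact dedup over all stored blocks; returns blocks reclaimed."""
        canonical, remap = {}, {}
        for block_id in sorted(self.store):
            fp = fingerprint(self.store[block_id])
            remap[block_id] = canonical.setdefault(fp, block_id)
        self.logical = [remap[b] for b in self.logical]
        for cache in self.caches.values():
            for fp, block_id in list(cache.items()):
                cache[fp] = remap[block_id]               # keep caches pointing at live blocks
        redundant = [b for b, target in remap.items() if b != target]
        for b in redundant:
            del self.store[b]
        return len(redundant)

    def read(self):
        return [self.store[b] for b in self.logical]


dedup = HybridDedup(thresholds={"vm": 2, "mail": 4})
dedup.write("vm", [b"A", b"B", b"C"])
dedup.write("vm", [b"A", b"B", b"X", b"C"])   # run A,B is deduplicated online; lone C is not
print(len(dedup.store), dedup.offline_pass(), len(dedup.store))
```

In this sketch a higher threshold trades online deduplication ratio for less fragmentation, since only long sequential duplicate runs are replaced by references; whatever the online stage skips is still reclaimed by `offline_pass`.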
