Abstract
As the scale of data backups keeps growing, network bandwidth has become the bottleneck of remote data backup systems. To address this problem, this paper proposes DeduBS, a remote backup system that performs data deduplication based on Hash matching. By deduplicating data, DeduBS avoids transmitting duplicate data during backup and so improves network transmission efficiency. DeduBS builds a Hash library on both the source node and the destination node to store the Hash values of data blocks. Before transmission, it compares Hash values to decide whether a block is a duplicate, and it transmits only the Hash values of duplicate blocks together with the non-duplicate data; the receiver restores duplicate data through its own Hash library. Experimental results show that DeduBS reduces the amount of data sent over the network and improves backup efficiency while lowering cost and saving energy.
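The transfer scheme described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the hash function, the block size, or how the Hash libraries are stored, so the choices below (SHA-256, a tiny fixed block size, an in-memory set on the source and a dict on the destination) and all function names are assumptions made for illustration only, not the DeduBS implementation.

```python
import hashlib

BLOCK_SIZE = 4  # assumption: fixed-size blocks, kept tiny for demonstration


def block_hash(block: bytes) -> str:
    # Assumption: SHA-256 stands in for whatever hash DeduBS actually uses.
    return hashlib.sha256(block).hexdigest()


def split_blocks(data: bytes, size: int = BLOCK_SIZE) -> list:
    # Cut the input into consecutive fixed-size blocks (the last may be shorter).
    return [data[i:i + size] for i in range(0, len(data), size)]


def encode(data: bytes, source_hashes: set) -> list:
    """Source node: send only the hash for known blocks, raw data otherwise."""
    messages = []
    for block in split_blocks(data):
        h = block_hash(block)
        if h in source_hashes:
            messages.append(("ref", h))      # duplicate: hash only
        else:
            source_hashes.add(h)             # remember the new block
            messages.append(("raw", block))  # non-duplicate: full data
    return messages


def decode(messages: list, dest_store: dict) -> bytes:
    """Destination node: rebuild the data, resolving hashes via its library."""
    out = []
    for kind, payload in messages:
        if kind == "raw":
            dest_store[block_hash(payload)] = payload  # add to Hash library
            out.append(payload)
        else:
            out.append(dest_store[payload])            # restore duplicate
    return b"".join(out)


if __name__ == "__main__":
    src_lib, dst_lib = set(), {}
    data = b"abcdabcdefghabcd"
    msgs = encode(data, src_lib)
    print([kind for kind, _ in msgs])
    print(decode(msgs, dst_lib) == data)
```

In this sketch a later backup that reuses blocks already seen is sent as hash references only, which is the source of the bandwidth saving the abstract reports. The sketch assumes the two Hash libraries stay synchronized; how DeduBS guarantees that is not stated in the abstract.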
Source
《河北工业大学学报》
CAS
2015, Issue 4, pp. 32-37 (6 pages)
Journal of Hebei University of Technology
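Funding
Youth Fund of the Education Department of Hebei Province (QN2014192)
Natural Science Foundation of Hebei Province (F2013202138)
Key Project of the Education Department of Hebei Province (ZH2012038)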
Keywords
backup
data deduplication
Hash value
network transmission