
Duplicate Data Remove Algorithm of Cloud Storage System Based on Fractional Fourier Transform

Original title: 基于分数阶Fourier变换的云存储系统重复数据删除算法 (cited by: 2)
Abstract Duplicate data is one form of the large volume of redundant data held in cloud storage systems, and removing it effectively and promptly helps keep the system stable and running. Because cloud storage systems contain a great deal of interfering data and the signal-to-noise ratio (SNR) is low, traditional deduplication algorithms produce false peaks in the fractional Fourier domain and cannot effectively detect, filter, and delete duplicate data. An improved duplicate data removal algorithm for cloud storage systems is therefore proposed, based on fractional Fourier transform cumulant detection. First, the architecture of the deduplication mechanism in the cloud storage system is analyzed, a fitness function for data storage points is defined, and the random probability distribution over system subsets of the cloud storage nodes is obtained. An empirical constraint function is then used to distribute the check data blocks across storage nodes, and the fractional Fourier transform is applied to pre-filter the residual signal of the amplitude-modulated components in the cloud storage system. Next, a fourth-order cumulant slice post-operator is applied: each file is divided into several blocks, deduplication is performed on each file block, and post-filtering for duplicate detection is carried out, so that duplicate data on the storage resources is detected and deleted. Simulation results show that the algorithm improves the utilization of computing resources in clustered cloud storage systems, achieves a high rate of accurate duplicate deletion, and effectively avoids the erroneous and missed deletions caused by interference in the data stream.
Source Computer Science (《计算机科学》), indexed in CSCD and the Peking University Core Journals list (北大核心), 2015, No. 7, pp. 174-177, 209 (5 pages)
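Funding Supported by the Guangxi Natural Science Foundation Youth Fund (2013GXNSFBA019268), the Guangxi University of Science and Technology Natural Science Fund (校科自1261126), the Guangxi Characteristic Specialty Construction Project (GXTSZY217), the Guangxi Department of Education General Project (YB2014208), and the Guangxi Department of Education Approved Project (LX2014182)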
Keywords Fractional Fourier transform; Cloud storage; Duplicate data
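
The abstract describes dividing each file into blocks and deduplicating block by block. The sketch below illustrates only that block-level step, and it swaps in plain SHA-256 fingerprint matching instead of the fractional Fourier transform cumulant detection proposed in the paper, whose details are not given here. The function names (`split_into_blocks`, `deduplicate`, `restore`) and the fixed block size are assumptions made for illustration.

```python
import hashlib


def split_into_blocks(data: bytes, block_size: int = 4096) -> list[bytes]:
    """Cut a byte string into fixed-size blocks (the last one may be shorter)."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def deduplicate(files: dict[str, bytes], block_size: int = 4096):
    """Store each distinct block once and keep a per-file list of fingerprints.

    Assumption: duplicates are detected by SHA-256 fingerprint equality,
    not by the detection method of the paper.
    """
    store: dict[str, bytes] = {}        # fingerprint -> block contents
    recipes: dict[str, list[str]] = {}  # file name -> ordered fingerprints
    for name, data in files.items():
        recipe = []
        for block in split_into_blocks(data, block_size):
            fingerprint = hashlib.sha256(block).hexdigest()
            store.setdefault(fingerprint, block)  # keep only the first copy
            recipe.append(fingerprint)
        recipes[name] = recipe
    return store, recipes


def restore(name: str, store: dict[str, bytes], recipes: dict[str, list[str]]) -> bytes:
    """Rebuild a file from its fingerprint list."""
    return b"".join(store[fp] for fp in recipes[name])


if __name__ == "__main__":
    files = {"a.bin": b"A" * 8 + b"B" * 8, "b.bin": b"B" * 8 + b"C" * 8}
    store, recipes = deduplicate(files, block_size=8)
    print(len(store), "unique blocks stored for", len(files), "files")
    print(restore("a.bin", store, recipes) == files["a.bin"])
```

Fixed-size blocks keep the example short. A scheme that must cope with noisy or shifted data, which is the setting the abstract targets, would need a more robust detector than exact fingerprint equality.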
