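Abstract: When a flash-based storage system overwrites existing data, the flash translation layer (FTL) allocates a new physical address for the write, so the old data remains in the system for some time. Implementing continuous data protection (CDP) on such a system therefore only requires locating the old mapping information; no separate record of data updates has to be kept. The method records a timestamp in the flash out-of-band (OOB) area and adds timestamp information to each FTL mapping entry, which allows fast data recovery without tracking every update to the mapping table. This approach reduces the impact on storage system performance and provides a fast data-recovery response.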
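The idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class and method names (`TimestampedFTL`, `read_as_of`) are hypothetical, flash is modeled as an append-only list of pages, and garbage collection is ignored, so every old version is assumed to still be present when recovery is requested.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class Page:
    data: bytes
    lpn: int        # logical page number, stored in the OOB area (assumed layout)
    timestamp: int  # write time, stored in the OOB area (assumed layout)


class TimestampedFTL:
    """Toy FTL: out-of-place writes plus a timestamp in every mapping entry."""

    def __init__(self) -> None:
        self.flash: List[Page] = []                   # physical pages, append-only
        self.mapping: Dict[int, Tuple[int, int]] = {}  # lpn -> (ppn, timestamp)

    def write(self, lpn: int, data: bytes, now: int) -> None:
        # An overwrite never touches the old page; it only redirects the mapping.
        self.flash.append(Page(data, lpn, now))
        self.mapping[lpn] = (len(self.flash) - 1, now)

    def read(self, lpn: int) -> Optional[bytes]:
        entry = self.mapping.get(lpn)
        return None if entry is None else self.flash[entry[0]].data

    def read_as_of(self, lpn: int, t: int) -> Optional[bytes]:
        """Return the newest version of lpn written at or before time t."""
        entry = self.mapping.get(lpn)
        if entry is not None and entry[1] <= t:
            # Fast path: the timestamp in the mapping entry shows that the
            # current version already existed at time t.
            return self.flash[entry[0]].data
        # Slow path: scan OOB metadata for the newest older version.
        best: Optional[Page] = None
        for page in self.flash:
            if page.lpn == lpn and page.timestamp <= t:
                if best is None or page.timestamp >= best.timestamp:
                    best = page
        return None if best is None else best.data
```

For example, after writing logical page 0 at times 1 and 5, `read_as_of(0, 3)` returns the first version while `read(0)` returns the second. A real device would also have to keep the garbage collector from reclaiming pages that fall inside the protection window.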
Funding: Funded in part by the U.S. National Science Foundation under Grant Nos. CCF-0811670 and CNS-0720456, by a gift from Cisco Systems, Inc., and partially through the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.
Abstract: The poor performance of random writes is a major concern that must be addressed to better utilize the potential of flash in enterprise-scale environments. We examine one of the important causes of this poor performance: the design of the flash translation layer (FTL), which performs the virtual-to-physical address translations and hides the erase-before-write characteristics of flash. We propose a complete paradigm shift in the design of the core FTL engine from the existing techniques with our Demand-Based Flash Translation Layer (DFTL), which selectively caches page-level address mappings. Our experimental evaluation using FlashSim with realistic enterprise-scale workloads endorses the utility of DFTL in enterprise-scale storage systems by demonstrating 1) improved performance, 2) reduced garbage collection overhead, and 3) better overload behavior compared with hybrid FTL schemes, which are the most popular implementation methods. For example, a predominantly random-write I/O trace from an OLTP application running at a large financial institution shows a 78% improvement in average response time (due to a 3-fold reduction in garbage collector operations) compared with the hybrid FTL scheme. Even for the well-known read-dominant TPC-H benchmark, for which DFTL introduces additional overheads, we improve system response time by 56%. Moreover, when the write-back cache on a DFTL-based SSD is enabled, DFTL even outperforms the page-based FTL scheme, improving response time by 72% on the Financial trace.
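Selective caching of page-level mappings can be sketched as follows. This is an illustration under stated assumptions, not the DFTL design itself: the class name `DemandMappingCache` and its counters are hypothetical, a plain LRU policy stands in for whatever replacement policy the paper uses, and the full mapping table kept on flash is modeled as an ordinary dictionary.

```python
from collections import OrderedDict
from typing import Dict, Optional, Tuple


class DemandMappingCache:
    """Toy demand-based mapping cache: only hot page-level entries stay in RAM."""

    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        # lpn -> (ppn, dirty); insertion order doubles as LRU order.
        self.cache: "OrderedDict[int, Tuple[int, bool]]" = OrderedDict()
        self.flash_table: Dict[int, int] = {}  # full mapping table "on flash"
        self.translation_reads = 0             # simulated translation-page reads
        self.translation_writes = 0            # simulated translation-page writes

    def _evict_if_full(self) -> None:
        if len(self.cache) >= self.capacity:
            victim, (ppn, dirty) = self.cache.popitem(last=False)
            if dirty:
                # A dirty entry must be written back before it is dropped.
                self.flash_table[victim] = ppn
                self.translation_writes += 1

    def lookup(self, lpn: int) -> Optional[int]:
        if lpn in self.cache:
            self.cache.move_to_end(lpn)  # hit: mark as most recently used
            return self.cache[lpn][0]
        if lpn not in self.flash_table:
            return None                  # never written (simplified check)
        # Miss: fetch the entry from flash, evicting the LRU entry if needed.
        self.translation_reads += 1
        self._evict_if_full()
        self.cache[lpn] = (self.flash_table[lpn], False)
        return self.cache[lpn][0]

    def update(self, lpn: int, ppn: int) -> None:
        if lpn in self.cache:
            self.cache.move_to_end(lpn)
        else:
            self._evict_if_full()
        self.cache[lpn] = (ppn, True)    # new mapping is dirty until written back
```

The two counters make the trade-off visible: a workload with good locality causes few translation reads and writes, while a cache that is too small for the working set pays an extra flash access on most requests.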
Abstract: Purpose: This paper reviews 474 publications and 512 patents on FTL from 1987 to 2020 in order to provide a comprehensive analysis for researchers in this field, as well as preliminary knowledge of FTL for interested researchers. Design/methodology/approach: First, the FTL algorithms are classified and their functions are introduced in detail. Second, the structure of the publications is analyzed in terms of their fundamental information and the output of the most productive countries/regions, institutions, and authors. Next, co-citation networks of institutions, authors, and papers, drawn with VOSviewer, show the relationships among them, and the most influential of them are analyzed further. Then, the characteristics of the patents are analyzed based on their basic information, their classification, and the most productive inventors. To identify research hotspots and trends in this field, a timeline review and keyword citation-burst detection are carried out and visualized with CiteSpace. Finally, based on the above analysis, the paper draws further conclusions and outlines the development trend of the field. Findings: Research on FTL algorithms remains the top priority for the future, and improving SSD performance in the era of big data is one of the research hotspots. Research limitations/implications: This paper presents a comprehensive bibliometric analysis of FTL, which helps researchers quickly grasp the hotspots in this area. Originality/value: This article describes the structural characteristics of the publications in this field and summarizes its research hotspots and trends in recent years, aiming to inspire new ideas for researchers.