期刊文献+

FlatLFS:一种面向海量小文件处理优化的轻量级文件系统 被引量:9

FlatLFS:a lightweight file system for optimizing the performance of accessing massive small files
下载PDF
导出
摘要 海量小文件的高效存储和访问是当前分布式文件系统面临的主要挑战之一。以GFS和HDFS为代表的分布式文件系统大多面向海量大文件的高效存储和访问设计,缺乏小文件处理的针对性优化,导致访问海量小文件时效率低下。针对分布式文件系统中海量小文件访问时的数据服务器优化问题,提出了一种采用扁平式数据存储方法的轻量级文件系统FlatLFS,取代传统文件系统对上层分布式文件系统提供数据存储和访问支持,提高了数据服务器处理小数据块时的I/O性能,从而提升了整个分布式文件系统的性能。实验表明,当数据块大小设定为1M时,FlatLFS的随机读性能分别比ext3、ext4、reiserfs高135%、112%和122%。 The storage and access of massive small files are one of the challenges in the design of DFS( Distributed file system). Most of the DFSs, such as GFS and HDFS, are designed for handling massive big files. The performance of DFSs decreases greatly when accessing massive small files without special optimization for small files. This research focuses on the optimizing of the performance of data server in handling massive small files, and presents a Flat Lightweight File System called FlatLFS in which the user data are managed flat in disks. FlatLFS is supposed to substitute the traditional file system when accessing user data for upper DFSs. With the improvement of the performance of small data block processing on data servers by FlatLFS, the performance of the whole DFSs is greatly improved. The effectiveness of FlatLFS is proved with intensive experiments: when the size of data block is 1M, the performance of random read of FlatLFS is 135%, 112% and 122% higher than ext 3 ,ext4 and reiserfs respectively.
出处 《国防科技大学学报》 EI CAS CSCD 北大核心 2013年第2期120-126,共7页 Journal of National University of Defense Technology
基金 国家核高基重大专项(2012zx01040001)
关键词 分布式文件系统 海量小文件 ext文件系统 云计算 distributed file system massive small files extended file system cloud computing
  • 相关文献

参考文献21

  • 1Ghemawat S, Gobioff H, Leung S. The Google file system [ C ]//19th Symposium on Operating Systems Principles, NY : IEEE, 2003 : 29 - 43.
  • 2The apache hadoop project, hadoop distributed file system [EB/OL]. (2012- 12-05) [2012 - 12 -20]. http:/! hadoop, apache, org/.
  • 3Srirams. Kosmos file system [ EB/OL ]. (2011 - 05 ) [ 2011 - 11 - 15 ]. http ://code. google, com/p/kosmosfs.
  • 4Moose file system[ EB/OL]. (2012 - 08 - 16 ) [ 2012 - 09 -15]. http://www, moosefs, org.
  • 5Beaver D, et al. Finding a needle in Haystack: Facebook "s photo storage [ C ] // 9th USENIX Symposium on Operating Systems Design and Implementation, October 4 - 6 Canada 2010.
  • 6Taobao file system[ CP/OL]. (2012 - 12 -04) [ 2012 - 12 - 21 ]. http ://code. taobao, org/p/tfs/src/.
  • 7Cloudera big data solution [ EB/OL ]. [ 2012 - 12 - 22 ]. http ://www. cloudera, com.
  • 8MapR big data solution[ EB/OL]. [ 2012 - 12 - 22 ]. http :// www. mapr. com.
  • 9McKusick M K, Quinlan S. GFS: Evolution on fast-forward [R/OL]. (2009 - 08 - 07) [2011 - 10 - 09]. http:/! queue, acm. org/detail, cfm? id = 1594206.
  • 10White T. The small files problem [ R/OL]. (2009 -02 -02) [ 2011 - 08 - 23 ]. http ://www. cloudera, com/blog/2009/ 02/the-small-files-problem/.

同被引文献87

引证文献9

二级引证文献165

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部