期刊文献+

基于HDFS的海量文件存储优化设计

下载PDF
导出
摘要 针对Hadoop分布式文件系统(Hadoop Distributed File System,HDFS)在存储海量数据时对小文件支持性能低下的问题,提出了基于HDFS的通用文件存储系统(Common File Storage-Hadoop Distributed File System,CFSHDFS)。该方案采用了分类处理的设计思想,区别对待大文件和小文件。对大文件不做任何处理,按照HDFS原有的流程完成既定的操作;对小文件的存取,经过缓存、预读、合并等机制的处理,从而在提高Namenode内存利用率和Datanode空间利用率的基础上,提高了小文件的读写性能。区别对待文件的处理以及对小文件的一系列的处理,对用户来说都是透明的,因此,该方案并不会增加用户使用HDFS的复杂性。
作者 李娜
出处 《电脑编程技巧与维护》 2016年第7期63-65,共3页 Computer Programming Skills & Maintenance
  • 相关文献

参考文献5

二级参考文献37

  • 1Beaver D, Kumar S, Li H C, et al. Finding a needle in haystack: facebook's photo storage[ C ]. In Proceedings of the 9th USENIXSymposium on Operating System Design and Implementation ( OS- DI'10), Vancouver, Canada, October 2010.
  • 2Leslie Lamport. The part-time parliament [ J ]. ACM Transactions on Computer Systems, 1998,16 (2) : 133-169.
  • 3Fay Chang, Jeffrey Dean, Sanjay Ghemawat, et al. Bigtable: a distributed storage system for structured data [ C ]. In Proceedings of the 7th USENIX Symposium on Operating System Design and Implementation ( OSDI'06 ) ,2006.
  • 4Patil S, Gibson G A, Ganger G R, et al. In search of an API for scalable file systems: under the table or above it? [ C ]. In USE- NIX Workshop on Hot Topics in Cloud Computing (HotCloud 2009),2009.
  • 5Sage A Wei, Scott A Brandt, Ethan L Miller, et al. Ceph: a scala- ble, high-performance distributed file system [ C ]. In Proceedings of 7^th Symposium on Operating Systems Design and Implementation ( OSDI 2006 ) ,2006.
  • 6Swapnil Patil, Garth Gibson. Scale and concurrency of GIGA + : file system directories with millions of files[ C]. In Proceedings of the 9^th USENIX Conference on File and Storage Technologies ( FAST 2011 ) ,2011.
  • 7Philip Ross Cams, Sam Lang, Robert Ross Kunkel, Thomas Lud- wig. Small file access in parallel file systems [ C ]. In Proceedings of International Symposium on Parallel and Distributed Processing Systems ( IPDPS 2009) ,2009.
  • 8Michael Kuhn, Julian Kunkel, Thomas Ludwig. Directory-based metadata optimizations for small files in PVFS [ C ]. In Proceedings of the 14^th International Euro-Par Conference on Parallel Processing ( EuroPar 2008 ) ,2008.
  • 9Xing Jing, Xiong Jin, Sun Ning-hui, et al. Adaptive and scalable metadata management to support a trillion files[ C]. In Proceedings of the Conference on High Performance Computing Networking, Storage, and Analysis ( SC 2009 ) ,2009.
  • 10Marshall Kirk McKusick, Sean Quinlan. GFS: evolution on fast- forward [ J ]. ACM Queue, 2009,7 ( 7 ) : 10 -20.

共引文献36

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部