Distributed storage and retrieval of massive small-file metadata
Abstract: Existing distributed file systems suffer from a metadata-processing performance bottleneck on the master node when handling massive numbers of small files. To address this, the paper proposes storing the metadata in distributed files and distributing it through metadata buffering and hash mapping. Metadata retrieval is implemented as a MapReduce parallel program; the paper points out the problems that arise in parallel retrieval and optimizes retrieval with a local bitmap index. The approach is then verified experimentally. The results show that the method achieves distributed storage and retrieval of massive metadata and avoids the single-point performance bottleneck on the master node that existing distributed file systems exhibit when processing massive small files.
Affiliation: Air Force Early Warning Academy (空军预警学院)
Source: Journal of Air Force Early Warning Academy (《空军预警学院学报》), 2014, No. 6, pp. 427-431 (5 pages)
Keywords: massive small files; metadata; distributed storage; parallel retrieval
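
The abstract names three mechanisms: metadata buffering, hash mapping to distribute metadata across files, and a local bitmap index to speed up parallel retrieval. The page does not give the paper's actual design, so the following is only a minimal single-process sketch of how such pieces might fit together. The class and function names, the CRC32 hash, the partition count, the buffer limit, and the record fields (`path`, `owner`, `size`) are all illustrative assumptions rather than the authors' implementation, and in-memory lists stand in for distributed files and MapReduce tasks.

```python
import zlib
from collections import defaultdict

NUM_PARTITIONS = 4   # assumed number of distributed metadata files
BUFFER_LIMIT = 3     # assumed number of records buffered before a flush


def partition_of(path: str) -> int:
    """Map a file path to a metadata partition with a stable hash (CRC32 here)."""
    return zlib.crc32(path.encode("utf-8")) % NUM_PARTITIONS


class MetadataStore:
    """Buffers metadata records, then appends them to hash-selected partitions."""

    def __init__(self):
        self.buffers = defaultdict(list)     # partition id -> pending records
        self.partitions = defaultdict(list)  # partition id -> flushed records

    def put(self, path, owner, size):
        p = partition_of(path)
        self.buffers[p].append({"path": path, "owner": owner, "size": size})
        if len(self.buffers[p]) >= BUFFER_LIMIT:
            self.flush(p)

    def flush(self, p):
        # Append the whole buffer at once, as a batched write would.
        self.partitions[p].extend(self.buffers[p])
        self.buffers[p].clear()

    def flush_all(self):
        for p in list(self.buffers):
            self.flush(p)


def build_local_bitmap(records, field):
    """One bitmap per distinct value; bit i is set when record i has that value."""
    bitmaps = defaultdict(int)
    for i, rec in enumerate(records):
        bitmaps[rec[field]] |= 1 << i
    return dict(bitmaps)


def build_indexes(store, field):
    """Build a separate (local) bitmap index for every partition."""
    return {p: build_local_bitmap(recs, field)
            for p, recs in store.partitions.items()}


def search_partition(records, bitmaps, value):
    """Map-side step: visit only the records whose bit is set for `value`."""
    bits = bitmaps.get(value, 0)
    hits, i = [], 0
    while bits:
        if bits & 1:
            hits.append(records[i]["path"])
        bits >>= 1
        i += 1
    return hits


def search(store, indexes, value):
    """Reduce-side step: merge the per-partition hits into one sorted list."""
    results = []
    for p, records in store.partitions.items():
        results.extend(search_partition(records, indexes[p], value))
    return sorted(results)
```

In this sketch each partition is searched independently, which is what would let a framework such as MapReduce assign one task per metadata file; the local index means a task inspects only matching record positions instead of scanning every record in its partition.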