Optimization of Storage and Access Performance for Massive Small Meteorological Files in the Ceph System (cited by: 3)
Abstract When Ceph handles massive numbers of small meteorological files, double writing of cluster data degrades storage performance. To address this problem, this paper proposes a method for optimizing the storage and access performance of massive small meteorological files in Ceph. The method analyzes historical file access logs to obtain the association probability between small meteorological files, and then applies a merging algorithm designed around that probability to merge associated small files before storing them in the Ceph cluster. When files are accessed, the correlation among the merged small files is measured by the utilization rate and the correlation rate of file blocks, and files are pre-read according to that correlation. This reduces interaction between the user and the cluster and improves small-file access efficiency. Experiments show that, compared with existing methods, the proposed method significantly improves both the storage efficiency and the access efficiency of massive small meteorological files in Ceph.
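The abstract does not give the paper's formulas, so the following is only a minimal sketch of the general idea of deriving association probabilities from an access log and grouping files for merging. The log format (a list of sessions, each a list of file names), the conditional-probability estimate, the `threshold` parameter, and the function names `association_probabilities` and `merge_groups` are all assumptions made for illustration, not the authors' algorithm. The pre-reading step based on block utilization and correlation rate is not covered here.

```python
from collections import Counter, defaultdict
from itertools import combinations


def association_probabilities(sessions):
    """Estimate P(b accessed | a accessed) from co-access counts.

    `sessions` is a hypothetical log format: a list of lists of file
    names that were accessed together. The paper's real log schema and
    probability definition are not stated in the abstract.
    """
    file_count = Counter()
    pair_count = Counter()
    for session in sessions:
        files = set(session)
        file_count.update(files)
        for a, b in combinations(sorted(files), 2):
            pair_count[(a, b)] += 1
            pair_count[(b, a)] += 1
    return {(a, b): n / file_count[a] for (a, b), n in pair_count.items()}


def merge_groups(files, probs, threshold=0.8):
    """Group files linked by an association probability >= threshold.

    Union-find is used so that groups are transitive: if a~b and b~c,
    all three land in one merge group. The threshold is an assumed
    tuning parameter.
    """
    parent = {f: f for f in files}

    def find(x):
        # Walk to the root, compressing the path along the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (a, b), p in probs.items():
        if p >= threshold:
            parent[find(a)] = find(b)

    groups = defaultdict(list)
    for f in sorted(files):
        groups[find(f)].append(f)
    return sorted(groups.values())
```

Each resulting group would then be written to the cluster as a single merged object instead of one object per small file; how the merged object is indexed is left open in this sketch.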
Authors LU Xiaoxia (陆小霞); WANG Yong (王勇); LEI Xiaochun (雷晓春) (School of Information and Communication, Guilin University of Electronic Technology, Guilin 541004, China; School of Computer and Information Security, Guilin University of Electronic Technology, Guilin 541004, China; Guangxi Cooperative Innovation Center of Cloud Computing and Big Data, Guilin University of Electronic Technology, Guilin 541004, China)
Source Journal of Guilin University of Electronic Technology (《桂林电子科技大学学报》), 2019, No. 1, pp. 61-66 (6 pages)
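Funding National Natural Science Foundation of China (61662018, 61661015); China Postdoctoral Science Foundation (2016M602922XB); Guangxi Cooperative Innovation Center of Cloud Computing and Big Data project (YDQ17001)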
Keywords Ceph distributed file system; small files; correlation-based merging; pre-reading