期刊文献+

面向分层混合存储架构的协同式突发缓冲技术 被引量:3

Research on optimization towards hybrid and hierarchy storage architecture
下载PDF
导出
摘要 科学计算产生和分析的数据规模日益增长,高性能计算机的存储系统在体系架构和软件管理方法上面临重大挑战。针对天河-2系统的新型分层混合存储架构,提出一种由应用程序耦合的协同式突发缓冲技术来有效利用其存储资源优势。该方法采用运行时动态耦合的方法,将临近计算任务的分布式高速存储资源聚合成为一个的巨大的协同式突发缓冲区,通过采用文件命名空间投影的映射方法组织全局数据视图,利用位置亲和和感知数据意图的方法来挖掘空间局部性和时间局部性,并利用应用并发度感知的策略优化数据移动效率。天河-2系统的测试结果表明,该方法能够有效优化多种典型应用场景,可获得高可扩展的突发并行输出带宽和稳定的持续并行输出带宽,可显著提升数据分析场景的输入/输出性能,适合应用于大规模超级计算机的存储系统。 Today's advancing modern science is generating and analyzing increasing scale of datasets and makes HPC storage system facing new challenges both on architecture and software approach. In order to exploit potential benefits of emerging hybrid and hierarchy storage architecture on Milky- 2 system,a I / O middleware approach named application coupled burst buffer,is introduced to make full use of the solid state disk based in-system storage resources. Application coupled burst buffer aggregates distributed in-system storage close to running tasks into single namespace during application runtime and manages it as cooperative persistent burst buffer tightly coupled with its host application. To take full advantage of cooperative burst buffer,application coupled burst buffer uses an unified shadow namespace to map application data into physical in-system storage based on its real namespace of the host application. Besides that,application coupled burst buffer organizes data with locality aware layout and leverages application intent based replacement policy to fully exploit spatial and temporal locality. Furthermore,application coupled burst buffer employs concurrency aware policies to optimize data movement between different storage tiers. Evaluations on Milky- 2 system show that application coupled burst buffer can improve the performance of typical data-intensive applications dramatically. It can achieve scalable burst I / O bandwidth and smooth sustained I / O bandwidth with high throughput solid state disk deployed and can be taken as an appropriate candidate for storage solution on emerging leadership supercomputer systems.
出处 《国防科技大学学报》 EI CAS CSCD 北大核心 2015年第1期47-52,共6页 Journal of National University of Defense Technology
基金 国家自然科学基金资助项目(61120106005) 国家"863"高技术研究发展计划基金资助项目(2012AA01A301)
关键词 超级计算机 存储架构 并行文件系统 突发缓冲区 supercomputer storage architecture parallel file system burst buffer
  • 相关文献

参考文献16

  • 1Braam P. Lustre file system [ EB/OL]. 2005 [ 2014 - O1 - 18 ]. http ://www. lustre, org.
  • 2Bent J, Gibson G, Grider G, et al. PLFS: a checkpoint filesystem for parallel applications [ C ]//Proceedings of the 2009 International Conference for High Performance Computing, Networking, Storage and Analysis, 2009 : 1 - 12.
  • 3Seelam S, Kerstens A, Teller P J. Throttling I/0 streams to accelerate file-IO performance [ C //Proceedings of Third International Conference, High Performance Computing and Communications, 2007 : 718 - 731.
  • 4Chen Y. Towards scalable I/O architecture for exascale systems[C]//Proceedings of the 2011 ACM International Workshop on Many Task Computing on Grids and Supercomputers, 2011 : 43 -48.
  • 5Xie B, Chase J, Dillow D, et al. Characterizing output bottlenecks in a supercomputer [ C //Proceedings of the 2012 International Conference for High Performance Computing, Networking, Storage and Analysis, 2012:1 - 11.
  • 6Lofstead J, Liu Q, et al. Managing variability in the i0 performance of petascale storage systems [ C ]//Proceedings of the 2010 International Conference for High Performance Computing, Networking, Storage and Analysis, 2010: 1- 12.
  • 7DiUow D A, Shipman G M, Oral S, et al. I/O congestion avoidance via routing and object placement[ C ]//Proceedings of Cray User Group Conference ( CUG), 2011.
  • 8Ghemawat S, Gobioff H, Lueng S, et al. The google file system [ C ]//Proceedings of the 9th ACM Symposium on Operating System Principles, 2003:29 - 43.
  • 9White T. Hadoop: the definitive guide [ M ]. Cambridge, 2009.
  • 10Iskra- K, Romein J W, Yoshii K, et al. ZOID: I/O- forwarding infrastructure for petascale architectures [ C ]// Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008:153 - 162.

同被引文献21

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部