期刊文献+

面向国家高性能计算环境的虚拟数据空间系统 被引量:6

Virtual data space system for national highperformance computing environment
下载PDF
导出
摘要 高性能计算环境是支撑国家科技创新、经济发展、国防建设的核心信息基础设施,世界高性能计算强国纷纷建设基于多超算中心资源的广域高性能计算环境。然而,高性能计算环境中资源种类繁多且地域分布广,无法有效发挥资源的聚合效应,难以满足大型应用对广域分布数据的统一管理和高效访问需求。为此,提出了一套可用于构建广域全局虚拟数据空间的完整技术体系,包括虚拟数据空间模型、跨域虚拟数据空间构建、广域环境中数据高效迁移、广域环境中存算协同调度、跨域高并发数据聚合处理等技术,并研发了一个可运行于国家高性能计算环境的虚拟数据空间系统,可有效支撑广域分散异构存储资源的统一高效访问,实现广域环境中分布数据的跨域共享和协同处理。目前,该软件系统已在国家高性能计算环境实验性部署,并验证了分子对接、全基因组关联分析、天气预报模式3类典型大型应用。验证结果表明,所研虚拟数据空间构建方法和系统可有效聚合广域分散的存储资源,满足大型应用的数据空间需求。 High-performance computing(HPC)environment is the core information infrastructure supporting national scientific and technological innovation,economic development and national defense construction.High-performance computing powers around the world have been building wide-area HPC environments based on multi-supercomputing center resources.However,in the high-performance computing environment,there are many kinds of resources and wide geographical distribution,which cannot effectively exert the aggregation effect of resources,and it is difficult to meet the requirements of large-scale applications for unified management and efficient access to wide-area distributed data.To this end,a complete set of technologies were proposed,which could be used to build wide-area global virtual data space,including virtual data space model,cross-domain virtual data space constructing,efficiently migrating data in a wide-area environment,co-scheduling of storage resources and computing job and cross-domain high concurrency data aggregation processing,etc.Based on the above,a virtual data space system has been developed for the national high-performance computing environment(NHPCE),which can effectively support the unified and efficient access to the wide area distributed heterogeneous storage resources,and the distributed data in the wide-area environment can be shared and cooperative processed in a cross-domain manner.At present,the system was experimental deployed in NHPCE and three typical large-scale applications,such as molecular docking,genome-wide association study and weather forecasting model,have been verified.The verification results show that the developed technology and software system can effectively aggregate the wide area distributed storage resources and meet the data space requirements of large-scale applications.
作者 秦广军 肖利民 张广艳 牛北方 陈志广 QIN Guangjun;XIAO Limin;ZHANG Guangyan;NIU Beifang;CHEN Zhiguang(Smart City College,Beijing Union University,Beijing 100101,China;School of Computer Science and Engineering,Beihang University,Beijing 100191,China;State Key Laboratory of Software Development Environment,Beijing 100191,China;Department of Computer Science and Technology,Tsinghua University,Beijing 100084,China;Computer Network Information Center,Chinese Academy of Sciences,Beijing 100190,China;University of Chinese Academy of Sciences,Beijing 100190,China;School of Computer Science and Engineering,Sun Yat-sen University,Guangzhou 510006,China)
出处 《大数据》 2021年第2期101-122,共22页 Big Data Research
基金 国家重点研发计划资助项目(No.2018YFB0203901)。
关键词 高性能计算环境 大型计算问题 虚拟数据空间 广域分布式存储 统一命名空间 high-performance computing environment large-scale computing problem virtual data space wide-area distributed storage unified namespace
  • 相关文献

参考文献3

二级参考文献13

  • 1Atul Adya,et al.FARSITE:Federated,available,and reliable storage for an incompletely trusted environment[C].The 5th Symp on Operating Systems Design and Implementation(OSDI'02),Boston,2002
  • 2F Bek,M F Kaashoek,D Karger,et al.Wide-area cooperative storage with CFS[C].The 18th ACM Symp on Operating Systems Principles(SOSP'01),Banff,2001
  • 3J Kubiatowicz,et al.OceanStore:An architecture for globalscale persistent storage[C].The 9th Int'l Conf on Architectural Support for Programming Languages and Operating Systems(ASPLOS IX),Canbfidge,2000
  • 4A Rowstron,P Druschel.Storage management and caching in PAST,a large-scale persistent peer-to-peer storage utility[C].The 18th ACM Symp on Operating Systems Principles(SOSP'01),Banff,2001
  • 5Landon P Cox,Christopher D Murray,Brian D Noble.Pastiche:Making backup cheap and easy[C].The 5th Symp on Operating Systems Design and Implementation(OSDI'02),Boston,2002
  • 6Athicha Muthitacharoen,Robert Morris,Thomer M Gil,et al.Ivy:A read/write peer-to-peer file system[C].The 5th Symp on Operating Systems Design and Implementation(OSDI'02),Boston,2002
  • 7Yasushi Saito,Christos Karamanolis,Magnus Karlsson,et al.Taming aggressive replication in the Pangaea wide-area file system[C].The 5th Symp on Operating Systems Design and Implementation(OSDI'02),Boston,2002
  • 8Jinfeng Hu,Ming Li,Haitao Dong,et al.PeerWindow:Looking outside for peers[OL].http://166.111.205.216/,2004
  • 9Jinfeng Hu,Chunhui Hong,Huanan Zhang,et al.Tourist:Utilizing heterogeneity to build a scalable,efficient,and adaptive DHT routing protocol[OL].http://166.111.205.216/,2004
  • 10Jinfeng Hu,Ming Li,Weimin Zheng,et al.SmartBoa:Constructing P2P overlay network in the heterogeneous Internet using irregular routing tables[C].The 3rd Int'l Workshop on Peer-to-Peer Systems(IPTPS'04),San Diego,2004

共引文献17

同被引文献56

引证文献6

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部