Abstract
As supercomputing advances, large-scale numerical computing and big data analysis place growing demands on the capacity and number of computing facilities. Constrained by cost and power consumption, a single supercomputing center cannot expand indefinitely, and interconnecting geographically separated supercomputing centers offers a new way to address this problem. Based on the 10 Gbps DWDM fiber-optic network between Jinan and Qingdao, this paper builds a high-speed interconnected computing cluster spanning 500 km between the two cities and achieves unified scheduling of the computing resources at both sites. Using the ocean component model ROMS and the atmospheric component model WRF of the air-sea coupled model COAWST, multiple sets of comparative experiments are conducted with different resource scales and configurations, both within a single site and across the two sites. The results show that cooperative computation of a coupled model over a long-distance high-speed interconnect is feasible, without a substantial drop in performance. WRF and ROMS produce the same simulation results for the same test case whether run within the Jinan cluster or across the Jinan-Qingdao cluster. When WRF runs on the Jinan cluster and ROMS runs on the Qingdao cluster, the running time is 5% longer than when both run on the Jinan cluster. When a single component model is split across the Jinan and Qingdao clusters, communication takes considerably longer. The long-distance high-speed interconnect is therefore better suited to coupled models that are built from separate component models and have low communication requirements.
Authors
HAN Qiqi (韩琦琦)
LIU Xin (刘鑫)
HAN Qiqi; LIU Xin (Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan 250014, China; Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan 250014, China)
Source
Computer Science (《计算机科学》)
CSCD
PKU Core Journal (北大核心)
2023, Issue S02, pp. 833-837 (5 pages)
Funding
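Key Research and Development Program of Shandong Province (2022CXGC020106)
Open Fund of the Key Laboratory of Marine Environmental Science and Numerical Modeling, Ministry of Natural Resources (2021-YB-02)
Major Innovation Project of the Science, Education and Industry Integration Pilot Program, Qilu University of Technology (2022JBZ01-01)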
Keywords
High-speed networks
Supercomputing interconnection
Air-sea coupling
COAWST model
Distributed computing