摘要
区域气候模式CWRF(Climate-Weather Research and Forecasting model)是国家气候中心区域气候预测系统的重要组成部分,也是系统最耗时的程序。高性能计算是提高CWRF数值预报计算性能的关键技术,开展CWRF模式在国产神威众核架构上的移植和优化,提高模式的模拟效率,对模式的扩展、开发能力和可持续发展具有重要意义。基于国产众核SW26010处理器,完成了CWRF区域气候模式的移植、性能分析和深入性能优化,采用访存优化、Cache命中率优化及众核加速优化等方法,对CWRF模式动力过程、物理过程和I/O过程计算代码进行重构及众核加速。结果表明:优化技术可使CWRF动力过程平均加速2倍,最高加速6.4倍,物理过程平均加速1.7倍,最高加速5.4倍,I/O过程加速1.2倍,程序整体最高加速1.4倍,计算误差在合理范围内。
CWRF(Climate-Weather Research and Forecasting model)is a component of the regional climate prediction system built in the National Climate Center,and consumes the largest proportion of time.High performance computing is a key technology used to improve the compactional performance of CWRF.Carrying out the configuration and optimization of the CWRF model based on the domestic Sunway many-core system,improving the simulation efficiency are of great significance for the speedup,as well as the development capability and sustainable development of the model.This paper completed the configuration and performance evaluation of CWRF based on the SW26010 many-core architecture.Memory access optimization,Cache hit rate optimization,many-core acceleration models are introduced to speedup CWRF relating to the dynamic-core process,physical process and I/O process.The results show that the average speed of the dynamic process is 2 times and the highest speed is 6.4 times,the average speed of the physical process is 1.7 times and the highest speed is 5.4 times,the I/O process speeds up 1.2 times,the overall program speeds up to 1.4 times,and the calculation error is reasonable.
作者
吕小敬
刘钊
蔡蕙伊
李锦薇
Lv Xiaojing;Liu Zhao;Cai Huiyi;Li Jinwei(China Ship Scientific Research Center,Wuxi 214000,China;National Supercomputing Center in Wuxi,Wuxi 214000,China;Tsinghua University,Beijing 100080,China)
出处
《电子技术应用》
2022年第1期31-38,共8页
Application of Electronic Technique
基金
国家重点研发计划资助(2020YFB0204800)
国家自然科学基金项目(51877115)
无锡市太湖人才计划创新领军人才项目(融合人工智能技术的新一代精细化区域气候观测系统研制)。