The accuracy of typhoon forecasts plays an important role in the prediction of storm surges.The uncertainty of a typhoon’s intensity and track means it is necessary to use an ensemble model to predict typhoon storm s...The accuracy of typhoon forecasts plays an important role in the prediction of storm surges.The uncertainty of a typhoon’s intensity and track means it is necessary to use an ensemble model to predict typhoon storm surges.A hydrodynamic model,which is operational at the National Marine Environmental Forecasting Center,is applied to conduct surge simulations for South China coastal areas using the best track data with parametric wind and pressure models.The results agree well with tidal gauge observations.To improve the calculation efficiency,the hydrodynamic model is modified using CUDA Fortran.The calculation results are almost the same as those from the original model,but the calculation time is reduced by more than 99%.A total of 150 typhoon cases are generated by combining 50 typhoon tracks from the European Centre for Medium-Range Weather Forecasts with three possible typhoon intensity forecasts.The surge ensembles are computed by the improved hydrodynamic model.Based on the simulated storm surges for the different typhoon cases,ensemble and probability forecast products can be provided.The mean ensemble results and probability forecast products are shown to agree well with the observed storm surge caused by Typhoon Mangkhut.The improved model is highly suitable for ensemble numerical forecasts,providing better forecast products for decision-making,and can be easily implemented to run on regular workstations.展开更多
针对三维直流电法正演模拟中大型稀疏线性方程组求解,在GPU(Graphic Process Unit)集群上实现了并行预处理共轭梯度(conjugate gradient,CG)算法。矩阵预处理使用可并行的对称逐次超松弛近似逆预处理(symmetric successive over relaxat...针对三维直流电法正演模拟中大型稀疏线性方程组求解,在GPU(Graphic Process Unit)集群上实现了并行预处理共轭梯度(conjugate gradient,CG)算法。矩阵预处理使用可并行的对称逐次超松弛近似逆预处理(symmetric successive over relaxation approximate inverse preconditioner,SSORAI),与传统的对称逐次超松弛预处理(symmetric successive over relaxation preconditioner,SSOR)相比,避免了串行的三角矩阵回代求解过程,增加了并行性。在GPU集群通信上,使用支持跨节点GPU-GPU直接通信的MVAPICH2,省去了GPU间通信时GPU到CPU的数据中转过程。为了提高数据局部性和重复性,使用RCM算法(reverse Cuthill-Mckee algorithm)对预处理后的线性方程组进行带宽缩减,并利用GPU的计算通信重叠,极大地缩减了计算时间。实验结果表明,这里提出的方法在GPU集群上有很好的可扩展性。同时,程序基于CUDA FORTRAN语言实现,可以容易地与现有FORTRAN程序相结合,提高程序的性能。展开更多
Personal desktop platform with teraflops peak performance of thousands of cores is realized at the price of conventional workstations using the programmable graphics processing units(GPUs).A GPU-based parallel Euler/N...Personal desktop platform with teraflops peak performance of thousands of cores is realized at the price of conventional workstations using the programmable graphics processing units(GPUs).A GPU-based parallel Euler/Navier-Stokes solver is developed for 2-D compressible flows by using NVIDIA′s Compute Unified Device Architecture(CUDA)programming model in CUDA Fortran programming language.The techniques of implementation of CUDA kernels,double-layered thread hierarchy and variety memory hierarchy are presented to form the GPU-based algorithm of Euler/Navier-Stokes equations.The resulting parallel solver is validated by a set of typical test flow cases.The numerical results show that dozens of times speedup relative to a serial CPU implementation can be achieved using a single GPU desktop platform,which demonstrates that a GPU desktop can serve as a costeffective parallel computing platform to accelerate computational fluid dynamics(CFD)simulations substantially.展开更多
基金The National Key Research and Development Program of China under contract Nos 2016YFC14015032018YFC140066 and 2018YFC1407001。
文摘The accuracy of typhoon forecasts plays an important role in the prediction of storm surges.The uncertainty of a typhoon’s intensity and track means it is necessary to use an ensemble model to predict typhoon storm surges.A hydrodynamic model,which is operational at the National Marine Environmental Forecasting Center,is applied to conduct surge simulations for South China coastal areas using the best track data with parametric wind and pressure models.The results agree well with tidal gauge observations.To improve the calculation efficiency,the hydrodynamic model is modified using CUDA Fortran.The calculation results are almost the same as those from the original model,but the calculation time is reduced by more than 99%.A total of 150 typhoon cases are generated by combining 50 typhoon tracks from the European Centre for Medium-Range Weather Forecasts with three possible typhoon intensity forecasts.The surge ensembles are computed by the improved hydrodynamic model.Based on the simulated storm surges for the different typhoon cases,ensemble and probability forecast products can be provided.The mean ensemble results and probability forecast products are shown to agree well with the observed storm surge caused by Typhoon Mangkhut.The improved model is highly suitable for ensemble numerical forecasts,providing better forecast products for decision-making,and can be easily implemented to run on regular workstations.
基金supported by the National Natural Science Foundation of China (No.11172134)the Funding of Jiangsu Innovation Program for Graduate Education (No.CXLX13_132)
文摘Personal desktop platform with teraflops peak performance of thousands of cores is realized at the price of conventional workstations using the programmable graphics processing units(GPUs).A GPU-based parallel Euler/Navier-Stokes solver is developed for 2-D compressible flows by using NVIDIA′s Compute Unified Device Architecture(CUDA)programming model in CUDA Fortran programming language.The techniques of implementation of CUDA kernels,double-layered thread hierarchy and variety memory hierarchy are presented to form the GPU-based algorithm of Euler/Navier-Stokes equations.The resulting parallel solver is validated by a set of typical test flow cases.The numerical results show that dozens of times speedup relative to a serial CPU implementation can be achieved using a single GPU desktop platform,which demonstrates that a GPU desktop can serve as a costeffective parallel computing platform to accelerate computational fluid dynamics(CFD)simulations substantially.