An efficient MPI/OpenMP hybrid parallel Radial Basis Function (RBF) strategy for both continuous and discontinuous large-scale mesh deformation is proposed to reduce the computational cost and memory consumption.Unlik...An efficient MPI/OpenMP hybrid parallel Radial Basis Function (RBF) strategy for both continuous and discontinuous large-scale mesh deformation is proposed to reduce the computational cost and memory consumption.Unlike the conventional parallel methods in which all processors use the same surface displacement and implement the same operation,the present method employs different surface points sets and influence radius for each volume point movement,accompanied with efficient geometry searching strategy.The deformed surface points,also called Control Points (CPs),are stored in each processor.The displacement of spatial points is interpolated by using only 20-50 nearest control points,and the local influence radius is set to 5-20 times the maximum displacement of control points.To shorten the searching time for the nearest control point clouds,an Alternating Digital Tree (ADT) algorithm for 3D complex geometry is designed based on an iterative bisection technique.Besides,an MPI/OpenMP hybrid parallel approach is developed to reduce the memory cost in each High-Performance Computing (HPC) node for large-scale applications.Three 3D cases,including the ONERA-M6 wing and a commercial transport airplane standard model with up to 2.5 billion hybrid elements,are used to test the present mesh deformation method.The robustness and high parallel efficiency are demonstrated by a wing deflection case with a maximum bending angle of 450 and more than 80% parallel efficiency with 1024 MPI processors.In addition,the availability for both continuous and discontinuous surface deformation is verified by interpolating the projecting displacement with opposite directions surface points to the spatial points.展开更多
A wide variety of algorithms have been developed to monitor aerosol burden from satellite images. Still, few solutions currently allow for real-time and efficient retrieval of aerosol optical thickness (AOT), mainly...A wide variety of algorithms have been developed to monitor aerosol burden from satellite images. Still, few solutions currently allow for real-time and efficient retrieval of aerosol optical thickness (AOT), mainly due to the extremely large volume of computation necessary for the numeric solution of atmospheric radiative transfer equations. Taking into account the efforts to exploit the SYNergy of Terra and Aqua Modis (SYNTAM, an AOT retrieval algorithm), we present in this paper a novel method to retrieve AOT from Moderate Resolution Imaging Spectroradiometer (MODIS) satellite images, in which the strategy of block partition and collective communication was taken, thereby maximizing load balance and reducing the overhead time during inter-processor communication. Experiments were carried out to retrieve AOT at 0.44, 0.55, and 0.67μm of MODIS/Terra and MODIS/Aqua data, using the parallel SYNTAM algorithm in the IBM System Cluster 1600 deployed at China Meteorological Administration (CMA). Results showed that parallel implementation can greatly reduce computation time, and thus ensure high parallel efficiency. AOT derived by parallel algorithm was validated against measurements from ground-based sun-photometers; in all cases, the relative error range was within 20%, which demonstrated that the parallel algorithm was suitable for applications such as air quality monitoring and climate modeling.展开更多
Finite difference method is an important methodology in the approximation of waves.In this paper,we will study two implicit finite difference schemes for the simulation of waves.They are the weighted alternating direc...Finite difference method is an important methodology in the approximation of waves.In this paper,we will study two implicit finite difference schemes for the simulation of waves.They are the weighted alternating direction implicit(ADI)scheme and the locally one-dimensional(LOD)scheme.The approximation errors,stability conditions,and dispersion relations for both schemes are investigated.Our analysis shows that the LOD implicit scheme has less dispersion error than that of the ADI scheme.Moreover,the unconditional stability for both schemes with arbitrary spatial accuracy is established for the first time.In order to improve computational efficiency,numerical algorithms based on message passing interface(MPI)are implemented.Numerical examples of wave propagation in a three-layer model and a standard complex model are presented.Our analysis and comparisons show that both ADI and LOD schemes are able to efficiently and accurately simulate wave propagation in complex media.展开更多
基金supported by the National Key Research and Development Program of China (No.2016YFB0200701)the National Natural Science Foundation of China (Nos. 11532016 and 91530325)
文摘An efficient MPI/OpenMP hybrid parallel Radial Basis Function (RBF) strategy for both continuous and discontinuous large-scale mesh deformation is proposed to reduce the computational cost and memory consumption.Unlike the conventional parallel methods in which all processors use the same surface displacement and implement the same operation,the present method employs different surface points sets and influence radius for each volume point movement,accompanied with efficient geometry searching strategy.The deformed surface points,also called Control Points (CPs),are stored in each processor.The displacement of spatial points is interpolated by using only 20-50 nearest control points,and the local influence radius is set to 5-20 times the maximum displacement of control points.To shorten the searching time for the nearest control point clouds,an Alternating Digital Tree (ADT) algorithm for 3D complex geometry is designed based on an iterative bisection technique.Besides,an MPI/OpenMP hybrid parallel approach is developed to reduce the memory cost in each High-Performance Computing (HPC) node for large-scale applications.Three 3D cases,including the ONERA-M6 wing and a commercial transport airplane standard model with up to 2.5 billion hybrid elements,are used to test the present mesh deformation method.The robustness and high parallel efficiency are demonstrated by a wing deflection case with a maximum bending angle of 450 and more than 80% parallel efficiency with 1024 MPI processors.In addition,the availability for both continuous and discontinuous surface deformation is verified by interpolating the projecting displacement with opposite directions surface points to the spatial points.
基金supported partly by the Ministry of Science and Technology of the People’s Republic of China (Grant Nos.2007CB714407, and 2008ZX10004012)the Special Funds for Basic Research in CAMS of CMA (Grant No. 2007Y001)State Key Laboratory of Remote Sensing Sciences (Grant No.07S00502CX)
文摘A wide variety of algorithms have been developed to monitor aerosol burden from satellite images. Still, few solutions currently allow for real-time and efficient retrieval of aerosol optical thickness (AOT), mainly due to the extremely large volume of computation necessary for the numeric solution of atmospheric radiative transfer equations. Taking into account the efforts to exploit the SYNergy of Terra and Aqua Modis (SYNTAM, an AOT retrieval algorithm), we present in this paper a novel method to retrieve AOT from Moderate Resolution Imaging Spectroradiometer (MODIS) satellite images, in which the strategy of block partition and collective communication was taken, thereby maximizing load balance and reducing the overhead time during inter-processor communication. Experiments were carried out to retrieve AOT at 0.44, 0.55, and 0.67μm of MODIS/Terra and MODIS/Aqua data, using the parallel SYNTAM algorithm in the IBM System Cluster 1600 deployed at China Meteorological Administration (CMA). Results showed that parallel implementation can greatly reduce computation time, and thus ensure high parallel efficiency. AOT derived by parallel algorithm was validated against measurements from ground-based sun-photometers; in all cases, the relative error range was within 20%, which demonstrated that the parallel algorithm was suitable for applications such as air quality monitoring and climate modeling.
基金supported by the State Key Project(2010CB731505)National Key Foundation Project(10431030)Director Foundation Project of Laboratory of Scientific and Engineering Computing(LSEC).
文摘Finite difference method is an important methodology in the approximation of waves.In this paper,we will study two implicit finite difference schemes for the simulation of waves.They are the weighted alternating direction implicit(ADI)scheme and the locally one-dimensional(LOD)scheme.The approximation errors,stability conditions,and dispersion relations for both schemes are investigated.Our analysis shows that the LOD implicit scheme has less dispersion error than that of the ADI scheme.Moreover,the unconditional stability for both schemes with arbitrary spatial accuracy is established for the first time.In order to improve computational efficiency,numerical algorithms based on message passing interface(MPI)are implemented.Numerical examples of wave propagation in a three-layer model and a standard complex model are presented.Our analysis and comparisons show that both ADI and LOD schemes are able to efficiently and accurately simulate wave propagation in complex media.