Traditional two-dimensional(2D) complex resistivity forward modeling is based on Poisson's equation but spectral induced polarization(SIP) data are the coproducts of the induced polarization(IP) and the electro...Traditional two-dimensional(2D) complex resistivity forward modeling is based on Poisson's equation but spectral induced polarization(SIP) data are the coproducts of the induced polarization(IP) and the electromagnetic induction(EMI) effects.This is especially true under high frequencies,where the EMI effect can exceed the IP effect.2D inversion that only considers the IP effect reduces the reliability of the inversion data.In this paper,we derive differential equations using Maxwell's equations.With the introduction of the Cole-Cole model,we use the finite-element method to conduct2 D SIP forward modeling that considers the EMI and IP effects simultaneously.The data-space Occam method,in which different constraints to the model smoothness and parametric boundaries are introduced,is then used to simultaneously obtain the four parameters of the Cole-Cole model using multi-array electric field data.This approach not only improves the stability of the inversion but also significantly reduces the solution ambiguity.To improve the computational efficiency,message passing interface programming was used to accelerate the 2D SIP forward modeling and inversion.Synthetic datasets were tested using both serial and parallel algorithms,and the tests suggest that the proposed parallel algorithm is robust and efficient.展开更多
An efficient MPI/OpenMP hybrid parallel Radial Basis Function (RBF) strategy for both continuous and discontinuous large-scale mesh deformation is proposed to reduce the computational cost and memory consumption.Unlik...An efficient MPI/OpenMP hybrid parallel Radial Basis Function (RBF) strategy for both continuous and discontinuous large-scale mesh deformation is proposed to reduce the computational cost and memory consumption.Unlike the conventional parallel methods in which all processors use the same surface displacement and implement the same operation,the present method employs different surface points sets and influence radius for each volume point movement,accompanied with efficient geometry searching strategy.The deformed surface points,also called Control Points (CPs),are stored in each processor.The displacement of spatial points is interpolated by using only 20-50 nearest control points,and the local influence radius is set to 5-20 times the maximum displacement of control points.To shorten the searching time for the nearest control point clouds,an Alternating Digital Tree (ADT) algorithm for 3D complex geometry is designed based on an iterative bisection technique.Besides,an MPI/OpenMP hybrid parallel approach is developed to reduce the memory cost in each High-Performance Computing (HPC) node for large-scale applications.Three 3D cases,including the ONERA-M6 wing and a commercial transport airplane standard model with up to 2.5 billion hybrid elements,are used to test the present mesh deformation method.The robustness and high parallel efficiency are demonstrated by a wing deflection case with a maximum bending angle of 450 and more than 80% parallel efficiency with 1024 MPI processors.In addition,the availability for both continuous and discontinuous surface deformation is verified by interpolating the projecting displacement with opposite directions surface points to the spatial points.展开更多
To achieve high parallel efficiency for the global MASNUM surface wave model, the algorithm of an irregular quasirectangular domain decomposition and related serializing of calculating points and data exchanging schem...To achieve high parallel efficiency for the global MASNUM surface wave model, the algorithm of an irregular quasirectangular domain decomposition and related serializing of calculating points and data exchanging schemes are developed and conducted, based on the environment of Message Passing Interface(MPI). The new parallel version of the surface wave model is tested for parallel computing on the platform of the Sunway BlueLight supercomputer in the National Supercomputing Center in Jinan. The testing involves four horizontal resolutions, which are 1°×1°,(1/2)°×(1/2)°,(1/4)°×(1/4)°, and(1/8)°×(1/8)°. These tests are performed without data Input/Output(IO) and the maximum amount of processors used in these tests reaches to 131072. The testing results show that the computing speeds of the model with different resolutions are all increased with the increasing of numbers of processors. When the number of processors is four times that of the base processor number, the parallel efficiencies of all resolutions are greater than 80%. When the number of processors is eight times that of the base processor number, the parallel efficiency of tests with resolutions of 1°×1°,(1/2)°×(1/2)° and(1/4)°×(1/4)° is greater than 80%, and it is 62% for the test with a resolution of(1/8)°×(1/8)° using 131072 processors, which is the nearly all processors of Sunway BlueLight. When the processor's number is 24 times that of the base processor number, the parallel efficiencies for tests with resolutions of 1°×1°,(1/2)°×(1/2)°, and(1/4)°×(1/4)° are 72%, 62%, and 38%, respectively. The speedup and parallel efficiency indicate that the irregular quasi-rectangular domain decomposition and serialization schemes lead to high parallel efficiency and good scalability for a global numerical wave model.展开更多
A wide variety of algorithms have been developed to monitor aerosol burden from satellite images. Still, few solutions currently allow for real-time and efficient retrieval of aerosol optical thickness (AOT), mainly...A wide variety of algorithms have been developed to monitor aerosol burden from satellite images. Still, few solutions currently allow for real-time and efficient retrieval of aerosol optical thickness (AOT), mainly due to the extremely large volume of computation necessary for the numeric solution of atmospheric radiative transfer equations. Taking into account the efforts to exploit the SYNergy of Terra and Aqua Modis (SYNTAM, an AOT retrieval algorithm), we present in this paper a novel method to retrieve AOT from Moderate Resolution Imaging Spectroradiometer (MODIS) satellite images, in which the strategy of block partition and collective communication was taken, thereby maximizing load balance and reducing the overhead time during inter-processor communication. Experiments were carried out to retrieve AOT at 0.44, 0.55, and 0.67μm of MODIS/Terra and MODIS/Aqua data, using the parallel SYNTAM algorithm in the IBM System Cluster 1600 deployed at China Meteorological Administration (CMA). Results showed that parallel implementation can greatly reduce computation time, and thus ensure high parallel efficiency. AOT derived by parallel algorithm was validated against measurements from ground-based sun-photometers; in all cases, the relative error range was within 20%, which demonstrated that the parallel algorithm was suitable for applications such as air quality monitoring and climate modeling.展开更多
Finite difference method is an important methodology in the approximation of waves.In this paper,we will study two implicit finite difference schemes for the simulation of waves.They are the weighted alternating direc...Finite difference method is an important methodology in the approximation of waves.In this paper,we will study two implicit finite difference schemes for the simulation of waves.They are the weighted alternating direction implicit(ADI)scheme and the locally one-dimensional(LOD)scheme.The approximation errors,stability conditions,and dispersion relations for both schemes are investigated.Our analysis shows that the LOD implicit scheme has less dispersion error than that of the ADI scheme.Moreover,the unconditional stability for both schemes with arbitrary spatial accuracy is established for the first time.In order to improve computational efficiency,numerical algorithms based on message passing interface(MPI)are implemented.Numerical examples of wave propagation in a three-layer model and a standard complex model are presented.Our analysis and comparisons show that both ADI and LOD schemes are able to efficiently and accurately simulate wave propagation in complex media.展开更多
基金jointly sponsored by the National Natural Science Foundation of China(Grant No.41374078)the Geological Survey Projects of the Ministry of Land and Resources of China(Grant Nos.12120113086100 and 12120113101300)Beijing Higher Education Young Elite Teacher Project
文摘Traditional two-dimensional(2D) complex resistivity forward modeling is based on Poisson's equation but spectral induced polarization(SIP) data are the coproducts of the induced polarization(IP) and the electromagnetic induction(EMI) effects.This is especially true under high frequencies,where the EMI effect can exceed the IP effect.2D inversion that only considers the IP effect reduces the reliability of the inversion data.In this paper,we derive differential equations using Maxwell's equations.With the introduction of the Cole-Cole model,we use the finite-element method to conduct2 D SIP forward modeling that considers the EMI and IP effects simultaneously.The data-space Occam method,in which different constraints to the model smoothness and parametric boundaries are introduced,is then used to simultaneously obtain the four parameters of the Cole-Cole model using multi-array electric field data.This approach not only improves the stability of the inversion but also significantly reduces the solution ambiguity.To improve the computational efficiency,message passing interface programming was used to accelerate the 2D SIP forward modeling and inversion.Synthetic datasets were tested using both serial and parallel algorithms,and the tests suggest that the proposed parallel algorithm is robust and efficient.
基金supported by the National Key Research and Development Program of China (No.2016YFB0200701)the National Natural Science Foundation of China (Nos. 11532016 and 91530325)
文摘An efficient MPI/OpenMP hybrid parallel Radial Basis Function (RBF) strategy for both continuous and discontinuous large-scale mesh deformation is proposed to reduce the computational cost and memory consumption.Unlike the conventional parallel methods in which all processors use the same surface displacement and implement the same operation,the present method employs different surface points sets and influence radius for each volume point movement,accompanied with efficient geometry searching strategy.The deformed surface points,also called Control Points (CPs),are stored in each processor.The displacement of spatial points is interpolated by using only 20-50 nearest control points,and the local influence radius is set to 5-20 times the maximum displacement of control points.To shorten the searching time for the nearest control point clouds,an Alternating Digital Tree (ADT) algorithm for 3D complex geometry is designed based on an iterative bisection technique.Besides,an MPI/OpenMP hybrid parallel approach is developed to reduce the memory cost in each High-Performance Computing (HPC) node for large-scale applications.Three 3D cases,including the ONERA-M6 wing and a commercial transport airplane standard model with up to 2.5 billion hybrid elements,are used to test the present mesh deformation method.The robustness and high parallel efficiency are demonstrated by a wing deflection case with a maximum bending angle of 450 and more than 80% parallel efficiency with 1024 MPI processors.In addition,the availability for both continuous and discontinuous surface deformation is verified by interpolating the projecting displacement with opposite directions surface points to the spatial points.
基金supported by National Basic Research Program of China (Grant Nos. 2010CB950300, 2010CB950500)Public Science and Technology Research Funds Projects of Ocean (Grant No. 201105019)+1 种基金Key Supercomputing Science-Technology Project of Shandong Province of China (Grant No. 2011YD01107)Scientific Research Foundation of the First Institute of Oceanography, State Oceanic Administration (Grant No. GY02-2010G22)
文摘To achieve high parallel efficiency for the global MASNUM surface wave model, the algorithm of an irregular quasirectangular domain decomposition and related serializing of calculating points and data exchanging schemes are developed and conducted, based on the environment of Message Passing Interface(MPI). The new parallel version of the surface wave model is tested for parallel computing on the platform of the Sunway BlueLight supercomputer in the National Supercomputing Center in Jinan. The testing involves four horizontal resolutions, which are 1°×1°,(1/2)°×(1/2)°,(1/4)°×(1/4)°, and(1/8)°×(1/8)°. These tests are performed without data Input/Output(IO) and the maximum amount of processors used in these tests reaches to 131072. The testing results show that the computing speeds of the model with different resolutions are all increased with the increasing of numbers of processors. When the number of processors is four times that of the base processor number, the parallel efficiencies of all resolutions are greater than 80%. When the number of processors is eight times that of the base processor number, the parallel efficiency of tests with resolutions of 1°×1°,(1/2)°×(1/2)° and(1/4)°×(1/4)° is greater than 80%, and it is 62% for the test with a resolution of(1/8)°×(1/8)° using 131072 processors, which is the nearly all processors of Sunway BlueLight. When the processor's number is 24 times that of the base processor number, the parallel efficiencies for tests with resolutions of 1°×1°,(1/2)°×(1/2)°, and(1/4)°×(1/4)° are 72%, 62%, and 38%, respectively. The speedup and parallel efficiency indicate that the irregular quasi-rectangular domain decomposition and serialization schemes lead to high parallel efficiency and good scalability for a global numerical wave model.
基金supported partly by the Ministry of Science and Technology of the People’s Republic of China (Grant Nos.2007CB714407, and 2008ZX10004012)the Special Funds for Basic Research in CAMS of CMA (Grant No. 2007Y001)State Key Laboratory of Remote Sensing Sciences (Grant No.07S00502CX)
文摘A wide variety of algorithms have been developed to monitor aerosol burden from satellite images. Still, few solutions currently allow for real-time and efficient retrieval of aerosol optical thickness (AOT), mainly due to the extremely large volume of computation necessary for the numeric solution of atmospheric radiative transfer equations. Taking into account the efforts to exploit the SYNergy of Terra and Aqua Modis (SYNTAM, an AOT retrieval algorithm), we present in this paper a novel method to retrieve AOT from Moderate Resolution Imaging Spectroradiometer (MODIS) satellite images, in which the strategy of block partition and collective communication was taken, thereby maximizing load balance and reducing the overhead time during inter-processor communication. Experiments were carried out to retrieve AOT at 0.44, 0.55, and 0.67μm of MODIS/Terra and MODIS/Aqua data, using the parallel SYNTAM algorithm in the IBM System Cluster 1600 deployed at China Meteorological Administration (CMA). Results showed that parallel implementation can greatly reduce computation time, and thus ensure high parallel efficiency. AOT derived by parallel algorithm was validated against measurements from ground-based sun-photometers; in all cases, the relative error range was within 20%, which demonstrated that the parallel algorithm was suitable for applications such as air quality monitoring and climate modeling.
基金supported by the State Key Project(2010CB731505)National Key Foundation Project(10431030)Director Foundation Project of Laboratory of Scientific and Engineering Computing(LSEC).
文摘Finite difference method is an important methodology in the approximation of waves.In this paper,we will study two implicit finite difference schemes for the simulation of waves.They are the weighted alternating direction implicit(ADI)scheme and the locally one-dimensional(LOD)scheme.The approximation errors,stability conditions,and dispersion relations for both schemes are investigated.Our analysis shows that the LOD implicit scheme has less dispersion error than that of the ADI scheme.Moreover,the unconditional stability for both schemes with arbitrary spatial accuracy is established for the first time.In order to improve computational efficiency,numerical algorithms based on message passing interface(MPI)are implemented.Numerical examples of wave propagation in a three-layer model and a standard complex model are presented.Our analysis and comparisons show that both ADI and LOD schemes are able to efficiently and accurately simulate wave propagation in complex media.