The finite element method is a key player in computational electromag-netics for designing RF(Radio Frequency)components such as waveguides.The frequency-domain analysis is fundamental to identify the characteristics ...The finite element method is a key player in computational electromag-netics for designing RF(Radio Frequency)components such as waveguides.The frequency-domain analysis is fundamental to identify the characteristics of the components.For the conventional frequency-domain electromagnetic analysis using FEM(Finite Element Method),the system matrix is complex-numbered as well as indefinite.The iterative solvers can be faster than the direct solver when the solver convergence is guaranteed and done in a few steps.However,such complex-numbered and indefinite systems are hard to exploit the merit of the iterative solver.It is also hard to benefit from matrix factorization techniques due to varying system matrix parts according to frequency.Overall,it is hard to adopt conventional iterative solvers even though the system matrix is sparse.A new parallel iterative FEM solver for frequency domain analysis is implemented for inhomogeneous waveguide structures in this paper.In this implementation,the previous solution of the iterative solver of Matlab(Matrix Laboratory)employ-ing the preconditioner is used for the initial guess for the next step’s solution process.The overlapped parallel stage using Matlab’s Parallel Computing Toolbox is also proposed to alleviate the cold starting,which ruins the convergence of early steps in each parallel stage.Numerical experiments based on waveguide structures have demonstrated the accuracy and efficiency of the proposed scheme.展开更多
Higher-order displacement-based finite element methods are useful for simulating bending problems and potentially addressing mesh-locking associated with nearly-incompressible elasticity,yet are computationally expens...Higher-order displacement-based finite element methods are useful for simulating bending problems and potentially addressing mesh-locking associated with nearly-incompressible elasticity,yet are computationally expensive.To address the computational expense,the paper presents a matrix-free,displacement-based,higher-order,hexahedral finite element implementation of compressible and nearly-compressible(ν→0.5)linear isotropic elasticity at small strain with p-multigrid preconditioning.The cost,solve time,and scalability of the implementation with respect to strain energy error are investigated for polynomial order p=1,2,3,4 for compressible elasticity,and p=2,3,4 for nearly-incompressible elasticity,on different number of CPU cores for a tube bending problem.In the context of this matrix-free implementation,higher-order polynomials(p=3,4)generally are faster in achieving better accuracy in the solution than lower-order polynomials(p=1,2).However,for a beam bending simulation with stress concentration(singularity),it is demonstrated that higher-order finite elements do not improve the spatial order of convergence,even though accuracy is improved.展开更多
We present a parallel numerical method of simulating the interaction of atoms with a strong laser field by solving the time-depending Schr?dinger equation(TDSE) in spherical coordinates. This method is realized by com...We present a parallel numerical method of simulating the interaction of atoms with a strong laser field by solving the time-depending Schr?dinger equation(TDSE) in spherical coordinates. This method is realized by combining constructing block diagonal matrices through using the real space product formula(RSPF) with splitting out diagonal sub-matrices for short iterative Lanczos(SIL) propagator. The numerical implementation of the solver guarantees efficient parallel computing for the simulation of real physical problems such as high harmonic generation(HHG) in these interaction systems.展开更多
为解决并行应用程序使用FFTW(Fastest Fourier Transform in the West)并行软件包所面临的计算规模难以扩展、数据结构变动大、实现不同数据结构间通信难度大、接口不确定等问题,在JASMIN框架内设计实现FFT并行解法器.该解法器封装了数...为解决并行应用程序使用FFTW(Fastest Fourier Transform in the West)并行软件包所面临的计算规模难以扩展、数据结构变动大、实现不同数据结构间通信难度大、接口不确定等问题,在JASMIN框架内设计实现FFT并行解法器.该解法器封装了数据分布存储、数据通信等并行计算细节,通过重新分布存储数据,调用一维FFT变换实现高维FFT的并行计算,并提供规范接口,支撑用户简便地实现FFT的并行计算.数值测试表明,该解法器具有很好的并行性能.该解法器已应用于激光等离子体成丝不稳定性的数值模拟并行程序,它在2048个处理器上的并行效率可达80%以上.展开更多
基金supported by Institute of Information&communications Technology Planning&Evaluation(ITP)grant funded by the Korea govermment(MSIT)(No.2019-0-00098,Advanced and Integrated Software Development for Electromagnetic Analysis)supported by Research Assistance Program(2021)in the Incheon National University.
文摘The finite element method is a key player in computational electromag-netics for designing RF(Radio Frequency)components such as waveguides.The frequency-domain analysis is fundamental to identify the characteristics of the components.For the conventional frequency-domain electromagnetic analysis using FEM(Finite Element Method),the system matrix is complex-numbered as well as indefinite.The iterative solvers can be faster than the direct solver when the solver convergence is guaranteed and done in a few steps.However,such complex-numbered and indefinite systems are hard to exploit the merit of the iterative solver.It is also hard to benefit from matrix factorization techniques due to varying system matrix parts according to frequency.Overall,it is hard to adopt conventional iterative solvers even though the system matrix is sparse.A new parallel iterative FEM solver for frequency domain analysis is implemented for inhomogeneous waveguide structures in this paper.In this implementation,the previous solution of the iterative solver of Matlab(Matrix Laboratory)employ-ing the preconditioner is used for the initial guess for the next step’s solution process.The overlapped parallel stage using Matlab’s Parallel Computing Toolbox is also proposed to alleviate the cold starting,which ruins the convergence of early steps in each parallel stage.Numerical experiments based on waveguide structures have demonstrated the accuracy and efficiency of the proposed scheme.
基金The research relied on computational resources[29]provided by the University of Colorado Boulder Research Computing Group,which is supported by the National1302 CMES,2021,vol.129,no.3 Science Foundation(Awards ACI-1532235 and ACI-1532236)University of Colorado Boulder,and Colorado State University.
文摘Higher-order displacement-based finite element methods are useful for simulating bending problems and potentially addressing mesh-locking associated with nearly-incompressible elasticity,yet are computationally expensive.To address the computational expense,the paper presents a matrix-free,displacement-based,higher-order,hexahedral finite element implementation of compressible and nearly-compressible(ν→0.5)linear isotropic elasticity at small strain with p-multigrid preconditioning.The cost,solve time,and scalability of the implementation with respect to strain energy error are investigated for polynomial order p=1,2,3,4 for compressible elasticity,and p=2,3,4 for nearly-incompressible elasticity,on different number of CPU cores for a tube bending problem.In the context of this matrix-free implementation,higher-order polynomials(p=3,4)generally are faster in achieving better accuracy in the solution than lower-order polynomials(p=1,2).However,for a beam bending simulation with stress concentration(singularity),it is demonstrated that higher-order finite elements do not improve the spatial order of convergence,even though accuracy is improved.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.11534004,11627807,11774131,and 11774130)the Scientific and Technological Project of Jilin Provincial Education Department in the Thirteenth Five-Year Plan,China(Grant No.JJKH20170538KJ)
文摘We present a parallel numerical method of simulating the interaction of atoms with a strong laser field by solving the time-depending Schr?dinger equation(TDSE) in spherical coordinates. This method is realized by combining constructing block diagonal matrices through using the real space product formula(RSPF) with splitting out diagonal sub-matrices for short iterative Lanczos(SIL) propagator. The numerical implementation of the solver guarantees efficient parallel computing for the simulation of real physical problems such as high harmonic generation(HHG) in these interaction systems.
文摘为解决并行应用程序使用FFTW(Fastest Fourier Transform in the West)并行软件包所面临的计算规模难以扩展、数据结构变动大、实现不同数据结构间通信难度大、接口不确定等问题,在JASMIN框架内设计实现FFT并行解法器.该解法器封装了数据分布存储、数据通信等并行计算细节,通过重新分布存储数据,调用一维FFT变换实现高维FFT的并行计算,并提供规范接口,支撑用户简便地实现FFT的并行计算.数值测试表明,该解法器具有很好的并行性能.该解法器已应用于激光等离子体成丝不稳定性的数值模拟并行程序,它在2048个处理器上的并行效率可达80%以上.