A set of small-stencil new Pade schemes with the same denominator are presented to solve high-order nonlinear evolution equations. Using this scheme, the fourth-order precision can not only be kept, but also the final...A set of small-stencil new Pade schemes with the same denominator are presented to solve high-order nonlinear evolution equations. Using this scheme, the fourth-order precision can not only be kept, but also the final three-diagonal discrete systems are solved by simple Doolittle methods, or ODE systems by Runge-Kutta technique. Numerical samples show that the schemes are very satisfactory. And the advantage of the schemes is very clear compared to other finite difference schemes.展开更多
The accuracy of unstructured finite volume methods is greatly influenced by the gradient reconstruction, for which the stencil selection plays a critical role. Compared with the commonly used face-neighbor and vertex-...The accuracy of unstructured finite volume methods is greatly influenced by the gradient reconstruction, for which the stencil selection plays a critical role. Compared with the commonly used face-neighbor and vertex-neighbor stencils, the global-direction stencil is independent of the mesh topology, and characteristics of the flow field can be well reflected by this novel stencil. However, for a high-aspect-ratio triangular grid, the grid skewness is evident, which is one of the most important grid-quality measures known to affect the accuracy and stability of finite volume solvers. On this basis and inspired by an approach of using face-area-weighted centroid to reduce the grid skewness, we explore a method by combining the global-direction stencil and face-area-weighted centroid on high-aspect-ratio triangular grids, so as to improve the computational accuracy. Four representative numerical cases are simulated on high-aspect-ratio triangular grids to examine the validity of the improved global-direction stencil. Results illustrate that errors of this improved methods are the lowest among all methods we tested, and in high-mach-number flow, with the increase of cell aspect ratio, the improved global-direction stencil always has a better stability than commonly used face-neighbor and vertex-neighbor stencils. Therefore, the computational accuracy as well as stability is greatly improved, and superiorities of this novel method are verified.展开更多
We propose an adaptive stencil construction for high-order accurate finite volume schemes a posteriori stabilized devoted to solve one-dimensional steady-state hyperbolic equations.High accuracy(up to the sixth-order ...We propose an adaptive stencil construction for high-order accurate finite volume schemes a posteriori stabilized devoted to solve one-dimensional steady-state hyperbolic equations.High accuracy(up to the sixth-order presently)is achieved,thanks to polynomial recon-structions while stability is provided with an a posteriori MOOD method which controls the cell polynomial degree for eliminating non-physical oscillations in the vicinity of dis-continuities.We supplemented this scheme with a stencil construction allowing to reduce even further the numerical dissipation.The stencil is shifted away from troubles(shocks,discontinuities,etc.)leading to less oscillating polynomial reconstructions.Experimented on linear,Burgers',and Euler equations,we demonstrate that the adaptive stencil technique manages to retrieve smooth solutions with optimal order of accuracy but also irregular ones without spurious oscillations.Moreover,we numerically show that the approach allows to reduce the dissipation still maintaining the essentially non-oscillatory behavior.展开更多
Improved Weighted Essentially Non-oscillatory Scheme is a high order finite volume method. The mixed stencils can be obtained by a combination of r + 1 order and r order stencils. We improve the weights by the mapping...Improved Weighted Essentially Non-oscillatory Scheme is a high order finite volume method. The mixed stencils can be obtained by a combination of r + 1 order and r order stencils. We improve the weights by the mapping method. The restriction that conventional ENO or WENO schemes only use r order stencils, is removed. Higher resolution can be achieved by introducing the r + 1 order stencils. This method is verified by three cases, i.e. the interaction of a moving shock with a density wave problem, the interacting blast wave problem and the double mach reflection problem. The numerical results show that the Improved Weighted Essential Non-oscillatory method is a stable, accurate high-resolution finite volume scheme.展开更多
The concept of mathematical stencil and the strategy of stencil elimination for solving the finite difference equation is presented, and then a new type of the iteration algorithm is established for the Poisson equati...The concept of mathematical stencil and the strategy of stencil elimination for solving the finite difference equation is presented, and then a new type of the iteration algorithm is established for the Poisson equation. The new algorithm has not only the obvious property of parallelism, but also faster convergence rate than that of the classical Jacobi iteration. Numerical experiments show that the time for the new algorithm is less than that of Jacobi and Gauss-Seidel methods to obtain the same precision, and the computational velocity increases obviously when the new iterative method, instead of Jacobi method, is applied to polish operation in multi-grid method, furthermore, the polynomial acceleration method is still applicable to the new iterative method.展开更多
In this paper, we present a hybrid circular queue method that can significantly boost the performance of stencil computations on GPU by carefully balancing usage of registers and shared-memory. Unlike earlier methods ...In this paper, we present a hybrid circular queue method that can significantly boost the performance of stencil computations on GPU by carefully balancing usage of registers and shared-memory. Unlike earlier methods that rely on circular queues predominantly implemented using indirectly addressable shared memory, our hybrid method exploits a new reuse pattern spanning across the multiple time steps in stencil computations so that circular queues can be implemented by both shared memory and registers effectively in a balanced manner. We describe a framework that automatically finds the best placement of data in registers and shared memory in order to maximize the performance of stencil computations. Validation using four different types of stencils on three different GPU platforms shows that our hybrid method achieves speedups up to 2.93X over methods that use circular queues implemented with shared-memory only.展开更多
文摘A set of small-stencil new Pade schemes with the same denominator are presented to solve high-order nonlinear evolution equations. Using this scheme, the fourth-order precision can not only be kept, but also the final three-diagonal discrete systems are solved by simple Doolittle methods, or ODE systems by Runge-Kutta technique. Numerical samples show that the schemes are very satisfactory. And the advantage of the schemes is very clear compared to other finite difference schemes.
基金Project supported by the National Key Project, China (Grant No. GJXM92579).
文摘The accuracy of unstructured finite volume methods is greatly influenced by the gradient reconstruction, for which the stencil selection plays a critical role. Compared with the commonly used face-neighbor and vertex-neighbor stencils, the global-direction stencil is independent of the mesh topology, and characteristics of the flow field can be well reflected by this novel stencil. However, for a high-aspect-ratio triangular grid, the grid skewness is evident, which is one of the most important grid-quality measures known to affect the accuracy and stability of finite volume solvers. On this basis and inspired by an approach of using face-area-weighted centroid to reduce the grid skewness, we explore a method by combining the global-direction stencil and face-area-weighted centroid on high-aspect-ratio triangular grids, so as to improve the computational accuracy. Four representative numerical cases are simulated on high-aspect-ratio triangular grids to examine the validity of the improved global-direction stencil. Results illustrate that errors of this improved methods are the lowest among all methods we tested, and in high-mach-number flow, with the increase of cell aspect ratio, the improved global-direction stencil always has a better stability than commonly used face-neighbor and vertex-neighbor stencils. Therefore, the computational accuracy as well as stability is greatly improved, and superiorities of this novel method are verified.
基金support by FEDER-Fundo Europeu de Desenvolvimento Regional,through COMPETE 2020-Programa Operational Fatores de Competitividade,and the National Funds through FCT-Fundacao para a Ciencia e a Tecnologia,project no.UID/FIS/04650/2019support by FEDER-Fundo Europeu de Desenvolvimento Regional,through COMPETI E 2020-Programa Operacional Fatores de Competitividade,and the National Funds through FCT-Fundacao para a Ciencia e a Tecnologia,project no.POCI-01-0145-FEDER-028118
文摘We propose an adaptive stencil construction for high-order accurate finite volume schemes a posteriori stabilized devoted to solve one-dimensional steady-state hyperbolic equations.High accuracy(up to the sixth-order presently)is achieved,thanks to polynomial recon-structions while stability is provided with an a posteriori MOOD method which controls the cell polynomial degree for eliminating non-physical oscillations in the vicinity of dis-continuities.We supplemented this scheme with a stencil construction allowing to reduce even further the numerical dissipation.The stencil is shifted away from troubles(shocks,discontinuities,etc.)leading to less oscillating polynomial reconstructions.Experimented on linear,Burgers',and Euler equations,we demonstrate that the adaptive stencil technique manages to retrieve smooth solutions with optimal order of accuracy but also irregular ones without spurious oscillations.Moreover,we numerically show that the approach allows to reduce the dissipation still maintaining the essentially non-oscillatory behavior.
文摘Improved Weighted Essentially Non-oscillatory Scheme is a high order finite volume method. The mixed stencils can be obtained by a combination of r + 1 order and r order stencils. We improve the weights by the mapping method. The restriction that conventional ENO or WENO schemes only use r order stencils, is removed. Higher resolution can be achieved by introducing the r + 1 order stencils. This method is verified by three cases, i.e. the interaction of a moving shock with a density wave problem, the interacting blast wave problem and the double mach reflection problem. The numerical results show that the Improved Weighted Essential Non-oscillatory method is a stable, accurate high-resolution finite volume scheme.
文摘The concept of mathematical stencil and the strategy of stencil elimination for solving the finite difference equation is presented, and then a new type of the iteration algorithm is established for the Poisson equation. The new algorithm has not only the obvious property of parallelism, but also faster convergence rate than that of the classical Jacobi iteration. Numerical experiments show that the time for the new algorithm is less than that of Jacobi and Gauss-Seidel methods to obtain the same precision, and the computational velocity increases obviously when the new iterative method, instead of Jacobi method, is applied to polish operation in multi-grid method, furthermore, the polynomial acceleration method is still applicable to the new iterative method.
基金Supported in part by the National Basic Research 973 Program of China under Grant Nos. 2011CB302504 and 2011ZX01028-001-002the National High Technology Research and Development 863 Program of China under Grant No. 2009AA01A129+1 种基金the National Natural Science Foundation of China (NSFC) under Grant No. 60970024the Innovation Research Group of NSFC under Grant No. 60921002
文摘In this paper, we present a hybrid circular queue method that can significantly boost the performance of stencil computations on GPU by carefully balancing usage of registers and shared-memory. Unlike earlier methods that rely on circular queues predominantly implemented using indirectly addressable shared memory, our hybrid method exploits a new reuse pattern spanning across the multiple time steps in stencil computations so that circular queues can be implemented by both shared memory and registers effectively in a balanced manner. We describe a framework that automatically finds the best placement of data in registers and shared memory in order to maximize the performance of stencil computations. Validation using four different types of stencils on three different GPU platforms shows that our hybrid method achieves speedups up to 2.93X over methods that use circular queues implemented with shared-memory only.