面向多尺度拓扑优化的渐进均匀化GPU并行算法研究

Efficient GPU parallel strategy for multi-scale topology optimization via asymptotic homogenization

下载PDF

导出

摘要针对多尺度结构拓扑设计计算效率低等问题,提出了一种基于水平集渐进均匀化的多尺度拓扑优化并行算法。基于通用图形处理器(graphics processing unit,GPU),通过水平集初始化、大型稀疏刚度矩阵方程求解以及本构矩阵并行计算,可大幅提升渐进均匀化算法的效率。实验结果表明,当三维晶胞单元网格细化至分辨率为10万时,多尺度结构拓扑优化GPU并行算法较CPU串行算法快数十倍。 In response to the low computational efficiency in the context of multi-scale structural topology design,an efficient asymptotic homogenization GPU parallel strategy is presented.The strategy leverages the graphics processing unit(GPU)and investigates parallel strategies for level set initialization,large sparse stiffness matrix equations solving and constitutive properties computing.Experimental results demonstrate that the computing efficiency of the asymptotic homogenization can be greatly improved by adopting the parallel strategies,in particular,when refining a three-dimensional unit cell grid to a resolution of 100000,the GPU parallel strategy achieves a speedup of two orders of magnitude compared to the CPU serial.

作者夏兆辉刘健力高百川聂涛余琛陈龙余金桂 XIA Zhaohui;LIU Jianli;GAO Baichuan;NIE Tao;YU Chen;CHEN Long;YU Jingui(School of Mechanical Science and Engineering/National Key Laboratory of Advanced Manufacturing Technology,Huazhong University of Science and Technology,Wuhan 430074,China;School of Mathematics and Computer Science,Wuhan Polytechnic University,Wuhan 430023,China;School of Mechanical Engineering,University of Shanghai for Science and Technology,Shanghai 200093,China;School of Mechanical and Electrical Engineering,Wuhan University of Technology,Wuhan 430070,China)

机构地区华中科技大学机械科学与工程学院/智能制造装备与技术全国重点实验室武汉轻工大学数学与计算机学院上海理工大学机械工程学院武汉理工大学机电工程学院

出处《浙江大学学报（理学版）》 CAS CSCD 北大核心 2023年第6期722-735,共14页 Journal of Zhejiang University（Science Edition）

基金国家自然科学基金青年项目(52005192) 国家重点研发计划青年科学家项目(2022YFB3302900).

关键词多尺度拓扑优化渐进均匀化统一计算设备架构(CUDA) GPU并行计算 multi-scale topology optimization asymptotic homogenization compute unified device architecture(CUDA) GPU parallel computing

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1陈尧,赵永华,赵慰,赵莲.GPU加速不完全Cholesky分解预条件共轭梯度法[J].计算机研究与发展,2015,52(4):843-850. 被引量：3

二级参考文献15

1Ament M, Knittel G, Weskopf D, et al. A parallel preconditioned conjugate gradient solver for the poisson problem on a muhi-gpu platform [C] //Proe of the 18th Euromicro Conf on Parallel, Distributed and Network-Based Processing. Piscataway, N J: IEEE, 2010:583-592.
2Benzi M, Tuma M. A comparative approximate inverse preconditioners [J]. Mathematics, 1999, 30(2): 305-340.
3Helfenstcin R, Koko J. Parallel precon gradient algorithm on GPU [J]. Journal and Applied Mathematics, 2012, 236(15).
4Li R, Saad Y, GPU-accelerated precondiiioned iterative linear solvers I-J3. The Journal of Supereomputing, 2013, 63 (2) : 443-466.
5Naumov M. Incomplete-LU and Cholesky preconditioned iterative methods using CUSPARSE and CUBLAS [R]. Santa Clara, CA; NVIDIA Corporation, 2011.
6Sudan H, Klie H, Li R, et al. High performance manyeore solvers for reservoir simulation [C] /]Proe of the 12th European Conf on the Mathematics of Oil Recovery. Berlin Springer, 2010 [2013-09-20]. http://www-users, es. umn. edu/saad/PDF/A044, pdf.
7Gupta R. A GPU implementation of a bubbly flow solver [D]. Delft, Holland: Delft University of Technology, 2009.
8Amestoy P R, Davis T A, Duff I S. An approximate minimum degree ordering algorithm [J]. SIAM Journal on Matrix Analysis and Applieations, 1996, 17(4): 886-905.
9George A, 1.iu J W H. The evolution of the minimum degree ordering algorithm [J]. SIAM Review, 1989, 31 (1) : 1-19.
10Saad Y. lteralive Methods for Sparse Linear Systems [M]. Philadelphia, PA, SIAM, 2003.

共引文献2

1程凯,田瑾,马瑞琳.基于GPU的高效稀疏矩阵存储格式研究[J].计算机工程,2018,44(8):54-60. 被引量：8
2龙立,贺金川,郑山锁,周炎.供水系统震后水力分析算法并行化研究[J].华中科技大学学报（自然科学版）,2020,48(12):121-126. 被引量：1

1Wen-ming Peng,Yun-feng Liu,Xian-feng Jiang,Xing-tao Dong,Janice Jun,Dale A. Baur,Jia-jie Xu,Hui Pan,Xu Xu.Bionic mechanical design and 3D printing of novel porous Ti6Al4V implants for biomedical applications[J].Journal of Zhejiang University-Science B(Biomedicine & Biotechnology),2019,20(8):647-659. 被引量：13
2刘金华,黑晓明,姚书山,刘树江.VESTA软件在典型晶体结构教学中的应用[J].化学通报,2020,83(10):955-959. 被引量：3
3陈嘉卿,黄嘉嘉,许弢.基于GPU实时仿真小脑模型神经网络优化的研究[J].五邑大学学报（自然科学版）,2023,37(4):31-37.
4金一杲,胡翰.海量点云通用图形处理器缓存机制与并行编辑方法[J].测绘科学,2023,48(7):200-207.
5于海平,何发智,杨艳霞,林晓丽.基于自适应扰动信息模型的水平集分割方法[J].计算机应用与软件,2023,40(11):213-219.
6唐芳,冯应朗,卢海山.无网格法结构拓扑优化模型的GPU并行加速求解及应用[J].装备制造技术,2023(6):10-15.
7张战伟,李增三,庞坤.RPC模型影像校正并行算法设计及优化[J].山西建筑,2023,49(17):173-176. 被引量：1
8韩丰,高嵩,薛峰,李月安.基于CUDA的并行雷达拼图算法研究[J].气象,2023,49(10):1246-1253. 被引量：1
9邓炫烨,张春威,罗梦婷.基于弹簧质点模型的三维布料仿真[J].信息与电脑,2023,35(8):11-14.
10汪飞,李伟鸿,杨彧,姜大志,赵宝全,罗笑南.动脉粥样硬化斑块生成的高效流固耦合不可压缩SPH模拟方法[J].浙江大学学报（理学版）,2023,50(6):711-721. 被引量：1

浙江大学学报（理学版）

2023年第6期

浏览历史

内容加载中请稍等...

面向多尺度拓扑优化的渐进均匀化GPU并行算法研究

参考文献1

二级参考文献15

共引文献2

相关作者

相关机构

相关主题

浏览历史