基于多绘制管线的大规模并行体绘制性能优化技术

Performance Optimization Technique for Large-Scale Parallel Volume Rendering Based on Multiple Rendering Pipelines

下载PDF

导出

摘要针对数值模拟输出的大规模科学数据,体绘制方法为了刻画复杂物理特征,会进行高密度光线采样,但由此带来了极大的计算开销和数据增量。在国产自主CPU高性能计算机上,由于处理器单核的计算能力低于商业CPU,只能使用更多的处理器核来分担体绘制任务,从而引起了采样数据并行通信的可扩展性瓶颈。为充分利用国产自主CPU高性能计算机来高效完成体绘制任务,针对大规模并行体绘制提出一种基于多绘制管线的性能优化技术,通过多管线、多进程的两级并行模式来降低单条管线的并行规模。在大规模并行体绘制中,该技术将绘制目标图像划分成多个子区域,绘制进程则相应分组,每个进程组独立执行一条绘制管线,以完成图像相应子区域的绘制,最后再收集所有的图像子区域,形成完整图像并输出。实验结果表明,优化后的体绘制算法在国产自主CPU高性能计算机上可以扩展到万核规模,并能有效完成体绘制任务。 For large-scale scientific data output in numerical simulations,volume rendering methods inevitably perform high-density ray sampling to capture complex physical features,resulting in significant computational overhead and data increment.However,on domestic autonomous-CPU supercomputers,owing to the lower computing power of a single processor core compared to that of commercial CPU,more processor cores must be used to share volume rendering tasks;this leads to scalability bottlenecks in the parallel communication of sampling data.Full utilization of domestic autonomous-CPU supercomputers to efficiently complete volume rendering tasks is an urgent problem that needs to be solved.To address this problem,this paper proposes a performance optimization technique for large-scale parallel volume rendering based on multiple rendering pipelines;here,the parallel scale of a rendering pipeline is reduced by two-level parallelism:first,at the pipeline level,and then,at the process level.In large-scale parallel volume rendering after optimization,the rendered goal image is first divided into multiple sub-regions,and all rendering processes are grouped accordingly.Each process group then executes a rendering pipeline independently,and as a result,the corresponding sub-region of the image is produced.Finally,all sub-regions of the image are collected,and the whole image is output.Experiments demonstrate that the optimized volume rendering algorithm can scale to approximately 10000 processing cores on domestic autonomous-CPU supercomputers and can effectively complete volume rendering tasks.

作者王华维刘若妍艾志玮曹轶 WANG Huawei;LIU Ruoyan;AI Zhiwei;CAO Yi(Laboratory of Computational Physics,Institute of Applied Physics and Computational Mathematics,Beijing 100088,China;CAEP Software Center for High Performance Numerical Simulation,Beijing 100088,China)

机构地区北京应用物理与计算数学研究所计算物理重点实验室中物院高性能数值模拟软件中心

出处《计算机工程》 CAS CSCD 北大核心 2024年第8期207-215,共9页 Computer Engineering

基金国家重点研发计划(2017YFB0202203)。

关键词体绘制多管线两级并行并行可扩展性性能优化 volume rendering multiple pipelines two-level parallelism parallel scalability performance optimization

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献3

1陈为,夏佳志,张龙,于洋,郑文庭,彭群生.一种统一的硬件加速自适应EWA Splatting算法[J].计算机学报,2009,32(8):1571-1581. 被引量：6
2罗月童,薛晔,刘晓平.基于GPU的多分辨率体数据重构和渲染[J].计算机辅助设计与图形学学报,2009,21(1):107-111. 被引量：12
3王华维,何柳,曹轶,肖丽.大规模科学数据体绘制技术综述[J].国防科技大学学报,2020,42(2):1-12. 被引量：3

二级参考文献24

1吴恩华,柳有权.基于图形处理器(GPU)的通用计算[J].计算机辅助设计与图形学学报,2004,16(5):601-612. 被引量：227
2Cabral B, Cam N, Foran J. Accelerated volume rendering and tomographic reconstruction using texture mapping hardware [C]//Proceedings of Symposium on Volume Visualization, Washington D C, 1994:91-98+131
3Engel K, Kraus M, Ertl T. High quality pre integrated volume rendering using hardware accelerated pixel shading [C]//Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Workshop on Graphics Hardware, Los Angeles, 2001:9-16
4Meiβner M, Guthe S, Straβer W. Interactive lighting models and pre integration for volume rendering on PC graphics accelerators [C] //Proceedings of Graphics Interface, Calgary, 2002:209-218
5Meiβner M, Hoffmann U, Straβer W. Enabling classification and shading for 3D texture mapping based volume rendering using OpenGL and extensions [C]//Proceedings of the Conference on Visualization, San Francisco, 1999:207-214
6Kraus M, Ertl T. Adaptive texture maps [C] //Proceedings of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware, Saarbrucken, 2002:7-15
7Binotto A P D, Comba J L D, Freitas C M D F. Real-time volume rendering of time varying data using a fragment shader compression approach [C]//Proceedings of IEEE Symposium on Parallel and Large-Data Visualization and Graphics, Washington D C, 2003: 69-75
8Fout N, Akiba H, Ma K L, etal. High quality rendering of compressed volume data formats [C] //Proceedings of EUROGRAPHICS-IEEE VGTC Symposium on Visualization, Leeds, 2005: 77-84
9Guthe S, Wand M, Gonser J, et al. Interactive rendering of large volume data sets [C] //Proceedings of IEEE Visualization, Boston, 2002:53-60
10Linde Y, Buzo A, Gray R M. An algorithm for vector quantizer design [J]. IEEE Transactions on Communication, 1980, COM-28(1): 84-95

共引文献16

1赵利平,肖德贵,李肯立,乐光学,彭成斌.一种高效体数据压缩算法及其在地震数据处理中的应用[J].计算机辅助设计与图形学学报,2009,21(11):1606-1611. 被引量：6
2周志光,陶煜波,林海.一种有效显示隐藏特征的光线投射算法[J].计算机学报,2011,34(3):517-525. 被引量：10
3韩松,潘纲,傅俊康,张宇霆.GPU加速的三维人脸真实表情合成[J].计算机辅助设计与图形学学报,2011,23(5):747-755. 被引量：1
4肖德贵,李磊,张杨.一种大规模体数据压缩体绘制策略[J].湖南大学学报（自然科学版）,2011,38(7):73-77.
5马伯宁,王晨昊,汤晓安,匡纲要.基于GPU的二维离散小波变换快速计算[J].国防科技大学学报,2011,33(3):111-114. 被引量：1
6秦绪佳,王建奇,朱思达,郑红波,徐晓刚.基于GPU的四维医学图像动态快速体绘制[J].计算机辅助设计与图形学学报,2011,23(11):1789-1798. 被引量：13
7孙劲光,杨新年,李扬.针对类球形对象的改进光线投射算法[J].计算机工程与设计,2012,33(8):3239-3243. 被引量：1
8梁荣华,吴云飞,马祥音.局部特征加强的体绘制算法[J].计算机辅助设计与图形学学报,2012,24(10):1302-1311. 被引量：7
9连远锋,赵琰,何晖光,吴发林.基于GPU加速的并行脑皮层重建算法研究[J].仪器仪表学报,2013,34(4):866-872. 被引量：5
10贺承浩,金西,郑琳琳,刘子恒,王浩原.时变数据的实时体绘制加速算法优化[J].计算机辅助设计与图形学学报,2014,26(2):314-319. 被引量：1

1任瑞恩.针对功率降低的农用柴油机性能优化技术[J].中国农机装备,2024(2):6-8.
2曹艺迪.关于Ceph分布式存储系统性能优化技术的研究[J].信息与电脑,2023,35(24):178-180.
3刘颖.基于管道外防腐层检测装置的研究[J].中文科技期刊数据库（全文版）工程技术,2021(10):224-226.
4肖艺,郭凯昕,明平剑.大规模并行非结构滑移网格守恒型隐式处理方法[J].中国造船,2024,65(1):267-277.
5方丽萍,袁瑛,陶晓峰.基于Cuda的改进体绘制算法在CT血管图像三维重建的应用价值[J].实用放射学杂志,2024,40(4):659-662.
6毛润彰,杜皓,田鸿运,黄思路,张鹏,徐小文.几类典型应用的代数多重网格算法并行可扩展瓶颈分析[J].计算物理,2024,41(4):403-417.
7耿磊,张文跃,肖志涛,王雯,李晓捷.基于特征重建的无监督木材图像异常检测[J].计算机工程与设计,2024,45(6):1829-1835.
8王凤华.智慧校园建设中的网络架构与性能优化[J].IT经理世界,2024(1):99-101.
9张帅,王玉亮.基于VR技术的室内物体可视化建模设计[J].新乡学院学报,2024,41(3):26-31.
10黄涛,程新满,魏石峰,侯东.SCADE座舱显示软件性能优化技术[J].中国科技信息,2024(12):116-119.

计算机工程

2024年第8期

浏览历史

内容加载中请稍等...

基于多绘制管线的大规模并行体绘制性能优化技术

参考文献3

二级参考文献24

共引文献16

相关作者

相关机构

相关主题

浏览历史