面向多核CPU多GPU的节点内并行混合绘制模型被引量：3

Hybrid Rendering Model for Multi-CPU Multi-GPU Distributed Parallel Rendering Cluster Node

下载PDF

导出

摘要分布式并行绘制集群节点可以配置多核CPU和多个GPU构建节点内多CPU多GPU系统。现有的节点内并行绘制模型既没有充分发挥多核CPU的强大计算能力，还将绘制、读回和合成阶段串行耦合在一起导致了大量的GPU闲置停顿，严重影响了节点内并行绘制性能。提出了一种节点内高效的并行绘制模型，通过软件绘制与硬件绘制相结合的方法将硬件绘制与图像合成分离，同时利用DMA异步传输机制，构建了节点内绘制、读回和合成三段并行绘制流水线。与现有节点内并行绘制模型相比，并行混合绘制模型不但降低GPU资源闲置率，而且提高了CPU资源使用率。理论分析与实验表明相同应用采用并行混合绘制模型的性能可以达到现有模型的3-4倍，并且具有更好的数据扩展性、性能扩展性。 Distributed parallel rendering cluster nodes can accommodate multi-core CPU and multi-GPU. But the present parallel rendering models of node do not make full use of the multi-core CPU computing power and serially join the rendering, readback and composition stages together. This damages system performance and frequently makes GPUs stall. A novel efficient parallel rendering model was introduced. It decoupled the hardware rendering and composition stage with hybrid rendering. With asynchronous DMA transfer, a parallel rendering pipeline with the three stages in one node was constructed. Comparing with the present models, the model not only decreases GPU stall and improves the multi-core CPU usage. Theoretical analysis and experiment results show that the model performance is 3~4 times of the presents model and has much better data and performance scalability.

作者刘华海王攀蔡勋曾亮王文珂李思昆

机构地区国防科学技术大学计算机学院

出处《系统仿真学报》 CAS CSCD 北大核心 2012年第1期94-98,112,共6页 Journal of System Simulation

基金国家"973"项目(2009CB723803) 国家自然科学基金(61170157)

关键词 Multi-GPU MULTI-CPU 分布式并行绘制异步合成 DMA multi-GPU multi-CPU distributed parallel rendering asynchronous composition DMA

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1T Fogal, H Childs, S Shankar, J Krueger, R D Bergeron, P Hatcher Large Data Visualization on Distributed Memory Multi-GPU Clusters [C]// Proc. of High Performance Graphics 2010. Switzerland: Eurographics Association, 2010: 57-66.
2Kitware Incorporated. ParaView. [EB/OL]. (2009) [2011 ]. http://www.paraview.org.
3NVIDIA Incorporated. NVIDIA Hybrid_SLI. [EB/OL]. (2010) [2011]. http ://www.nvidia.com/obj ectJhybrid_sli.html.
4AMD Incorporated. AMD CrossFire. [EB/OL]. (2010) [2011]. http://www.amd.com/us/PRODUCTS/ WORKSTATION/ GRAPHICS/ CROSSFIRE-PRO/Pages/crossfire-pro.aspx.
5G Humphreys, M Eldridge, I Buck, G Stoll, M Everett, P Hanrahan. WireGL: A Scalable Graphics System for Clusters [C]// Proc. of SIGGRAPH 2001. USA: ACM, 2001: 129-140.
6G Humphreys, H Mike, T James. Chromium: A Stream- Processing Framework for Interactive Rendering on Clusters [C]// Proc. of SIGGRAPH 2002. USA: ACM, 2002: 693-702.
7Moerschell J Owens. Distributed Texture Memory in a Multi- GPU Environment [J]. Computer Graphics Forum (S0167-7055), 2008, (27): 130-151.
8Lefohn S Sengupta, J Kniss, R Strzodka, J Owens Glift. Generic, Efficient, Random-Access GPU Data Structures [J]. ACM Transactions on Graphics (S0730-0301), 2006, (25): 60-99.
9P Bhaniramka, P C D Robert, S Eilemann. OpenGL Multipipe SDK: A Toolkit for Scalable Parallel Rendering [C]// Proc. of IEEE Visualization 2005. USA: IEEE, 2005: 119-126.
10S Eilemarm, M Maldainya, R Pajarola. Equalizer: A Scalable Parallel Rendering Framework Visualization and Computer Graphics [J]. IEEE Transactions on Graphics (S0730-0301), 2009, (15): 436-452.

同被引文献34

1沈卫超,曹立强,夏芳,宋磊.面向数值模拟数据的HDF5性能优化[J].计算机研究与发展,2012,49(S1):314-318. 被引量：10
2张纯,毛菁霞,张如鸿,吴百锋,彭澄廉,陈泽文,孙晓光.基于图形硬件加速的体绘制关键技术综述[J].计算机工程与设计,2005,26(7):1732-1734. 被引量：5
3Bhaniramka P, Robert P C D, Eilemann S. OpenGL multipipe SDK: a toolkit for scalable parallel rendering [C] // Proceedings of IEEE Visualization. Los Alamitos: IEEE Computer Society Press, 2005:119-126.
4Eilemann S, Makhinya M, Pajarola R. Equalizer; a scalable parallel rendering framework [J]. IEEE Transactions on Visualization and Computer Graphics, 2009, 15(3) : 436-452.
5Zhou K, Hou Q M, Ren Z, et al. RenderAnts: interactive Reyes rendering on GPUs [J]. ACM Transactions on Graphics, 2009, 28(5): Article No. 155.
6Moll L, Heirich A, Shand M. Sepia: scalable 3D eompositing using PCI pamette [C] //Proceedings of IEEE Symposium on Field -Programmable Custom Computing Machines. Los Alamitos: IEEE Computer Society Press, 1999:146-155.
7Lombeyda S, Moll L, Shand M, et al. Scalable interactive volume rendering using Off-The-Shelf components [C] // Proceedings of the IEEE Symposium on Parallel and Large-data Visualization and Graphics. Los Alamitos: IEEE Computer Society Press, 2001:115-121.
8Stoll G, Eldridge M, Patterson D, et al. Lighming-2; a high-performance display subsystem for PC clusters [C] // Proceedings of ACM SIGGRAPH. New York: ACM Press, 2001:141-148.
9Zhang X Y, Ba]a] C, Blanke W. Scalable isosur{ace visualization of massive datasets on COTS clusters [C] // Proceedings of IEEE Symposium on Parallel and Large Data Visualization and Graphics. Los Alamitos: IEEE Computer Society Press, 2001 : 51-58.
10Muraki S, Ogata M, Ma K L, et al. Next-generation visual supereomputing using PC clusters with volume graphics hardware deviees [C]//Proceedings of ACM/IEEE Conference on Supereomputing. New York: ACM Press, 2001:51-95.

引证文献3

1刘华海,王攀,李思昆,蔡勋,王文珂,曾亮.节点内无冗余图像合成方法[J].计算机辅助设计与图形学学报,2013,25(5):646-652. 被引量：1
2谈敦铭,曹国廷,郎娟芳,杨朔.飞行器大数据量CAD模型并行绘制[J].系统仿真学报,2016,28(9):2049-2053. 被引量：1
3艾志玮,曹轶,肖丽,王华维.图形驱动感知的异构硬件高效绘制模型[J].系统仿真学报,2016,28(10):2394-2399.

二级引证文献2

1刘凡美.基于GPU加速的多投影融合新算法的实现[J].电子技术与软件工程,2013(19):204-206. 被引量：1
2杨艳芳,高居建,王奇,舒亮,无.面向复杂生产场景的数字孪生模型分布式渲染方法[J].计算机集成制造系统,2023,29(6):1811-1823. 被引量：3

1向南平,江资斌,周翠竹.VC++6.0中利用OpenGL实现3DS模型的交互控制[J].电脑编程技巧与维护,2003(3):73-76. 被引量：1
2灵犀.存储虚拟化以“虚”管“实”[J].中国计算机用户,2004(46).
3本苯.SLI回归?——NVIDIA SLI multi-GPU简介[J].大众硬件,2004(8):91-91.
4宋锦华,马传琦.用Vb编程控制AutoCAD绘制模型[J].科技信息,2008(1):66-66. 被引量：3
5多GPU(MULTI-GPU)：创建项尖游戏平台[J].新电脑,2008(4):216-216.
6张岩.并行显卡:nVIDIA SLI Multi-GPU技术再现[J].个人电脑,2004,10(8):192-197. 被引量：1
7刘春明,郭成,刘宏敏.基于面向对象设计方法的列车运行图绘制[J].铁路计算机应用,2008,17(3):4-7.
8谷震离.CORBA中的异步传输机制[J].微型电脑应用,2003,19(10):57-59.
9吕相文,袁家斌,张玉洁.云计算环境下多GPU资源调度机制研究[J].小型微型计算机系统,2016,37(4):687-693. 被引量：3
10崔英志,唐鑫,李为,高博.基于Petri网模型的异步并发性[J].重庆工学院学报（自然科学版）,2009,23(10):119-124. 被引量：3

系统仿真学报

2012年第1期

浏览历史

内容加载中请稍等...

面向多核CPU多GPU的节点内并行混合绘制模型被引量：3

参考文献10

同被引文献34

引证文献3

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

面向多核CPU多GPU的节点内并行混合绘制模型 被引量：3

参考文献10

同被引文献34

引证文献3

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

面向多核CPU多GPU的节点内并行混合绘制模型被引量：3