期刊文献+

片上P/G网求解算法及其GPU上的并行化

Algorithm Parallelization of on-Die Power/Ground Network Solving Based on GPU Parallel Computing
下载PDF
导出
摘要 为了得到片上电源线/地线网络(P/G网)快速而准确的求解算法,根据结构化供电网的局部性效应,重新分析了连续过松弛迭代法(SOR)和变向隐含迭代法(ADI)在P/G网中的求解效率及并行性,提出了利于GPU加速的并行算法:G_RBSOR和G_ADI.它们均采用规则的数据结构,以利于GPU并行读写数据,并采用合并归约来并行计算迭代结束标志位.为了避免GPU计算的数据冲突,G_RBSOR算法采用棋盘格方式对电路节点进行红黑分类,并对红黑节点进行交错松弛.实验结果表明,在不损失精度的前提下,与各自对应的CPU串行算法相比,G_RBSOR和G_ADI算法均取得了超过50倍的加速效果;与高效的P/G分析串行求解算法ICCG相比,也取得了超过5倍的加速效果. In order to study fast and accurate algorithms for power/ground network (P/G network) analyses, based on the locality effect of structure P/G networks, this work rethinks the efficiency and parallelism of successive over relaxation (SOR) algorithm and alternating direction implicit (ADI) algorithm. And then it proposes the optimized GPU-friendly parallel algorithms: G_RBSOR and G_ ADI. The algorithms both use the regular data structure to facilitate GPU parallel data reading/ writing. And they both use the merging reduction technique for GPU parallel computing to fast calculate the iteration-ending flags, too. Furthermore, in order to avoid the data collision in GPU parallel calculating, G_RBSOR uses the checkerboard strategy to classify all P/G network nodes into red and black groups and then, relax red nodes and black nodes step-by-step. Experimental results show that without any precision penalty, G_RBSOR and G_ADI algorithms can achieve more than 50X speedup over their serial CPU counterparts. In comparison with the efficient serial algorithm ICCG, both can also achieve more than 5X speedup.
出处 《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2014年第7期1203-1210,共8页 Journal of Computer-Aided Design & Computer Graphics
基金 国家自然科学基金(61274033 61271198 61301146) 国家"八六三"高技术研究发展计划(2009AA01Z126)
关键词 电源线 地线网络 连续过松弛迭代法 交替方向迭代法 图形处理器 并行计算 power/ground network successive over relaxation alternating direction iterative GPU parallel computing
  • 相关文献

参考文献16

  • 1Wilson L,Mangum S.International technology roadmap for Semiconductors(ITRS)[OL].http://www.itrs.net/Links/2011ITRS/20 11Chapters/2011Interconnect.pdf.
  • 2骆祖莹.电热分析研究的现状与展望[J].计算机辅助设计与图形学学报,2009,21(9):1203-1211. 被引量:9
  • 3LUO ZuYing,ZHAO GuoXing,GORDON Joseph A.,TAN Sheldon X.-D..Localized relaxation theory of circuits and its applications in electro-thermal analyses[J].Science China(Information Sciences),2012,55(4):938-950. 被引量:1
  • 4Zhong Y,Wong M D F.Fast algorithms for IR drop analysis in large power grid [C]//Proceedings of the IEEE/ACM International Conference on Computer-Aided Design.Los Alamitos:IEEE Computer Society Press,2005:351-357.
  • 5Luo Z Y,Tan S X D,Fan J.Localized statistical 3D thermal analysis considering electro-thermal coupling [C]//Proceedings of IEEE International Symposium on Circuit and System.Los Alamitos:IEEE Computer Society Press,2009:1289-1292.
  • 6Chen T H,Chen C C P.Efficient large-scale power grid analysis based on preconditioned krylov-suhspace iterative methods [C]//Proceedings of the 38th Annual Design Automation Conference.New York:ACM Press,2001:559-562.
  • 7Kozhaya J N,Nassif S R,Najm F N.A multigrid-like technique for power grid analysis [J].IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems,2002,21(10):1148-1160.
  • 8Qian H F,Nassif S R,Sapatnekar S S.Random walks in a supply network [C]//Proceedings of Design Automation Conference.New York:ACM Press,2003:93-98.
  • 9刘鑫,许华荣.基于GPU的特征点提取与匹配算法比较[J].计算机辅助设计与图形学学报,2013,25(10):1496-1502. 被引量:7
  • 10Sinha R,Prakash A,Patel H D.Parallel simulation of mixed-abstraction SystemC models on GPUs and multicore CPUs [C]//Proceedings of the 7th Asia and South Pacific Design Automation Conference.Los Alamitos:IEEE Computer Society Press,2012:455-460.

二级参考文献8

共引文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部