期刊文献+

考虑温度/功耗/热导之间相互作用的单循环迭代热分析算法 被引量:1

TPG-Sli: Single-Loop Iterative Thermal Analysis Algorithm Considering Interactions Among Temperature,Power and Heat Conductance
下载PDF
导出
摘要 随着纳米工艺的不断改进,温度对漏电流功耗和热导的影响日益显著.考虑温度/功耗/热导相互作用的3D芯片热分析需要采用迭代方法对温度进行精确求解,即先用功耗密度向量和热导矩阵来求解温度向量,再用求解出来的温度向量来刷新功耗密度向量和热导矩阵.为了提高3D芯片热分析的效率,本文以一个设定温度值下的均匀热导矩阵作为预条件,先提出了一种双循环、内循环低迭代次数的高效求解算法TPG-FTCG.鉴于TPG-FTCG具有超快的内循环收敛速度,本文省去了TPG-FTCG算法的内循环部分,提出了一种单循环、低迭代次数的TPG求解算法TPG-Sli.基于GPU(Graphics Processing Unit)并行加速技术,本文编写并改进了TPG-Sli的GPU加速算法.实验数据表明:与采用经典高效的ICCG算法进行3D芯片热分析的TPG-ICCG算法相比,在足够小的误差范围内,TPG-Sli的GPU加速算法可以获得120倍的速度提升. With the improvement of the nanometer technology,the influences among temperature,leakage power and heat conductance become increasingly significant and it should be taken into account in 3 D chip comprehensive thermal anal-ysis to solve the accurate temperature based on the iterative solution.The comprehensive thermal analysis method uses the nodal power density vector and the heat conductance matrix to solve the nodal temperature vector,and then,refreshes power density and heat conductance with the obtained nodal temperature.In order to improve the efficiency of 3D chip comprehen-sive thermal analysis,this work uses the heat conductance matrix as the precondition under a setting temperature.Then it pro-poses an efficient algorithm TPG-FTCG(CG with the Fast Transform-based Preconditioner)which has double-loop and low-er inner-loop iterations.According to TPG-FTCG’s fast inner-loop convergence rate,this work removes TPG-FTCG’s in-ner-loop part then proposes a more efficient TPG solving algorithm TPG-Sli(Single-loop iterative),which only has single-loop iterative and fewer iterations.Based on the GPU parallel computing,this work compiles and refines TPG-Sli’s GPU-parallel-computing algorithm.Experimental results demonstrate that:On the premise of precision losing,the TPG-Sli’s GPU algorithm can achieve about 120X speedup compared with the TPG-ICCG algorithm,which uses the classical and efficient ICCG to deal with the 3 D chip comprehensive thermal analysis.
出处 《电子学报》 EI CAS CSCD 北大核心 2016年第6期1300-1306,共7页 Acta Electronica Sinica
基金 国家自然科学基金(No.51331002)
关键词 算法 热分析 快速傅里叶变换 GPU并行 algorithm thermal analysis Fast Fourier transform GPU parallel computing
  • 相关文献

参考文献2

二级参考文献19

  • 1李乡儒,吴福朝,胡占义.均值漂移算法的收敛性[J].软件学报,2005,16(3):365-374. 被引量:88
  • 2彭宁嵩,杨杰,刘志,张风超.Mean-Shift跟踪算法中核函数窗宽的自动选取[J].软件学报,2005,16(9):1542-1550. 被引量:165
  • 3丁少华.中国的机器视觉底层软件现状观察[R].深圳:深圳大学城图书馆,2008.
  • 4Matrox Imaging Library (MIL) 9. 0 User's Manual [OL]. (2009-01- 13) [2009 -05-15]. http://www, matrox, corn/ imaging/media/pdf/products/mil/b_mil, pdf.
  • 5Fukunaga K, Hostetler L D. The estimation of the gradient of density function, with applications in pattern recognition [J]. IEEE Transactions on Information Theory, 1975, 21 (1) : 32-40.
  • 6Cheng Yizong. Mean Shift, mode seeking, and clustering [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1995, 17(8): 790-799.
  • 7Comaniciu D, Meer P. Mean shift: a robust approach toward feature space analysis [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(5) : 603-619.
  • 8Yang C J, Duraiswami R, Davis L. Efficient mean shift tracking via a new similarity measure [C] //Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, 2005:176-183.
  • 9Wang J, Thiesson B, Xu Y G, et al. Image and video segmentation by anisotropic kernel mean shift [C]// Proceedings of the 8th European Conference on Computer Vision, Prague, 2004:238-249.
  • 10Zhang K, Tang M, Kwok J consistency for fast clustering T. Applying neighborhood and kernel density estimation [C] //Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, 2005:1001-1007.

共引文献10

同被引文献6

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部