热导之间相互作用的单循环迭代热分析算法被引量：1

TPG-Sli: Single-Loop Iterative Thermal Analysis Algorithm Considering Interactions Among Temperature,Power and Heat Conductance

下载PDF

导出

摘要随着纳米工艺的不断改进,温度对漏电流功耗和热导的影响日益显著.考虑温度/功耗/热导相互作用的3D芯片热分析需要采用迭代方法对温度进行精确求解,即先用功耗密度向量和热导矩阵来求解温度向量,再用求解出来的温度向量来刷新功耗密度向量和热导矩阵.为了提高3D芯片热分析的效率,本文以一个设定温度值下的均匀热导矩阵作为预条件,先提出了一种双循环、内循环低迭代次数的高效求解算法TPG-FTCG.鉴于TPG-FTCG具有超快的内循环收敛速度,本文省去了TPG-FTCG算法的内循环部分,提出了一种单循环、低迭代次数的TPG求解算法TPG-Sli.基于GPU(Graphics Processing Unit)并行加速技术,本文编写并改进了TPG-Sli的GPU加速算法.实验数据表明:与采用经典高效的ICCG算法进行3D芯片热分析的TPG-ICCG算法相比,在足够小的误差范围内,TPG-Sli的GPU加速算法可以获得120倍的速度提升. With the improvement of the nanometer technology,the influences among temperature,leakage power and heat conductance become increasingly significant and it should be taken into account in 3 D chip comprehensive thermal anal-ysis to solve the accurate temperature based on the iterative solution.The comprehensive thermal analysis method uses the nodal power density vector and the heat conductance matrix to solve the nodal temperature vector,and then,refreshes power density and heat conductance with the obtained nodal temperature.In order to improve the efficiency of 3D chip comprehen-sive thermal analysis,this work uses the heat conductance matrix as the precondition under a setting temperature.Then it pro-poses an efficient algorithm TPG-FTCG（CG with the Fast Transform-based Preconditioner）which has double-loop and low-er inner-loop iterations.According to TPG-FTCG’s fast inner-loop convergence rate,this work removes TPG-FTCG’s in-ner-loop part then proposes a more efficient TPG solving algorithm TPG-Sli（Single-loop iterative）,which only has single-loop iterative and fewer iterations.Based on the GPU parallel computing,this work compiles and refines TPG-Sli’s GPU-parallel-computing algorithm.Experimental results demonstrate that：On the premise of precision losing,the TPG-Sli’s GPU algorithm can achieve about 120X speedup compared with the TPG-ICCG algorithm,which uses the classical and efficient ICCG to deal with the 3 D chip comprehensive thermal analysis.

作者潘月斗王嘉琪唐亮骆祖莹

机构地区北京科技大学自动化学院北京科技大学钢铁流程先进控制教育部重点实验室北京师范大学信息科学与技术学院

出处《电子学报》 EI CAS CSCD 北大核心 2016年第6期1300-1306,共7页 Acta Electronica Sinica

基金国家自然科学基金(No.51331002)

关键词算法热分析快速傅里叶变换 GPU并行 algorithm thermal analysis Fast Fourier transform GPU parallel computing

分类号 TP393 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1陈加,吴晓军,蔡荣.GPU并行加速的均值偏移算法[J].计算机辅助设计与图形学学报,2010,22(3):461-466. 被引量：6
2闫佳琪,骆祖莹,唐亮,赵国兴.考虑温度对漏电流功耗影响的MPSoC结构级热分析方法[J].计算机辅助设计与图形学学报,2013,25(11):1767-1774. 被引量：6

二级参考文献19

1李乡儒,吴福朝,胡占义.均值漂移算法的收敛性[J].软件学报,2005,16(3):365-374. 被引量：88
2彭宁嵩,杨杰,刘志,张风超.Mean-Shift跟踪算法中核函数窗宽的自动选取[J].软件学报,2005,16(9):1542-1550. 被引量：165
3丁少华.中国的机器视觉底层软件现状观察[R].深圳:深圳大学城图书馆,2008.
4Matrox Imaging Library (MIL) 9. 0 User's Manual [OL]. (2009-01- 13) [2009 -05-15]. http://www, matrox, corn/ imaging/media/pdf/products/mil/b_mil, pdf.
5Fukunaga K, Hostetler L D. The estimation of the gradient of density function, with applications in pattern recognition [J]. IEEE Transactions on Information Theory, 1975, 21 (1) : 32-40.
6Cheng Yizong. Mean Shift, mode seeking, and clustering [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1995, 17(8): 790-799.
7Comaniciu D, Meer P. Mean shift: a robust approach toward feature space analysis [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(5) : 603-619.
8Yang C J, Duraiswami R, Davis L. Efficient mean shift tracking via a new similarity measure [C] //Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, 2005:176-183.
9Wang J, Thiesson B, Xu Y G, et al. Image and video segmentation by anisotropic kernel mean shift [C]// Proceedings of the 8th European Conference on Computer Vision, Prague, 2004:238-249.
10Zhang K, Tang M, Kwok J consistency for fast clustering T. Applying neighborhood and kernel density estimation [C] //Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, 2005:1001-1007.

共引文献10

1刘晶,朱清香,梅群,张蕾.一种基于单处理机的并行关联规则算法及其在数字图书馆中的应用[J].图书情报工作,2011,55(7):114-117. 被引量：7
2周伟,施宁,王健,汪群山.基于GPU-CPU流水线的雷达回波快速聚类[J].微电子学与计算机,2012,29(4):71-75.
3周伟,安虹,刘谷,李小强,吴石磊.一种输入感知的雷达回波快速聚类实现[J].计算机科学,2012,39(12):295-299.
4孙晓鹏,荣丹.三维网格的Mean Shift并行分割算法[J].计算机工程与设计,2013,34(1):230-234.
5田立,周付根,孟偲,白相志,金挺.互相关跟踪算法的多核DSP快速实现[J].高技术通讯,2013,23(12):1248-1253. 被引量：2
6李晓怡,骆岩林,骆祖莹.基于CUDA的三维芯片温度场实时可视化[J].计算机仿真,2015,32(8):289-293. 被引量：2
7李钊,李业德,吴兴华.基于动态功耗的流水线优化方法研究[J].仪器仪表学报,2016,37(5):1058-1064.
8魏琳,周磊,吴宁,杨睛.多处理器片上系统中一种结合二阶导数的温度预测模型[J].电子学报,2016,44(6):1272-1278. 被引量：1
9褚新建,宋东亚.一种考虑漏电流最低损耗的控制器电子模块设计与实现[J].现代电子技术,2016,39(22):138-141.
10李博夫,段士华,韩顺枫,李大猛,杨宝斌,张娜,李德建.工业级芯片热管理与漏电流阈值设定技术[J].半导体技术,2024,49(10):940-945.

同被引文献6

1张梁娟,钱吉裕,魏涛,孔祥举.微波功率组件基板热阻研究[J].电子机械工程,2012,28(6):5-7. 被引量：2
2曾海,郭震宁,陈俄振,胡治伟.矩形基板上LED芯片阵列热分析[J].华侨大学学报（自然科学版）,2013,34(3):267-273. 被引量：3
3郭怀新,韩平,陈堂胜.基于GaN功率器件的热仿真技术研究[J].固体电子学研究与进展,2017,37(3):176-181. 被引量：2
4翟玉卫,郑世棋,刘岩,梁法国.半导体器件用显微红外热成像技术原理及应用[J].计测技术,2018,38(6):53-60. 被引量：9
5张琦,蔡志匡,王子轩,孙海燕,郭宇锋.一种基于热阻网络的叠层芯片结温预测模型[J].固体电子学研究与进展,2020,40(1):66-70. 被引量：7
6许春良,杨卅男,万悦.W波段5W GaN四路合成功率放大器MMIC[J].半导体技术,2022,47(5):391-396. 被引量：1

引证文献1

1郜佳佳,游恒果,李静强,舒国富.GaN功率放大器MMIC的近结区热阻解析模型[J].半导体技术,2024,49(4):380-387.

1许振新.笔记本并不神秘[J].中国计算机用户,2002(36):59-59.
2王海军,张南平.基于数据驱动的单循环排序算法及其优化[J].微计算机应用,2006,27(2):213-214.
3三一.三一集团总裁唐修国:“互联网+工业化”铸就“新三一”[J].工程机械,2016,47(7).
4曾红.一种极速排序算法[J].无线互联科技,2012,9(1):81-81.
5李明.微珠状热导气敏传感器[J].仪表技术与传感器,1990(3):26-30.
6ST针对U盘推出双核心控制器芯片ST72681[J].电子测试（新电子）,2005(7):83-83.
7谢如萍.3D芯片简介[J].电子测试,2000,13(3):72-73.
8读语.部分3D芯片性能比较[J].微型计算机,1997(6):41-41.
9蔡家盛.3D芯片的现在与未来[J].电子测试,2000(10):190-191.
10秦笃烈.3D硬件市场开始形成[J].多媒体世界,1996(9):23-23.

电子学报

2016年第6期

浏览历史

内容加载中请稍等...

考虑温度/功耗/热导之间相互作用的单循环迭代热分析算法被引量：1

参考文献2

二级参考文献19

共引文献10

同被引文献6

引证文献1

相关作者

相关机构

相关主题

浏览历史

考虑温度/功耗/热导之间相互作用的单循环迭代热分析算法 被引量：1

参考文献2

二级参考文献19

共引文献10

同被引文献6

引证文献1

相关作者

相关机构

相关主题

浏览历史

考虑温度/功耗/热导之间相互作用的单循环迭代热分析算法被引量：1