改善系统能量效率的体系结构方法:并行处理被引量：5

An Efficient Architecture Method for Improving Energy Efficiency:Parallel Processing

下载PDF

导出

摘要因为对高性能微芯片和系统设计的广泛影响,能量消耗问题受到计算机界越来越广泛的关注.多个层次的技术被用于改善系统的能量效率,并行处理是体系结构层提高能量效率的主要手段.并行处理使用性能适中的计算节点减少能量消耗,使用多个节点并行执行维持高吞吐量.文中分析了并行处理提高能量效率的基本原理,给出了并行处理的时间开销和能量开销模型.基于模型分析,对低电压并行系统、动态电压调节(Dynamic Voltage Scaling,DVS)的并行系统和多核微处理器3个并行处理方向进行了展望,给出了这些并行处理方向改善能量效率的空间. Energy consumption has been paid increasing attention to in the computer domain because of its deep influence on the design of high performance chips and systems. Many techniques are proposed to improve energy efficiency of computer systems, and in the paper the author focuses on parallel processing on architecture level. Parallel processing improves energy efficiency by using some computing nodes with moderate performance, which maintain high throughput by parallel execution. In this paper, the authors present the fundamental of parallel processing improving energy efficiency, and models the time and energy overhead involved in parallel execution. Based on the models, the author investigates low voltage parallel systems, parallel systems with dynamic voltage scaling, and multi-core microprocessors, and reveals their potential of improving energy efficiency.

作者易会战刘永鹏

机构地区国防科学技术大学计算机学院计算机研究所

出处《计算机学报》 EI CSCD 北大核心 2009年第12期2475-2481,共7页 Chinese Journal of Computers

基金国家自然科学基金"软件指导的高性能计算机系统功耗和热量管理"(60903059) 国家"八六三"高技术研究发展计划项目"面向片上多处理器系统的程序设计环境"(2008AA01Z110) 国家科技重大专项(2009ZX01036-001-003-001) 高效能服务器和存储技术国家重点实验室开放基金项目(2009 HSSA04)资助

关键词并行处理能量效率动态电压调节低电压设计多核处理 parallel processing energy efficiency dynamic voltage scaling low-voltage design multi-core processing

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献11

1Hsu Chung Hsing, Feng Wu Chun. A power-aware run time system for high performance computing//Proceedings of the 2005 ACM/IENE Conference on Supercomputing. Seattle,Washington, USA, 2005:1-10.
2Ge Rong, Fen Xizhou, Cameron Kirk W. Performance-constrained distributed DVS scheduling for scientific applications on power-aware clusters//Proceedings of the 2005 ACM/ IEEE Conference on Supercomputing. Seattle, Washington, USA, 2005:34- 45.
3Rabaey J M, Chandrakasan A, Nikolic B. Digital Integrated Circuits: A Design Perspective. 2nd Edition. Beijing: Tsinghua University Press, 2004.
4Lorch J R. Operating systems techniques for reducing processor energy consumption[Ph. D. dissertation]. University of California, Berkeley, USA, 2001.
5Freeh Vincent W, Pan Feng, Kappiah Nandini, Lowenthal David K, Springer Rob. Exploring the energy-time tradeoff in MPI programs on a power-scalable cluster//Proceedings of the 19th IEEE International Parallel and Distributed Process ingSymposium(IPDPS'05). Denver, Colorado, 2005: 41- 50.
6The BlueGene/L Team. An overview of the BlueGene/L supereomputer//Proeeedings of the 2002 ACM/IEEE Conference on Supereomputing. Baltimore, USA, 2002: 1 -22.
7Fan Zhe, Qiu Feng, Kaufman Arie, Yoakum Stover Suzanne. GPU cluster for high performance computing//Proceedings of the 2004 ACM/IEEE Conference on Supercomputing. Pittsbnrgh: PA, 2004. Washington, DC: USA, 2005:47- 58.
8Luk Chi-Keung, Hong Sunpyo, Kim Hyesoon. Qilin: Exploiling parallelism on heterogeneous multiprocessors with adaptive mapping//Proceedings of the 42nd International Symposium on Microarchitecture (MICRO 42). New York, USA, 2009. LosAlamitos, CA, USA, 2002:1-10.
9Tomov Stanimire, Dongarra Jack, Baboulin Marc. Towards dense linear algebra for hybrid GPU accelerated many-core systems. Department of Computer Science, University of Tennessee, Knoxville, TN, USA.. Technical Report UTCS 08 632, 2008.
10Hwang Kai, Xu Zhiwei, Arakawa Masahiro. Benchmark evaluation of the IBM SP2 for parallel signal processing. IEEE Transactions on Parallel and Distributed Systems, 1996, 7(5): 522-536.

同被引文献37

1刘近光,梁满贵.多核多线程处理器的发展及其软件系统架构[J].微处理机,2007,28(1):1-3. 被引量：22
2卢春鹏.动态电压与频率调节在降低功耗中的作用[J].单片机与嵌入式系统应用,2007,7(5):12-14. 被引量：11
3REINDERS J. Intel threading building blocks: outfitting C++ for multi-core processor parallelism [M]. Sebastopol, CA, USA: O'Reilly Media, 2007: 133- 168.
4FRIGO M, LEISERSON C E, RANDALL K H. The implementation of the Cilk-5 multithreaded language [C] // Proceedings of ACM SIGPLAN Conference on Programming Language Design and Implementation.New York, USA. ACM, 1998: 212-223.
5BIENIA C, KUMAR S, SINGH J P, et al. The PAR- SEC benchmark suite: characterization and architectural implications [C] // Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques. New York, USA: ACM, 2008: 72-81.
6AGRAWAL K, LEISERSON C E, SUKHA J. Executing task graphs using Work-Stealing [C]//Proceedings of 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS). Piscataway, NJ, USA: IEEE, 2010: 1-12.
7AGRAWAL K, HE Y, LEISERSON C E. Adaptive work stealing with parallelism feedback [C]// Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). New York, USA:ACM, 2007: 112-120.
8HILL M, MARTY M. Amdahl's law in the multicore era [J]. Computer, 2008, 41(7):33-38.
9CHAPMAN B, HUANG Lei. Enhancing OpenMP and its implementation for programming multicore systems [M]//Parallel Computing: Architectures, Algorithms, and Applications. Amsterdam, Netherlands: IOS Press, 2008 : 3-18.
10Choi K, Soma R, Pedram M. Dynamic Voltage and Frequency Scaling Based on Workload Decomposition[C]//Proc. of Inter- national Symposium on Low Power Electronics and Design. New York, USA: ACM Press, 2004: 174-179.

引证文献5

1白俊峰,邓祖朴.多核系统下的IPSec VPN网关的研究和实现[J].计算机工程与设计,2010,31(13):2992-2995. 被引量：5
2曹仰杰,杨海兵,钱德沛,伍卫国.多核编程模型运行时环境的自适应性研究[J].西安交通大学学报,2011,45(6):130-134. 被引量：3
3易会战,罗兆成.基于动态电压调节的高性能业务系统能耗优化[J].华中科技大学学报（自然科学版）,2013,41(1):25-29. 被引量：1
4周亦敏,沈云龙,曹丽东.基于异构多核平台H.264解码的DVFS算法[J].计算机工程,2013,39(11):268-271. 被引量：4
5彭慧.基于智能化分配算法的计算机负荷并行处理技术研究[J].赤峰学院学报（自然科学版）,2015,31(2):21-23. 被引量：1

二级引证文献14

1张小波,程良伦.虚拟专用路由网络的集成研究[J].计算机工程与设计,2011,32(8):2619-2622.
2谭海.基于对象的隐式并行编程众核体系结构研究[J].计算机工程与设计,2013,34(2):623-626.
3尹淑玲.Easy VPN技术及其应用[J].信息安全与技术,2013,4(2):65-66. 被引量：3
4徐建.多核编程模型运行时环境的自适应性探讨[J].科技信息,2013(26):225-225. 被引量：1
5尹淑玲.GET VPN技术及应用[J].计算机安全,2014(1):51-53. 被引量：2
6陈红,宋长军.优先级融合动态电压频率调节的云计算任务调度算法研究[J].激光杂志,2014,35(12):112-115.
7熊永华,张因升,陈鑫,吴敏.云视频监控系统的能耗优化研究[J].软件学报,2015,26(3):680-698. 被引量：22
8邓定胜.高性能计算中一种改进的数据访问节能技术研究[J].计算机科学,2015,42(2):191-197. 被引量：1
9余娅梅.探讨计算机应用程序编程模型的发展方向[J].电脑编程技巧与维护,2015(16):16-17. 被引量：3
10王艳杰.基于改进遗传算法的电力网络负载预警模型[J].计算机仿真,2015,32(9):167-170. 被引量：2

1孙方敏.芯片的多核系统应用软件、硬件设计[J].国外科技新书评介,2011(8):18-19.
2卢敏.风河展多核虚拟化优势[J].软件世界,2009(12):84-84.
3大河蟹.让程序与多核处理密切合作[J].网友世界,2009(5):38-39.
4硬件[J].电脑爱好者,2010(9):67-67.
5PaulR.GendreauJr..便携产品设计的低电压逻辑问题[J].电子产品世界,1999,6(6):37-38.
6张翔,刘梦伦,卢志翔.基于OpenMP的互关联后继树索引创建算法优化[J].计算机光盘软件与应用,2011(8):160-161.
7微处理器:多核已成为主流[J].计算机与网络,2009,35(8):12-13.
8竹居智久,邱石（译）.软件开发将朝多核应用发展[J].电子设计应用,2006(12):45-45.
9陈利平,徐洪珍.一种基于矩阵的软件演化模型[J].湖南农机,2012(7):45-45.
10飞思卡尔八核微处理器重新定义了嵌入式多核处理的最高标准[J].半导体技术,2008,33(7):644-644.

计算机学报

2009年第12期

浏览历史

内容加载中请稍等...

改善系统能量效率的体系结构方法:并行处理被引量：5

参考文献11

同被引文献37

引证文献5

二级引证文献14

相关作者

相关机构

相关主题

浏览历史

改善系统能量效率的体系结构方法:并行处理 被引量：5

参考文献11

同被引文献37

引证文献5

二级引证文献14

相关作者

相关机构

相关主题

浏览历史

改善系统能量效率的体系结构方法:并行处理被引量：5