Abstract
Power management and control is a prominent issue in the administration of High Performance Computing (HPC) systems and distributed data centers. When the power supply of a machine room is constrained, the upper power limit of the cluster system must be controlled so that the limited power budget adapts to dynamic changes in supply capacity. To this end, this paper designs and implements a power capping control system based on RAPL. It builds a power model of the cluster system and combines RAPL's ability to limit CPU power consumption with a power difference measurement method, keeping the cluster's power within the configured cap while reducing the loss of program performance as far as possible. Experimental results show that, at a small performance cost, the system effectively reduces peak power and holds it stably within the cap.
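The abstract does not spell out the control algorithm, so the following is only a minimal sketch of how a cluster-wide cap might be turned into per-node CPU limits using a difference measurement. It assumes that each node's non-CPU power can be estimated as total node power minus CPU package power, and that the remaining budget is shared in proportion to current CPU draw. The names `NodeReading` and `cpu_caps`, the proportional split, and the clamp bounds are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class NodeReading:
    """One node's measurements for a control interval (watts)."""
    total_w: float  # whole-node power (assumed to come from an external meter)
    cpu_w: float    # CPU package power (assumed to come from RAPL counters)


def cpu_caps(readings, cluster_cap_w, min_cpu_w=20.0, max_cpu_w=120.0):
    """Split a cluster-wide power cap into per-node CPU power limits.

    Hypothetical policy: non-CPU power is the difference between total and
    CPU power; what remains under the cap is divided among the CPUs in
    proportion to their current draw, then clamped to [min_cpu_w, max_cpu_w].
    """
    non_cpu_w = sum(r.total_w - r.cpu_w for r in readings)
    cpu_budget_w = max(cluster_cap_w - non_cpu_w, 0.0)
    cpu_total_w = sum(r.cpu_w for r in readings)
    caps = []
    for r in readings:
        # Fall back to an even split if no CPU power was measured.
        share = r.cpu_w / cpu_total_w if cpu_total_w > 0 else 1.0 / len(readings)
        caps.append(min(max(cpu_budget_w * share, min_cpu_w), max_cpu_w))
    return caps


readings = [NodeReading(total_w=200.0, cpu_w=100.0),
            NodeReading(total_w=150.0, cpu_w=50.0)]
caps = cpu_caps(readings, cluster_cap_w=300.0)
```

Because of the lower clamp, the sum of the returned limits can exceed the CPU budget when the cap is very tight; a real controller would need a policy for that case. On Linux systems that expose the powercap interface, a limit like this is typically applied by writing a value in microwatts to a file such as `/sys/class/powercap/intel-rapl:0/constraint_0_power_limit_uw`, which is left out here so the sketch runs without special privileges.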
Source
《计算机工程》
CAS
CSCD
Peking University Core Journals (北大核心)
2017, Issue 5, pp. 40-46 (7 pages)
Computer Engineering
Funding
Major Project of the National "863" Program of China (2012AA01A302)
Keywords
High Performance Computing (HPC)
distributed data center
peak power
power capping
difference measurement
RAPL technology