Online Power Estimation of Graphics Processing Units

被引:10
作者
Adhinarayanan, Vignesh [1 ]
Subramaniam, Balaji [2 ]
Feng, Wu-chun [1 ]
机构
[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
[2] Argonne Natl Lab, Math & Comp Sci Div, Argonne, IL 60439 USA
来源
2016 16TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID) | 2016年
关键词
D O I
10.1109/CCGrid.2016.93
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Accurate power estimation at runtime is essential for the efficient functioning of a power management system. While years of research have yielded accurate power models for the online prediction of instantaneous power for CPUs, such power models for graphics processing units (GPUs) are lacking. GPUs rely on low-resolution power meters that only nominally support basic power management. To address this, we propose an instantaneous power model, and in turn, a power estimator, that uses performance counters in a novel way so as to deliver accurate power estimation at runtime. Our power estimator runs on two real NVIDIA GPUs to show that accurate runtime estimation is possible without the need for the high-fidelity details that are assumed on simulation-based power models. To construct our power model, we first use correlation analysis to identify a concise set of performance counters that work well despite GPU device limitations. Next, we explore several statistical regression techniques and identify the best one. Then, to improve the prediction accuracy, we propose a novel application-dependent modeling technique, where the model is constructed online at runtime, based on the readings from a low-resolution, built-in GPU power meter. Our quantitative results show that a multi-linear model, which produces a mean absolute error of 6%, works the best in practice. An application-specific quadratic model reduces the error to nearly 1%. We show that this model can be constructed with low overhead and high accuracy at runtime. To the best of our knowledge, this is the first work attempting to model the instantaneous power of a real GPU system; earlier related work focused on average power.
引用
收藏
页码:245 / 254
页数:10
相关论文
共 33 条
  • [1] Power and Performance Characterization and Modeling of GPU-Accelerated Systems
    Abe, Yuki
    Inoue, Koji
    Sasaki, Hiroshi
    Edahiro, Masato
    Kato, Shinpei
    Peres, Martin
    [J]. 2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
  • [2] [Anonymous], 2015, Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, DOI DOI 10.1145/2749246.2749262
  • [3] [Anonymous], 2000, SER EW
  • [4] [Anonymous], P WORKSH POW AW COMP
  • [5] [Anonymous], TECH REP
  • [6] [Anonymous], 2013, SIGARCH Comput. Archit. News, DOI [DOI 10.1145/2508148.2485964, 10.1145/2508148.2485964, DOI 10.1145/2485922]
  • [7] [Anonymous], P 2008 WORKSH POW AW
  • [8] [Anonymous], CUPTI US GUID
  • [9] [Anonymous], TECH REP
  • [10] [Anonymous], NVML API REF MAN