A scalable framework for online power modelling of high-performance computing nodes in production

被引:3
|
作者
Pittino, Federico [1 ]
Beneventi, Francesco [1 ]
Bartolini, Andrea [1 ]
Benini, Luca [1 ,2 ]
机构
[1] Univ Bologna, Dept Elect Elect & Informat Engn DEI, Bologna, Italy
[2] Swiss Fed Inst Technol, Integrated Syst Lab, Zurich, Switzerland
来源
PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS) | 2018年
关键词
power model; HPC cluster in production; machine learning; scalable framework;
D O I
10.1109/HPCS.2018.00058
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Power and thermal design and management are critical components of high performance computing (HPC) systems, due to their cutting-edge position in terms of high power density and large total power consumption. Many HPC power management strategies rely on the availability of accurate compact power models, capable of predicting power consumption and tracking its sensitivity to workload parameters and operating points. In this paper we describe a methodology and a framework for training power models derived with two of the best-in-class procedures directly on the online in production nodes and without requiring dedicated training instances. The compact power models are obtained using an online regression-based approach which can track non-stationary workloads and hardware variability. Our experiments on a real-life HPC system demonstrate that the models achieve very high accuracy over all operating modes. We also demonstrate the scalability of our approach and the small amount of resources needed for the online modeling, for both the training and inference phases.
引用
收藏
页码:300 / 307
页数:8
相关论文
共 50 条
  • [21] Power Signatures of High-Performance Computing Workloads
    Combs, Jacob
    Nazor, Jolie
    Thysell, Rachelle
    Santiago, Fabian
    Hardwick, Matthew
    Olson, Lowell
    Rivoire, Suzanne
    Hsu, Chung-Hsing
    Poole, Stephen W.
    2014 ENERGY EFFICIENT SUPERCOMPUTING WORKSHOP (E2SC), 2014, : 70 - 78
  • [22] The Paradigm of Power Bounded High-Performance Computing
    Rong Ge
    Xizhou Feng
    Pengfei Zou
    Tyler Allen
    Journal of Computer Science and Technology, 2023, 38 : 87 - 102
  • [23] Performance analysis challenges and framework for high-performance reconfigurable computing
    Koehler, Seth
    Curreri, John
    George, Alan D.
    PARALLEL COMPUTING, 2008, 34 (4-5) : 217 - 230
  • [24] High-performance, power-aware computing
    1600, IEEE Computer Society
  • [25] The Paradigm of Power Bounded High-Performance Computing
    Ge, Rong
    Feng, Xizhou
    Zou, Pengfei
    Allen, Tyler
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 38 (01) : 87 - 102
  • [26] Implementation of High-Performance Computing Technologies in the BmnRoot Framework
    Nemnyugin, S.
    Driuk, A.
    Merts, S.
    Myasnikov, A.
    Stepanova, M.
    Iufryakova, A.
    PHYSICS OF PARTICLES AND NUCLEI, 2023, 54 (04) : 656 - 659
  • [27] Implementation of High-Performance Computing Technologies in the BmnRoot Framework
    S. Nemnyugin
    A. Driuk
    S. Merts
    A. Myasnikov
    M. Stepanova
    A. Iufryakova
    Physics of Particles and Nuclei, 2023, 54 : 656 - 659
  • [28] A Grid Computing Framework for High-Performance Medical Imaging
    Manana Guichon, Gabriel
    Romero Castro, Eduardo
    IX INTERNATIONAL SEMINAR ON MEDICAL INFORMATION PROCESSING AND ANALYSIS, 2013, 8922
  • [29] NEMO A Network Monitoring Framework for High-performance Computing
    Calle, Elio Perez
    DCNET 2010/OPTICS 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA COMMUNICATION NETWORKING AND INTERNATIONAL CONFERENCE ON OPTICAL COMMUNICATION SYSTEM, 2010, : 61 - 66
  • [30] Scalable deep text comprehension for Cancer surveillance on high-performance computing
    John X. Qiu
    Hong-Jun Yoon
    Kshitij Srivastava
    Thomas P. Watson
    J. Blair Christian
    Arvind Ramanathan
    Xiao C. Wu
    Paul A. Fearn
    Georgia D. Tourassi
    BMC Bioinformatics, 19