A scalable framework for online power modelling of high-performance computing nodes in production

被引:3
|
作者
Pittino, Federico [1 ]
Beneventi, Francesco [1 ]
Bartolini, Andrea [1 ]
Benini, Luca [1 ,2 ]
机构
[1] Univ Bologna, Dept Elect Elect & Informat Engn DEI, Bologna, Italy
[2] Swiss Fed Inst Technol, Integrated Syst Lab, Zurich, Switzerland
来源
PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS) | 2018年
关键词
power model; HPC cluster in production; machine learning; scalable framework;
D O I
10.1109/HPCS.2018.00058
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Power and thermal design and management are critical components of high performance computing (HPC) systems, due to their cutting-edge position in terms of high power density and large total power consumption. Many HPC power management strategies rely on the availability of accurate compact power models, capable of predicting power consumption and tracking its sensitivity to workload parameters and operating points. In this paper we describe a methodology and a framework for training power models derived with two of the best-in-class procedures directly on the online in production nodes and without requiring dedicated training instances. The compact power models are obtained using an online regression-based approach which can track non-stationary workloads and hardware variability. Our experiments on a real-life HPC system demonstrate that the models achieve very high accuracy over all operating modes. We also demonstrate the scalability of our approach and the small amount of resources needed for the online modeling, for both the training and inference phases.
引用
收藏
页码:300 / 307
页数:8
相关论文
共 50 条
  • [31] ScalaTrace: Scalable compression and replay of communication traces for high-performance computing
    Noeth, Michael
    Ratn, Prasun
    Mueller, Frank
    Schulz, Martin
    de Supinski, Bronis R.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2009, 69 (08) : 696 - 710
  • [32] Mobiliti: Scalable Transportation Simulation Using High-Performance Parallel Computing
    Chan, Cy
    Wang, Bin
    Bachan, John
    Macfarlane, Jane
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 634 - 641
  • [33] Scalable deep text comprehension for Cancer surveillance on high-performance computing
    Qiu, John X.
    Yoon, Hong-Jun
    Srivastava, Kshitij
    Watson, Thomas P.
    Christian, J. Blair
    Ramanathan, Arvind
    Wu, Xiao C.
    Fearn, Paul A.
    Tourassi, Georgia D.
    BMC BIOINFORMATICS, 2018, 19
  • [34] SROdcn: Scalable and Reconfigurable Optical DCN Architecture for High-Performance Computing
    Geresu, Kassahun
    Gu, Huaxi
    Yu, Xiaoshan
    Fadhel, Meaad
    Tian, Hui
    Wei, Wenting
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2025, 13 (01) : 245 - 258
  • [35] A High-Performance and Scalable Distributed Storage and Computing System for IMS Services
    Seraoui, Youssef
    Bellafkih, Mostafa
    Raouyane, Brahim
    2016 2ND INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGIES AND APPLICATIONS (CLOUDTECH), 2016, : 335 - 342
  • [36] Scalable Embedded Systems: Towards the Convergence of High-Performance and Embedded Computing
    Giorgi, Roberto
    PROCEEDINGS IEEE/IFIP 13TH INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING 2015, 2015, : 148 - 153
  • [37] Large-Memory Nodes for Energy Efficient High-Performance Computing
    Zivanovic, Darko
    Radulovic, Milan
    Llort, German
    Zaragoza, David
    Strassburg, Janko
    Carpenter, Paul M.
    Radojkovic, Petar
    Ayguade, Eduard
    MEMSYS 2016: PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, 2016, : 3 - 9
  • [38] High-performance computing nodes for real-time parallel applications
    Carden, TC
    Dobinson, RW
    Fisher, S
    Maley, PD
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 1997, 394 (1-2): : 211 - 218
  • [39] SPRINT: Scalable Photonic Switching Fabric for High-Performance Computing (HPC)
    Neel, Brian
    Morris, Randy
    Ditomaso, Dominic
    Kodi, Avinash
    JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2012, 4 (09) : A38 - A47
  • [40] A Survey Of High-performance Computing Approaches in Power Systems
    Khaitan, Siddhartha Kumar
    2016 IEEE POWER AND ENERGY SOCIETY GENERAL MEETING (PESGM), 2016,