ControlPULP: A RISC-V On-Chip Parallel Power Controller for Many-Core HPC Processors with FPGA-Based Hardware-In-The-Loop Power and Thermal Emulation

被引:0
作者
Alessandro Ottaviano
Robert Balas
Giovanni Bambini
Antonio Del Vecchio
Maicol Ciani
Davide Rossi
Luca Benini
Andrea Bartolini
机构
[1] ETH Zürich,Integrated Systems Laboratory
[2] University of Bologna,DEI
来源
International Journal of Parallel Programming | 2024年 / 52卷
关键词
RISC-V; HPC processor; Power and thermal control; Scalable; Parallel microcontroller; PULP;
D O I
暂无
中图分类号
学科分类号
摘要
High-performance computing (HPC) processors are nowadays integrated cyber-physical systems demanding complex and high-bandwidth closed-loop power and thermal control strategies. To efficiently satisfy real-time multi-input multi-output (MIMO) optimal power requirements, high-end processors integrate an on-die power controller system (PCS). While traditional PCSs are based on a simple microcontroller (MCU)-class core, more scalable and flexible PCS architectures are required to support advanced MIMO control algorithms for managing the ever-increasing number of cores, power states, and process, voltage, and temperature variability. This paper presents ControlPULP, an open-source, HW/SW RISC-V parallel PCS platform consisting of a single-core MCU with fast interrupt handling coupled with a scalable multi-core programmable cluster accelerator and a specialized DMA engine for the parallel acceleration of real-time power management policies. ControlPULP relies on FreeRTOS to schedule a reactive power control firmware (PCF) application layer. We demonstrate ControlPULP in a power management use-case targeting a next-generation 72-core HPC processor. We first show that the multi-core cluster accelerates the PCF, achieving 4.9x speedup compared to single-core execution, enabling more advanced power management algorithms within the control hyper-period at a shallow area overhead, about 0.1% the area of a modern HPC CPU die. We then assess the PCS and PCF by designing an FPGA-based, closed-loop emulation framework that leverages the heterogeneous SoCs paradigm, achieving DVFS tracking with a mean deviation within 3% the plant’s thermal design power (TDP) against a software-equivalent model-in-the-loop approach. Finally, we show that the proposed PCF compares favorably with an industry-grade control algorithm under computational-intensive workloads.
引用
收藏
页码:93 / 123
页数:30
相关论文
共 60 条
[1]  
Leiserson CE(2020)There’s plenty of room at the top: What will drive computer performance after Moore’s law? Science 368 eaam9744-695
[2]  
Thompson NC(2022)A two-layer distributed mpc approach to thermal control of multiprocessor systems-on-chip Control Eng. Pract. 1 682-27
[3]  
Emer JS(2020)COUNTDOWN: a run-time library for performance-neutral energy saving in MPI applications IEEE Trans. Comput. 32 20-183
[4]  
Kuszmaul BC(2012)Power-management architecture of the intel microarchitecture code-named Sandy Bridge IEEE Micro 24 170-964
[5]  
Lampson BW(2013)Thermal and energy management of high-performance multicores: distributed and self-calibrating model-predictive controller IEEE Trans. Parallel Distrib. Syst. 40 943-66
[6]  
Sanchez D(2010)A survey of the research on power management techniques for high-performance systems Softw. Pract. Exp. 14 50-143
[7]  
Schardl TB(2010)Energy-efficient computing: power management system on the Nehalem family of processors Intel Technol. J. 54 133-1053
[8]  
Tilli A(2019)“Zeppelin”: an SoC for multichip architectures IEEE J. Solid-State Circuits 33 1038-4382
[9]  
Garone E(2022)A low-power transprecision floating-point cluster for efficient near-sensor data analytics IEEE Trans. Parallel Distrib. Syst. 33 4368-1110
[10]  
Conficoni C(2022)HEROv2: full-stack open-source research platform for heterogeneous computing IEEE Trans. Parallel Distrib. Syst. 63 1097-undefined