Workload Change Point Detection for Runtime Thermal Management of Embedded Systems

被引:14
作者
Das, Anup [1 ]
Merrett, Geoff V. [1 ]
Tribastone, Mirco [1 ]
Al-Hashimi, Bashir M. [1 ]
机构
[1] Univ Southampton, Dept Elect & Comp Sci, Southampton SO17 1BJ, Hants, England
基金
英国工程与自然科学研究理事会;
关键词
Change point detection; embedded system; runtime manager (RTM); thermal optimization; POWER MANAGEMENT; TEMPERATURE; RATIO;
D O I
10.1109/TCAD.2015.2504875
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Applications executed on multicore embedded systems interact with system software [such as the operating system (OS)] and hardware, leading to widely varying thermal profiles which accelerate some aging mechanisms, reducing the lifetime reliability. Effectively managing the temperature therefore requires: 1) autonomous detection of changes in application workload and 2) appropriate selection of control levers to manage thermal profiles of these workloads. In this paper, we propose a technique for workload change detection using density ratio-based statistical divergence between overlapping sliding windows of CPU performance statistics. This is integrated in a runtime approach for thermal management, which uses reinforcement learning to select workload-specific thermal control levers by sampling on-board thermal sensors. Identified control levers override the OSs native thread allocation decision and scale hardware voltage-frequency to improve average temperature, peak temperature, and thermal cycling. The proposed approach is validated through its implementation as a hierarchical runtime manager for Linux, with heuristic-based thread affinity selected from the upper hierarchy to reduce thermal cycling and learning-based voltage-frequency selected from the lower hierarchy to reduce average and peak temperatures. Experiments conducted with mobile, embedded, and high performance applications on ARM-based embedded systems demonstrate that the proposed approach increases workload change detection accuracy by an average 3.4x, reducing the average temperature by 4 degrees C-25 degrees C, peak temperature by 6 degrees C-24 degrees C, and thermal cycling by 7%-35% over state-of-the-art approaches.
引用
收藏
页码:1358 / 1371
页数:14
相关论文
共 32 条
  • [1] Runtime Thermal Management Using Software Agents for Multi- and Many-Core Architectures
    Al Faruque, Mohammad Abdullah
    Jahn, Janmartin
    Ebi, Thomas
    Henkel, Joerg
    [J]. IEEE DESIGN & TEST OF COMPUTERS, 2010, 27 (06): : 58 - 68
  • [2] Amrouch H, 2014, ICCAD-IEEE ACM INT, P478, DOI 10.1109/ICCAD.2014.7001394
  • [3] [Anonymous], 2011, JEP122G JEDEC
  • [4] [Anonymous], 2014, P 2014 ACM INT C HAR
  • [5] Bienia Christian, 2008, 2008 IEEE International Symposium on Workload Characterization (IISWC), P47, DOI 10.1109/IISWC.2008.4636090
  • [6] Cochran R, 2010, DES AUT CON, P62
  • [7] Coskun AK, 2008, DES AUT CON, P890
  • [8] A Fast High-Level Event-Driven Thermal Estimator for Dynamic Thermal Aware Scheduling
    Cui, Jin
    Maskell, Douglas L.
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2012, 31 (06) : 904 - 917
  • [9] Das Abhishek., 2014, Electrical Engineering Congress (iEECON), 2014 International, P1, DOI DOI 10.7873/DATE.2014.115
  • [10] Reliability and Energy-Aware Mapping and Scheduling of Multimedia Applications on Multiprocessor Systems
    Das, Anup
    Kumar, Akash
    Veeravalli, Bharadwaj
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (03) : 869 - 884