Reliability-Aware Scheduling on Heterogeneous Multicore Processors

Cited by: 23
Authors
Naithani, Ajeya [1]
Eyerman, Stijn [2]
Eeckhout, Lieven [1]
Affiliations
[1] Univ Ghent, Ghent, Belgium
[2] Intel, Brussels, Belgium
Funding
European Research Council
Keywords
VULNERABILITY FACTORS; CORES;
DOI
10.1109/HPCA.2017.12
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Reliability to soft errors is an increasingly important issue as technology continues to shrink. In this paper, we show that applications exhibit different reliability characteristics on big, high-performance cores versus small, power-efficient cores, and that there is significant opportunity to improve system reliability through reliability-aware scheduling on heterogeneous multicore processors. We monitor the reliability characteristics of all running applications, and dynamically schedule applications to the different core types in a heterogeneous multicore to maximize system reliability. Reliability-aware scheduling improves reliability by 25.4% on average (and up to 60.2%) compared to performance-optimized scheduling on a heterogeneous multicore processor with two big cores and two small cores, while degrading performance by 6.3% only. We also introduce a novel system-level reliability metric for multiprogram workloads on (heterogeneous) multicores. We further show that our reliability-aware scheduler is robust across core count, number of big and small cores, and their frequency settings. The hardware cost in support of our reliability-aware scheduler is limited to 296 bytes per core.
Pages: 397-408
Number of pages: 12