Evaluating fault tolerance on asymmetric multicore systems-on-chip using iso-metrics

被引:0
作者
Chalios, Charalampos [1 ]
Nikolopoulos, Dimitrios S. [1 ]
Catalan, Sandra [2 ]
Quintana-Orti, Enrique S. [2 ]
机构
[1] Queens Univ Belfast, Sch EEECS, Belfast BT7 1NN, Antrim, North Ireland
[2] Univ Jaume 1, Dept Ingn & Ciencia Comp, Castellon de La Plana, Spain
基金
英国工程与自然科学研究理事会;
关键词
system-on-chip; multiprocessing systems; fault tolerant computing; integrated circuit reliability; fault tolerance evaluation; asymmetric multicore systems-on-chip; SoC; isometrics; Dennard scaling; low power consumption; computing systems; power conservation schemes; voltage scaling; frequency scaling; performance-constrained environments; embedded processors; near-threshold voltage computing; NTVC; HPC systems; CG solver; very low voltage operation; ENERGY; PERFORMANCE;
D O I
10.1049/iet-cdt.2015.0056
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The end of Dennard scaling has promoted low power consumption into a first-order concern for computing systems. However, conventional power conservation schemes such as voltage and frequency scaling are reaching their limits when used in performance-constrained environments. New technologies are required to break the power wall while sustaining performance on future processors. Low-power embedded processors and near-threshold voltage computing (NTVC) have been proposed as viable solutions to tackle the power wall in future computing systems. Unfortunately, these technologies may also compromise per-core performance and, in the case of NTVC, reliability. These limitations would make them unsuitable for HPC systems and datacenters. To demonstrate that emerging low-power processing technologies can effectively replace conventional technologies, this study relies on ARM's big.LITTLE processors as both an actual and emulation platform, and state-of-the-art implementations of the CG solver. For NTVC in particular, the study describes how efficient algorithm-based fault tolerance schemes preserve the power and energy benefits of very low voltage operation.
引用
收藏
页码:85 / 92
页数:8
相关论文
共 10 条
  • [1] A Survey and Taxonomy of On-Chip Monitoring of Multicore Systems-on-Chip
    Kornaros, Georgios
    Pnevmatikatos, Dionisios
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2013, 18 (02)
  • [2] A Full Lifecycle Performance Verification Methodology for Multicore Systems-on-Chip
    Holt, Jim
    Dastidar, Jaideep
    Lindberg, David
    Pape, John
    Yang, Peng
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2012, 17 (03)
  • [3] Modification of Fault Injection Method via On-Chip Debugging for Processor Cores of Systems-On-Chip
    Chekmarev, S. A.
    Khanov, V. Kh
    Antamoshkin, O. A.
    2015 INTERNATIONAL SIBERIAN CONFERENCE ON CONTROL AND COMMUNICATIONS (SIBCON), 2015,
  • [4] Evaluating fault tolerance approaches in multi-agent systems
    Stankovic, Rade
    Stula, Maja
    Maras, Josip
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2017, 31 (01) : 151 - 177
  • [5] A permanent fault tolerant dynamic task allocation approach for Network-on-Chip based multicore systems
    Paul, Suraj
    Chatterjee, Navonil
    Ghosal, Prasun
    JOURNAL OF SYSTEMS ARCHITECTURE, 2019, 97 : 287 - 303
  • [6] Fault Coverage-Aware Metrics for Evaluating the Reliability Factor of Solar Tracking Systems
    Rotar, Raul
    Jurj, Sorin Liviu
    Opritoiu, Flavius
    Vladutiu, Mircea
    ENERGIES, 2021, 14 (04)
  • [7] Energy-Efficient Fault Tolerance in Chip Multiprocessors Using Critical Value Forwarding
    Subramanyan, Pramod
    Singh, Virendra
    Saluja, Kewal K.
    Larsson, Erik
    2010 IEEE-IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS DSN, 2010, : 121 - 130
  • [8] Actuator fault tolerance evaluation approach of nonlinear model predictive control systems using viability theory
    Zarch, Majid Ghaniee
    Puig, Vicenc
    Poshtan, Javad
    Shoorehdeli, Mandi Aliyari
    JOURNAL OF PROCESS CONTROL, 2018, 71 : 35 - 45
  • [9] Extending Performance and Evaluating Risks of PV Systems Failure Using a Fault Tree and Event Tree Approach: Analysis of the Possible Application
    Colli, Alessandra
    2012 38TH IEEE PHOTOVOLTAIC SPECIALISTS CONFERENCE (PVSC), 2012, : 2922 - 2926
  • [10] Fault tolerance on-chip: a reliable computing paradigm using self-test, self-diagnosis, and self-repair (3S) approach
    Li, Xiaowei
    Yan, Guihai
    Ye, Jing
    Wang, Ying
    SCIENCE CHINA-INFORMATION SCIENCES, 2018, 61 (11)