The DeSyRe Runtime support for Fault-tolerant Embedded MPSoCs

被引:5
|
作者
Theodoropoulos, Dimitris [1 ]
Pnevmatikatos, Dionisios [1 ]
Tzilis, Stavros [2 ]
Sourdis, Ioannis [2 ]
机构
[1] Fdn Res & Technol Hellas FORTH, Inst Comp Sci, Comp Architecture & VLSI Syst Lab, Iraklion, Greece
[2] Chalmers Univ Technol, Dept Comp Sci & Engn, S-41296 Gothenburg, Sweden
来源
2014 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA) | 2014年
关键词
runtime support; embedded MPSoCs; FPGAs; SOFTWARE;
D O I
10.1109/ISPA.2014.34
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Semiconductor technology scaling makes chips more sensitive to faults. This paper describes the DeSyRe design approach and its runtime management for future reliable embedded Multiprocessor Systems-on-Chip (MPSoCs). A lightweight runtime system is described for shared-memory MPSoCs to support fault-tolerant execution upon detection of transient and permanent faults. The DeSyRe runtime system offers reexecution of tasks that suffer from transient faults and taskmigration in cases where a worker processor is permanently faulty. In addition, a faulty worker can potentially remain usable, increasing systems fault-tolerance. This is achieved using alternative task implementations, which avoid the faulty circuit and are indicated in the application-code via pragma annotations, as well as by repairing a faulty core via hardware reconfiguration. Thereby, the system can be dynamically adapted using one or multiple of the above mechanisms to mitigate faults. The DeSyRe runtime system is evaluated using micro-benchmarks running on a Virtex-6 FPGA MPSoC. Results suggest that our enhanced fault-tolerant runtime system can successfully and efficiently execute all application tasks under a variety of fault cases.
引用
收藏
页码:197 / 204
页数:8
相关论文
共 50 条
  • [1] Efficient Runtime Support for Embedded MPSoCs
    Theodoropoulos, Dimitris
    Pratikakis, Polyvios
    Pnevmatikatos, Dionisios
    2013 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING AND SIMULATION (IC-SAMOS), 2013, : 164 - 171
  • [2] Embedded fault-tolerant systems
    Avresky, DR
    Grosspietsch, KE
    Johnson, BW
    Lombardi, F
    IEEE MICRO, 1998, 18 (05) : 8 - 11
  • [3] Fault-tolerant embedded systems
    Avresky, DR
    Lombardi, F
    Grosspietsch, KE
    Johnson, BW
    IEEE MICRO, 2001, 21 (05) : 12 - 15
  • [4] Challenges in Fault-Tolerant Distributed Runtime Verification
    Bonakdarpour, Borzoo
    Fraigniaud, Pierre
    Rajsbaum, Sergio
    Travers, Corentin
    LEVERAGING APPLICATIONS OF FORMAL METHODS, VERIFICATION AND VALIDATION: DISCUSSION, DISSEMINATION, APPLICATIONS, ISOLA 2016, PT II, 2016, 9953 : 363 - 370
  • [6] An Optimized Weighted Average Makespan in Fault-Tolerant Heterogeneous MPSoCs
    Youness, Hassan
    Omar, Aly
    Moness, Mohamed
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (08) : 1933 - 1946
  • [7] Smart Reconfiguration Approach for Fault-Tolerant NoC Based MPSoCs
    Silveira, Jarbas
    Cortez, Paulo
    Cadore, Alan
    Mota, Rafael
    Marcon, Cesar
    Brahm, Lucas
    Fernandes, Ramon
    2015 28TH SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN (SBCCI), 2015,
  • [8] Fault-tolerant communication in embedded supercomputing
    Efthivoulidis, G
    Verentziotis, EA
    Meliones, AN
    Varvarigou, TA
    Kontizas, A
    Deconinck, G
    De Florio, V
    IEEE MICRO, 1998, 18 (05) : 42 - 52
  • [9] Synthesis of fault-tolerant embedded systems
    Eles, Petru
    Izosimov, Viacheslav
    Pop, Paul
    Peng, Zebo
    2008 DESIGN, AUTOMATION AND TEST IN EUROPE, VOLS 1-3, 2008, : 960 - +
  • [10] A fault-tolerant embedded microcontroller testbed
    Rennels, DA
    Caldwell, DW
    Hwang, RK
    Mesarina, M
    PACIFIC RIM INTERNATIONAL SYMPOSIUM ON FAULT-TOLERANT SYSTEMS, PROCEEDINGS, 1997, : 7 - 14