The DeSyRe Runtime support for Fault-tolerant Embedded MPSoCs

被引:5
|
作者
Theodoropoulos, Dimitris [1 ]
Pnevmatikatos, Dionisios [1 ]
Tzilis, Stavros [2 ]
Sourdis, Ioannis [2 ]
机构
[1] Fdn Res & Technol Hellas FORTH, Inst Comp Sci, Comp Architecture & VLSI Syst Lab, Iraklion, Greece
[2] Chalmers Univ Technol, Dept Comp Sci & Engn, S-41296 Gothenburg, Sweden
来源
2014 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA) | 2014年
关键词
runtime support; embedded MPSoCs; FPGAs; SOFTWARE;
D O I
10.1109/ISPA.2014.34
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Semiconductor technology scaling makes chips more sensitive to faults. This paper describes the DeSyRe design approach and its runtime management for future reliable embedded Multiprocessor Systems-on-Chip (MPSoCs). A lightweight runtime system is described for shared-memory MPSoCs to support fault-tolerant execution upon detection of transient and permanent faults. The DeSyRe runtime system offers reexecution of tasks that suffer from transient faults and taskmigration in cases where a worker processor is permanently faulty. In addition, a faulty worker can potentially remain usable, increasing systems fault-tolerance. This is achieved using alternative task implementations, which avoid the faulty circuit and are indicated in the application-code via pragma annotations, as well as by repairing a faulty core via hardware reconfiguration. Thereby, the system can be dynamically adapted using one or multiple of the above mechanisms to mitigate faults. The DeSyRe runtime system is evaluated using micro-benchmarks running on a Virtex-6 FPGA MPSoC. Results suggest that our enhanced fault-tolerant runtime system can successfully and efficiently execute all application tasks under a variety of fault cases.
引用
收藏
页码:197 / 204
页数:8
相关论文
共 50 条
  • [11] Embedded fault-tolerant systems - Introduction
    Avresky, DR
    Johnson, BW
    Lombardi, FF
    JOURNAL OF SUPERCOMPUTING, 2000, 16 (01): : 5 - 6
  • [12] Static Mapping of Mixed-Critical Applications for Fault-Tolerant MPSoCs
    Kang, Shin-haeng
    Yang, Hoeseok
    Kim, Sungchan
    Bacivarov, Iuliana
    Ha, Soonhoi
    Thiele, Lothar
    2014 51ST ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2014,
  • [13] Design and Validation of Fault-tolerant Embedded Controllers
    Ghosh, Saurav Kumar
    Dey, Soumyajjit
    Goswami, Dip
    Mueller-Gritschneder, Daniel
    Chakraborty, Samar
    PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 1283 - 1288
  • [14] Towards Fault-Tolerant Embedded Systems with Imperfect Fault Detection
    Huang, Jia
    Huang, Kai
    Raabe, Andreas
    Buckl, Christian
    Knoll, Alois
    2012 49TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2012, : 188 - 196
  • [15] Scenario preprocessing approach for the reconfiguration of fault-tolerant NoC-based MPSoCs
    Silveira, Jarbas
    Marcon, Cesar
    Cortez, Paulo
    Barroso, Giovanni
    Ferreira, Joao M.
    Mota, Rafael
    MICROPROCESSORS AND MICROSYSTEMS, 2016, 40 : 137 - 153
  • [16] Self-healing network for scalable fault-tolerant runtime environments
    Angskun, Thara
    Fagg, Graham
    Bosilca, George
    Pjesivac-Grbovic, Jelena
    Dongarra, Jack
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2010, 26 (03): : 479 - 485
  • [17] A runtime fault-tolerant routing algorithm based on region flooding in NoCs
    Wang, Lu
    Ma, Sheng
    Wang, Zhiying
    MICROPROCESSORS AND MICROSYSTEMS, 2016, 45 : 198 - 207
  • [18] Fault-tolerant meshes and tori embedded in a faulty supercube
    Lin, Jen-Chih
    Wu, Shih-Jung
    Keh, Huan-Chao
    Wang, Lu
    WSEAS Transactions on Computers, 2010, 9 (05): : 445 - 454
  • [19] Hardware support for CAN fault-tolerant communication
    Rufino, Jose
    Pedrosa, Nuno
    Monteiro, Jose
    Verissimo, Paulo
    Arroz, Guilherme
    Proceedings of the IEEE International Conference on Electronics, Circuits, and Systems, 1998, 1 : 263 - 266
  • [20] Fault-tolerant distributed deployment of embedded control software
    Pinello, Claudio
    Carloni, Luca P.
    Sangiovanni-Vincentelli, Alberto L.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2008, 27 (05) : 906 - 919