On Providing Scalable Self-healing Adaptive Fault-tolerance to RTR SoCs

Cited by: 0
Authors
Navas, Byron [1 ,2 ]
Oberg, Johnny [1 ]
Sander, Ingo [1 ]
Affiliations
[1] KTH Royal Inst Technol, Dept Elect Syst, Stockholm, Sweden
[2] ESPE Univ Fuerzas Armadas, Dept Elect & Elect Engn, Sangolqui, Ecuador
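Source
2014 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG) | 2014
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract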
The dependability of heterogeneous many-core FPGA-based systems is threatened by higher failure rates caused by disruptive scales of integration, increased design complexity, and radiation sensitivity. Triple-modular redundancy (TMR) and run-time reconfiguration (RTR) are traditional fault-tolerant (FT) techniques used to increase dependability. However, hardware redundancy is expensive, and most approaches have poor scalability, flexibility, and programmability. Therefore, innovative solutions are needed to reduce the cost of redundancy while preserving acceptable levels of dependability. In this context, this paper presents the implementation of a self-healing adaptive fault-tolerant SoC that reuses RTR IP-cores to self-assemble different TMR schemes at run-time. The presented system demonstrates the feasibility of the Upset-Fault-Observer concept, which provides a run-time self-test and recovery strategy that delivers fault tolerance for functions accelerated in RTR cores while reducing the scalability cost of redundancy by running periodic reconfigurable TMR scan-cycles. In addition, this paper experimentally evaluates the trade-offs of the implemented reconfigurable TMR schemes by characterizing important fault-tolerance metrics, i.e., recovery time (self-repair and self-replicate), detection latency, self-assembly latency, throughput reduction, and increase in physical resources.
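The idea of a periodic TMR scan-cycle can be illustrated with a small software model. This is only a sketch under stated assumptions, not the implementation from the paper: cores are modeled as plain Python callables, the two "spare" replicas are assumed to be loaded with a known-good copy of the function under test, and repair is modeled as replacing the faulty callable. The names `majority_vote`, `scan_cycle`, `cores`, and `golden` are hypothetical.

```python
from collections import Counter
from typing import Callable, List, Optional, Sequence, Tuple

Core = Callable[[int], int]


def majority_vote(outputs: Sequence[int]) -> Tuple[Optional[int], List[int]]:
    """Return (voted value, indices of dissenting replicas).

    The voted value is None when no strict majority exists; in that
    case every replica is reported as a dissenter.
    """
    value, count = Counter(outputs).most_common(1)[0]
    if count * 2 <= len(outputs):
        return None, list(range(len(outputs)))
    return value, [i for i, out in enumerate(outputs) if out != value]


def scan_cycle(cores: List[Core], golden: Sequence[Core], stimulus: int) -> List[int]:
    """Visit each core once and temporarily place it in a TMR group.

    Assumption: two spare slots are configured with golden[i], a
    known-good copy of the function that cores[i] should compute.
    A core that loses the vote is "repaired" by replacing it with
    the golden copy, which stands in for reconfiguration here.
    Returns the indices of the cores that were repaired.
    """
    repaired = []
    for i, core in enumerate(cores):
        outputs = [core(stimulus), golden[i](stimulus), golden[i](stimulus)]
        _, dissenters = majority_vote(outputs)
        if 0 in dissenters:  # replica 0 is the core under test
            cores[i] = golden[i]
            repaired.append(i)
    return repaired


if __name__ == "__main__":
    golden = [lambda x: x + 1, lambda x: x * 2, lambda x: x - 3]
    # Core 1 is modeled with a single flipped output bit.
    cores = [golden[0], lambda x: (x * 2) ^ 1, golden[2]]
    print("repaired cores:", scan_cycle(cores, golden, stimulus=5))
```

In this model only one core at a time is triplicated, so the redundancy overhead stays at two spare slots regardless of how many cores are scanned; the price is that a fault is detected only when the scan reaches the affected core.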
Pages: 6