Defender: A Low Overhead and Efficient Fault-Tolerant Mechanism for Reliable on-Chip Router

被引:10
作者
Baloch, Naveed Khan [1 ]
Baig, Muhammad Iram [2 ]
Daneshtalab, Masoud [3 ]
机构
[1] Univ Engn & Technol Taxila, Comp Engn Dept, Taxila 47040, Pakistan
[2] Univ Engn & Technol Taxila, Elect Engn Dept, Taxila 47040, Pakistan
[3] Malardalen Univ, Sch Innovat Design & Engn, S-72220 Vasteras, Sweden
关键词
Circuit faults; Fault tolerance; Fault tolerant systems; Routing; Computer architecture; Switches; Network-on-Chip; router architecture; permanent fault tolerance; silicon protection factor; mean time to failure; NETWORK; ARCHITECTURE; DESIGN; CHALLENGES; LOGIC;
D O I
10.1109/ACCESS.2019.2944490
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The ever-shrinking size of a transistor has made Network on Chip (NoC) susceptible to faults. A single error in the NoC can disrupt the entire communication. In this paper, we introduce Defender, a fault-tolerant router architecture, that is capable of tolerating permanent faults in all the parts of the router. We intend to employ structural modifications in baseline router design to achieve fault tolerance. In Defender we provide the fault tolerance to the input ports and routing computation unit by grouping the neighboring ports together. Default winner strategy is used to provide fault resilience to the virtual channel arbiters and switch allocators. Multiple routes are provided to the crossbar to tolerate the faults. Defender provides improved fault tolerance to all stages of routers as compared to the currently prevailing fault tolerant router architectures. Reliability analysis using silicon protection factor (SPF) and Mean Time to Failure (MTTF) metrics confirms that our proposed design Defender is 10.78 times more reliable than baseline unprotected router and then the current state of the art architectures.
引用
收藏
页码:142843 / 142854
页数:12
相关论文
共 43 条
  • [1] GARNET: A Detailed On-Chip Network Model inside a Full-System Simulator
    Agarwal, Niket
    Krishna, Tushar
    Peh, Li-Shiuan
    Jha, Niraj K.
    [J]. ISPASS 2009: IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2009, : 33 - 42
  • [2] Barsky R, 2004, ICECS 2004: 11th IEEE International Conference on Electronics, Circuits and Systems, P121
  • [3] Networks on chips: A new SoC paradigm
    Benini, L
    De Micheli, G
    [J]. COMPUTER, 2002, 35 (01) : 70 - +
  • [4] Error control schemes for on-chip communication links: The energy-reliability tradeoff
    Bertozzi, D
    Benini, L
    De Micheli, G
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2005, 24 (06) : 818 - 831
  • [5] The PARSEC Benchmark Suite: Characterization and Architectural Implications
    Bienia, Christian
    Kumar, Sanjeev
    Singh, Jaswinder Pal
    Li, Kai
    [J]. PACT'08: PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2008, : 72 - 81
  • [6] Binkert Nathan, 2011, Computer Architecture News, V39, P1, DOI 10.1145/2024716.2024718
  • [7] MECHANISM OF NEGATIVE-BIAS-TEMPERATURE INSTABILITY
    BLAT, CE
    NICOLLIAN, EH
    POINDEXTER, EH
    [J]. JOURNAL OF APPLIED PHYSICS, 1991, 69 (03) : 1712 - 1720
  • [8] Design challenges of technology scaling
    Borkar, S
    [J]. IEEE MICRO, 1999, 19 (04) : 23 - 29
  • [9] Designing reliable systems from unreliable components: The challenges of transistor variability and degradation
    Borkar, S
    [J]. IEEE MICRO, 2005, 25 (06) : 10 - 16
  • [10] Thousand core chips-a technology perspective
    Borkar, Shekhar
    [J]. 2007 44TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2007, : 746 - 749