FAC: A Fault-Tolerant Design Approach Based on Approximate Computing

被引:0
|
作者
Balasubramanian, Padmanabhan [1 ]
Maskell, Douglas L. [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, 50 Nanyang Ave, Singapore 639798, Singapore
关键词
fault tolerance; triple modular redundancy; approximate computing; arithmetic circuits; digital logic design; low power; high-speed; CMOS; TRANSIENT FAULTS; CIRCUITS; LOGIC; ADDER; TMR;
D O I
10.3390/electronics12183819
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article introduces a new fault-tolerant design approach based on approximate computing, called FAC, for designing redundant circuits and systems. Traditionally, triple modular redundancy (TMR) has been used to ensure complete tolerance to any single fault or a faulty processing unit, where the processing unit may be a circuit or a system. However, TMR incurs more than 200% overhead in terms of area and power compared to a single processing unit. Alternative redundancy approaches have been proposed in the literature to mitigate these overheads associated with TMR, but they provide only partial or moderate fault tolerance. Among the alternatives, majority voting-based reduced precision redundancy (MVRPR) may be useful for error-resilient applications such as digital signal processing. While MVRPR guarantees only moderate fault tolerance, the proposed FAC is well-suited for error-resilient applications and ensures 100% tolerance to any single fault or a faulty processing unit, like TMR. In this work, we evaluate the performance of TMR, MVRPR, and FAC for a digital image processing application. The image processing results obtained demonstrate the effectiveness of FAC. Moreover, when the processing unit is implemented using a 28-nm CMOS technology, FAC achieves significant improvements over TMR, including a 15.3% reduction in delay, a 19.5% reduction in area, and a 24.7% reduction in power. Compared to MVRPR, FAC exhibits notable enhancements, with an 18% reduction in delay, a 5.4% reduction in area, and an 11.2% reduction in power. When considering the power-delay product, which reflects energy efficiency, FAC demonstrates a 36.2% reduction compared to TMR and a 27.2% reduction compared to MVRPR. When considering the power-delay-area product, which represents design efficiency, FAC achieves a 48.7% reduction compared to TMR and a 31.1% reduction compared to MVRPR.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] FAULT-TOLERANT COMPUTING - INTRODUCTION AND AN OVERVIEW
    RAMAMOORTHY, CV
    IEEE TRANSACTIONS ON COMPUTERS, 1971, C 20 (11) : 1241 - +
  • [32] Abstractions for fault-tolerant global computing
    Chothia, T
    Duggan, D
    THEORETICAL COMPUTER SCIENCE, 2004, 322 (03) : 567 - 613
  • [33] A short history of fault-tolerant computing
    Avizienis, Algirdas
    IT - Information Technology, 1988, 30 (03): : 162 - 168
  • [34] Immune system and fault-tolerant computing
    Xanthakis, S
    Karapoulios, S
    Pajot, R
    Rozz, A
    ARTIFICIAL EVOLUTION, 1996, 1063 : 181 - 197
  • [35] Efficient fault-tolerant quantum computing
    Steane, AM
    NATURE, 1999, 399 (6732) : 124 - 126
  • [36] Early Fault-Tolerant Quantum Computing
    Katabarwa, Amara
    Gratsea, Katerina
    Caesura, Athena
    Johnson, Peter D.
    PRX QUANTUM, 2024, 5 (02):
  • [37] FAULT-TOLERANT COMPUTING - INTRODUCTION AND A VIEWPOINT
    CARTER, WC
    IEEE TRANSACTIONS ON COMPUTERS, 1973, C 22 (03) : 225 - 229
  • [38] BIBLIOGRAPHY FOR FAULT-TOLERANT DISTRIBUTED COMPUTING
    COAN, BA
    LECTURE NOTES IN COMPUTER SCIENCE, 1990, 448 : 274 - 298
  • [39] Fault-Tolerant Nanosatellite Computing on a Budget
    Fuchs, Christian M.
    Murillo, Nadia M.
    Plaat, Aske
    van Der Kouwe, Erik
    Harsono, Daniel
    Stefanov, Todor P.
    2018 18TH EUROPEAN CONFERENCE ON RADIATION AND ITS EFFECTS ON COMPONENTS AND SYSTEMS (RADECS), 2018, : 134 - 141
  • [40] Special issue on fault-tolerant computing
    Nanya, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1997, E80D (01) : 1 - 2