Dynamic Triple Modular Redundancy in Interleaved Hardware Threads: An Alternative Solution to Lockstep Multi-Cores for Fault-Tolerant Systems

被引:2
作者
Barbirotta, Marcello [1 ]
Menichelli, Francesco [1 ]
Cheikh, Abdallah [1 ]
Mastrandrea, Antonio [1 ]
Angioli, Marco [1 ]
Olivieri, Mauro [1 ]
机构
[1] Sapienza Univ Rome, Dept Informat Engn Elect & Telecommun DIET, I-00184 Rome, Italy
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Termination of employment; Hardware; Circuit faults; Fault tolerant systems; Computer architecture; Registers; Digital integrated circuits; Field programmable gate arrays; Microprocessors; Radiation hardening (electronics); Redundancy; digital integrated circuits; fault detection; fault tolerant computing; field programmable gate arrays; microprocessors; multithreading; radiation hardening (electronics); redundancy; robustness; DESIGN;
D O I
10.1109/ACCESS.2024.3425579
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Over the years, significant work has been done on high-integrity systems, such as those found in cars, satellites and aircrafts, to minimize the risk that a logic fault causes a system failure, thus having functional safety as a key requirement. In this study, we employ an innovative approach to harness the benefits of both Dual Modular Redundancy and Triple Modular Redundancy techniques within an Interleaved-Multi-Threading microprocessor core, by means of a microarchitecture design capable of dynamically switching from Dual Modular Redundancy to Triple Modular Redundancy in case of faults. We explain the quantitative results obtained from an extensive fault injection simulation campaign on the fault tolerant core compared with its previous version regarding fault tolerant capabilities. The results show that in several application cases the fault resilience improvement and the hardware and timing overhead are better compared to the lockstep-based dual core approach. The proposed technique achieves 98,6% fault mitigation at the expense of only 4 clock cycles for roll-back overhead, with no checkpointing redundancy.
引用
收藏
页码:95720 / 95735
页数:16
相关论文
共 41 条
  • [1] A New Mitigation Approach for Soft Errors in Embedded Processors
    Abate, Francesco
    Sterpone, Luca
    Violante, Massimo
    [J]. IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2008, 55 (04) : 2063 - 2069
  • [2] Abbas Zia, 2015, 2015 4th International Conference on Reliability, Infocom Technologies and Optimization (ICRITO) (Trends and Future Directions), P1, DOI 10.1109/ICRITO.2015.7359366
  • [3] Applying Lightweight Soft Error Mitigation Techniques to Embedded Mixed Precision Deep Neural Networks
    Abich, Geancarlo
    Gava, Jonas
    Garibotti, Rafael
    Reis, Ricardo
    Ost, Luciano
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2021, 68 (11) : 4772 - 4782
  • [4] Analysis of a Fault Tolerant Edge-Computing Microarchitecture Exploiting Vector Acceleration
    Barbirotta, Marcello
    Cheikh, Abdallah
    Mastrandrea, Antonio
    Menichelli, Francesco
    Olivieri, Mauro
    [J]. PRIME 2022: 17TH INTERNATIONAL CONFERENCE ON PHD RESEARCH IN MICROELECTRONICS AND ELECTRONICS, 2022, : 153 - 156
  • [5] Fault resilience analysis of a RISC-V microprocessor design through a dedicated UVM environment
    Barbirotta, Marcello
    Mastrandrea, Antonio
    Menichelli, Francesco
    Vigli, Francesco
    Blasi, Luigi
    Cheikh, Abdallah
    Sordillo, Stefano
    Di Gennaro, Fabio
    Olivieri, Mauro
    [J]. 2020 33RD IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFT), 2020,
  • [6] Evaluation of Dynamic Triple Modular Redundancy in an Interleaved-Multi-Threading RISC-V Core
    Barbirotta, Marcello
    Cheikh, Abdallah
    Mastrandrea, Antonio
    Menichelli, Francesco
    Ottavi, Marco
    Olivieri, Mauro
    [J]. JOURNAL OF LOW POWER ELECTRONICS AND APPLICATIONS, 2023, 13 (01)
  • [7] Design and Evaluation of Buffered Triple Modular Redundancy in Interleaved-Multi-Threading Processors
    Barbirotta, Marcello
    Cheikh, Abdallah
    Mastrandrea, Antonio
    Menichelli, Francesco
    Olivieri, Mauro
    [J]. IEEE ACCESS, 2022, 10 : 126074 - 126088
  • [8] A Fault Tolerant soft-core obtained from an Interleaved-Multi-Threading RISC-V microprocessor design
    Barbirotta, Marcello
    Cheikh, Abdallah
    Mastrandrea, Antonio
    Menichelli, Francesco
    Vigli, Francesco
    Olivieri, Mauro
    [J]. 34TH IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFT 2021), 2021,
  • [9] SafeDE: A Low-Cost Hardware Solution to Enforce Diverse Redundancy in Multicores
    Bas, Francisco
    Alcaide, Sergi
    Cabo, Guillem
    Benedicte, Pedro
    Abella, Jaume
    [J]. IEEE TRANSACTIONS ON DEVICE AND MATERIALS RELIABILITY, 2022, 22 (02) : 111 - 119
  • [10] Blasi L., 2019, INT C APPL ELECT PER, P505