Microcontroller Compiler-Assisted Software Fault Tolerance

被引:24
作者
Bohman, Matthew [1 ,2 ]
James, Benjamin [1 ,2 ]
Wirthlin, Michael J. [1 ,2 ]
Quinn, Heather [3 ]
Goeders, Jeffrey [1 ,2 ]
机构
[1] Brigham Young Univ, Dept Elect & Comp Engn, Provo, UT 84602 USA
[2] NSF Ctr Space High Performance & Resilient Comp, Provo, UT 84602 USA
[3] Los Alamos Natl Lab, ISR 3 Space Data Syst, Los Alamos, NM 87545 USA
基金
美国国家科学基金会;
关键词
Silent data corruption (SDC); single-event upset (SEU); soft errors; software fault tolerance;
D O I
10.1109/TNS.2018.2886094
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Commercial off-the-shelf microcontrollers can be useful for noncritical processing on spaceborne platforms. These microprocessors can be inexpensive and consume small amounts of power. However, the software running on these processors is vulnerable to radiation upsets. In this paper, we present a fully automated, configurable, software-based tool to increase the reliability of microprocessors in high-radiation environments. This tool consists of a set of open-source LLVM compiler passes to automatically implement software-based mitigation techniques. We duplicate or triplicate computations and insert voting mechanisms into software during the compilation process, allowing for runtime error correction. While the techniques we implement are not novel, previous work has typically been closed source, processor architecture dependent, not automated, and not tested in real high-radiation environments. In contrast, the compiler passes presented in this paper are publicly available, highly customizable, and are platform independent and language independent. We have tested our modified software using both fault injection and through neutron beam radiation on a Texas Instruments MSP430 microcontroller. When tested by a neutron beam, we were able to decrease the cross section of programs by 17-29x, increasing mean-work-to-failure by 4-7x.
引用
收藏
页码:223 / 232
页数:10
相关论文
共 50 条
  • [31] Computing reliability: On the differences between software testing and software fault injection techniques
    Kooli, Maha
    Kaddachi, Firas
    Di Natale, Giorgio
    Bosio, Alberto
    Benoit, Pascal
    Torres, Lionel
    MICROPROCESSORS AND MICROSYSTEMS, 2017, 50 : 102 - 112
  • [32] Estimating circuit fault-tolerance by means of transient-fault injection in VHDL
    Vargas, F
    Amory, A
    Velazco, R
    6TH IEEE INTERNATIONAL ON-LINE TESTING WORKSHOP, PROCEEDINGS, 2000, : 67 - 72
  • [33] Fault tolerance at system level based on RADIC architecture
    Castro-Leon, Marcela
    Meyer, Hugo
    Rexachs, Dolores
    Luque, Emilio
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 86 : 98 - 111
  • [34] Selective Fault Tolerance by Counting Gates with Controlling Value
    Breitenreiter, Anselm
    Weidling, Stefan
    Schrape, Oliver
    Zeidler, Steffen
    Reviriego, Pedro
    Krstic, Milos
    2019 IEEE 25TH INTERNATIONAL SYMPOSIUM ON ON-LINE TESTING AND ROBUST SYSTEM DESIGN (IOLTS 2019), 2019, : 15 - 20
  • [35] Recovery Block Fault Tolerance Considering Correlated Failures
    Fiondella, Lance
    Zeephongsekul, Panlop
    2014 60TH ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM (RAMS), 2014,
  • [36] FT-iSort: Efficient Fault Tolerance for Introsort
    Li, Sihuan
    Li, Hongbo
    Liang, Xin
    Chen, Jieyang
    Giem, Elisabeth
    Ouyang, Kaiming
    Zhao, Kai
    Di, Sheng
    Cappello, Franck
    Chen, Zizhong
    PROCEEDINGS OF SC19: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2019,
  • [37] PROCESS CHECKPOINTING PRIMITIVES FOR FAULT TOLERANCE - DEFINITIONS AND EXAMPLES
    CLEMATIS, A
    DODERO, G
    GIANUZZI, V
    MICROPROCESSORS AND MICROSYSTEMS, 1992, 16 (01) : 15 - 23
  • [38] A HIERARCHICAL PROGRAM STRUCTURE FOR CONCURRENT FAULT TOLERANT SOFTWARE
    CLEMATIS, A
    GIANUZZI, V
    MICROPROCESSING AND MICROPROGRAMMING, 1993, 37 (1-5): : 179 - 182
  • [39] Using a register model fault simulator to assess the fault detection coverage of fault tolerant embedded software for automotive systems
    Freinatis, S
    Hunger, A
    ESS'98 - SIMULATION TECHNOLOGY: SCIENCE AND ART, 1998, : 490 - 494
  • [40] Fault Injection and Fault Tolerance Methodologies for Assessing Device Robustness and Mitigating against Ionizing Radiation
    Alexandrescu, Dan
    Sterpone, Luca
    Lopez-Ongil, Celia
    2014 19TH IEEE EUROPEAN TEST SYMPOSIUM (ETS 2014), 2014,