New Techniques for Improving the Performance of the Lockstep Architecture for SEEs Mitigation in FPGA Embedded Processors

被引:28
作者
Abate, F. [1 ]
Sterpone, L. [1 ]
Lisboa, C. A. [2 ]
Carro, L. [2 ]
Violante, M. [1 ]
机构
[1] Politecn Torino, I-10129 Turin, Italy
[2] Univ Fed Rio Grande do Sul, PPGC, Inst Informat, BR-91501970 Porto Alegre, RS, Brazil
关键词
Embedded processors reliability; single event effects; lockstep; checkpoint; rollback recovery; fault injection; SOFT ERRORS;
D O I
10.1109/TNS.2009.2013237
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The growing availability of embedded processors inside FPGAs provides unprecedented flexibility for system designers. The use of such devices for space or mission critical applications, however, is being delayed by the lack of effective low cost techniques to mitigate radiation induced errors. In this paper a non invasive approach for the implementation of fault tolerant systems based on COTS processors embedded in FPGAs, using lockstep in conjunction with checkpoint and rollback recovery, is presented. The proposed approach does not require modifications in the processor architecture or in the application software. The experimental validation of this approach through fault injection is described, the corresponding results are discussed, and the addition of a write history table as a means to reduce the performance overhead imposed by previous implementations is proposed and evaluated.
引用
收藏
页码:1992 / 2000
页数:9
相关论文
共 15 条
[1]   A New Mitigation Approach for Soft Errors in Embedded Processors [J].
Abate, Francesco ;
Sterpone, Luca ;
Violante, Massimo .
IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2008, 55 (04) :2063-2069
[2]   Soft errors in advanced computer systems [J].
Baumann, R .
IEEE DESIGN & TEST OF COMPUTERS, 2005, 22 (03) :258-266
[3]   A new hybrid fault detection technique for systems-on-a-chip [J].
Bernardi, P ;
Bolzani, LMV ;
Rebaudengo, M ;
Reorda, MS ;
Vargas, FL ;
Violante, M .
IEEE TRANSACTIONS ON COMPUTERS, 2006, 55 (02) :185-198
[4]   Experimentally evaluating an automatic approach for generating safety-critical software with respect to transient errors [J].
Cheynet, P ;
Nicolescu, B ;
Velazco, R ;
Rebaudengo, M ;
Reorda, MS ;
Violante, M .
IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2000, 47 (06) :2231-2236
[5]  
Dupont E, 2002, IEEE DES TEST COMPUT, V19, P56, DOI 10.1109/MDT.2002.1003798
[6]  
Heijmen T., 2002, 2002828 PHIL EL NATL
[7]   CONCURRENT ERROR-DETECTION USING WATCHDOG PROCESSORS - A SURVEY [J].
MAHMOOD, A ;
MCCLUSKEY, EJ .
IEEE TRANSACTIONS ON COMPUTERS, 1988, 37 (02) :160-174
[8]   Control-flow checking by software signatures [J].
Oh, N ;
Shirvani, PP ;
McCluskey, EJ .
IEEE TRANSACTIONS ON RELIABILITY, 2002, 51 (01) :111-122
[9]  
PIGNOL M, 2006, P IEEE INT ON LIN TE, P10
[10]  
PRADHAN D, 1995, FAULT TOLERANT COMPU