Improving the Effectiveness of TMR Designs on FPGAs with SEU-Aware Incremental Placement

被引:17
作者
Cannon, Matthew [1 ]
Keller, Andrew [1 ]
Wirthlin, Michael [1 ]
机构
[1] Brigham Young Univ, NSF Ctr High Performance Reconfigurable Comp CHRE, Provo, UT 84602 USA
来源
PROCEEDINGS 26TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2018) | 2018年
基金
美国国家科学基金会;
关键词
Field Programmable Gate Array (FPGA); Triple Modular Redundancy (TMR); reliability; common mode failures; Single Event Effect (SEE); Single Event Upset (SEU); faultinjection; radiation testing; BENCHMARKS;
D O I
10.1109/FCCM.2018.00031
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
TMR combined with configuration scrubbing is an effective technique to mitigate against radiation-induced CRAM upsets on SRAM-based FPGAs. However, its effectiveness is limited by low-level common mode failures due to the physical mapping of a design to the FPGA device. This paper describes how common mode failures are introduced during the implementation process and introduces an approach for resolving them through a custom incremental placement tool for Xilinx 7-Series FPGAs. Multiple designs across multiple generations of devices are shown to be sensitive to common mode failures. Applying the incremental placement technique yields an improvement of 10,721x over an unmitigated design through fault-injection testing. Radiation testing is then performed to show that the MTTF of this technique is 91,500 days in GEO orbit, a 367x improvement over the unmitigated design and a 5x improvement over baseline TMR.
引用
收藏
页码:141 / 148
页数:8
相关论文
共 25 条
[1]   Single-Event Upset (SEU) Results of Embedded Error Detect and Correct Enabled Block Random Access Memory (Block RAM) Within the Xilinx XQR5VFX130 [J].
Allen, Gregory R. ;
Edmonds, Larry ;
Tseng, Chen Wei ;
Swift, Gary ;
Carmichael, Carl .
IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2010, 57 (06) :3426-3431
[2]   FAULT INJECTION FOR DEPENDABILITY VALIDATION - A METHODOLOGY AND SOME APPLICATIONS [J].
ARLAT, J ;
AGUERA, M ;
AMAT, L ;
CROUZET, Y ;
FABRE, JC ;
LAPRIE, JC ;
MARTINS, E ;
POWELL, D .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1990, 16 (02) :166-182
[3]  
Berg M., 2017, NEW DEV ERROR DETECT
[4]   Effectiveness of Internal Versus External SEU Scrubbing Mitigation Strategies in a Xilinx FPGA: Design, Test, and Analysis [J].
Berg, Melanie ;
Poivey, C. ;
Petrick, D. ;
Espinosa, D. ;
Lesea, Austin ;
LaBel, K. A. ;
Friendlich, M. ;
Kim, H. ;
Phan, Anthony .
IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2008, 55 (04) :2259-2266
[5]   RT-level ITC'99 benchmarks and first ATPG results [J].
Corno, F ;
Reorda, MS ;
Squillero, G .
IEEE DESIGN & TEST OF COMPUTERS, 2000, 17 (03) :44-53
[6]  
Haroldsen T., 2015, Proceedings of the 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, P66, DOI DOI 10.1145/2684746.2689085
[7]   Design Techniques for Xilinx Virtex FPGA Configuration Memory Scrubbers [J].
Herrera-Alzu, I. ;
Lopez-Vallejo, M. .
IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2013, 60 (01) :376-385
[8]   Reconfigurable Fault Tolerance: A Comprehensive Framework for Reliable and Adaptive FPGA-Based Space Computing [J].
Jacobs, Adam ;
Cieslewski, Grzegorz ;
George, Alan D. ;
Gordon-Ross, Ann ;
Lam, Herman .
ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2012, 5 (04)
[9]  
Johnson J, 2010, FPGA 10, P249
[10]   On the optimal design of triple modular redundancy logic for SRAM-based FPGAs [J].
Kastensmidt, FL ;
Sterpone, L ;
Carro, L ;
Reorda, MS .
DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION, VOLS 1 AND 2, PROCEEDINGS, 2005, :1290-1295