RASA: Reliability-Aware Scheduling Approach for FPGA-Based Resilient Embedded Systems in Extreme Environments

被引:13
作者
Saha, Sangeet [1 ]
Zhai, Xiaojun [1 ]
Ehsan, Shoaib [1 ]
Majeed, Shakaiba [2 ]
McDonald-Maier, Klaus [1 ]
机构
[1] Univ Essex, Embedded & Intelligent Syst Lab, Colchester CO4 3SQ, Essex, England
[2] Hanyang Univ, Real Time Comp & Commun Lab, Seoul 04763, South Korea
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2022年 / 52卷 / 06期
基金
英国工程与自然科学研究理事会;
关键词
Field programmable gate arrays; Task analysis; Real-time systems; Robots; Hardware; Fault tolerant systems; Schedules; Extreme environments (EEs); field-programmable gate array (FPGA); partial reconfiguration; real-time scheduling; reliability; resilient systems; single-event upsets (SEUs); TASKS;
D O I
10.1109/TSMC.2021.3077697
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Field-programmable gate arrays (FPGAs) offer the flexibility of general-purpose processors along with the performance efficiency of dedicated hardware that essentially renders it as a platform of choice for modern-day robotic systems for achieving real-time performance. Such robotic systems when deployed in harsh environments often get plagued by faults due to extreme conditions. Consequently, the real-time applications running on FPGA become susceptible to errors which call for a reliability-aware task scheduling approach, the focus of this article. We attempt to address this challenge using a hybrid offline-online approach. Given a set of periodic real-time tasks that require to be executed, the offline component generates a feasible preemptive schedule with specific preemption points. At runtime, these preemption events are utilized for fault detection. Upon detecting any faulty execution at such distinct points, the reliability-aware scheduling approach, RASA, orchestrates the recovery mechanism to remediate the scenario without jeopardizing the predefined schedule. Effectiveness of the proposed strategy has been verified through simulation-based experiments and we observed that the RASA is able to achieve 72% of task acceptance rate even under 70% of system workloads with high fault occurrence rates.
引用
收藏
页码:3885 / 3899
页数:15
相关论文
共 36 条
[1]  
[Anonymous], 2013, P IEEE HIGH PERF EXT
[2]   From cryptography to hardware: analyzing and protecting embedded Xilinx BRAM for cryptographic applications [J].
Bhasin, Shivam ;
Guilley, Sylvain ;
Heuser, Annelie ;
Danger, Jean-Luc .
JOURNAL OF CRYPTOGRAPHIC ENGINEERING, 2013, 3 (04) :213-225
[3]   Robust Mixed-Criticality Systems [J].
Burns, Alan ;
Davis, Robert, I ;
Baruah, Sanjoy ;
Bate, Iain .
IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (10) :1478-1491
[4]  
Davidson S., 1999, PROC INT TEST CONF, P1125
[5]   A Survey of Hard Real-Time Scheduling for Multiprocessor Systems [J].
Davis, Robert I. ;
Burns, Alan .
ACM COMPUTING SURVEYS, 2011, 43 (04)
[6]  
Ejlali A., 2009, PROC IEEEACM INT C H, P193
[7]  
Eles P, 2008, DES AUT TEST EUROPE, P960
[8]   Energy-Efficient Scheduling of Real-Time Tasks in Reconfigurable Homogeneous Multicore Platforms [J].
Gammoudi, Aymen ;
BenZina, Adel ;
Khalgui, Mohamed ;
Chillet, Daniel .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (12) :5092-5105
[9]  
Ghorbel A, 2013, 2013 INTERNATIONAL CONFERENCE ON INDIVIDUAL AND COLLECTIVE BEHAVIORS IN ROBOTICS (ICBR), P56, DOI 10.1109/ICBR.2013.6729272
[10]  
Gobillot N, 2016, 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), P3306, DOI 10.1109/IROS.2016.7759509