DFTS: A dynamic fault-tolerant scheduling for real-time tasks in multicore processors

Cited by: 23
Authors
Mottaghi, Mohammad H. [1]
Zarandi, Hamid R. [1, 2]
Affiliations
[1] Amirkabir Univ Technol, Dept Comp Engn & Informat Technol, Tehran Polytech, Tehran, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran, Iran
Keywords
Real-time systems; Dynamic scheduling; Fault tolerance; Multicore processors; Aperiodic tasks
DOI
10.1016/j.micpro.2013.11.013
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents a dynamic scheduling method for real-time tasks in multicore processors that tolerates single and multiple transient faults. Scheduling is based on three factors: (1) the currently released tasks, (2) the currently available processor cores, and (3) the number of faults and their occurrences. Using task utilization together with a defined criticality threshold, the proposed method divides the current ready tasks into critical and noncritical ones. An appropriate fault-tolerance policy is then applied depending on whether a task is critical or noncritical. Moreover, scheduling decisions are made to fulfill two key goals: (1) increasing scheduling feasibility and (2) decreasing the total task execution time. Several simulation experiments are carried out to compare the proposed method with two well-known methods, checkpointing with rollback recovery and hardware replication. Experimental results reveal that, in the presence of multiple transient faults, the feasibility rate of the proposed method is considerably higher than that of these two fault-tolerance methods. Moreover, the average timing overhead of the proposed method is lower than that of the traditional methods. © 2013 Elsevier B.V. All rights reserved.
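The classification step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the utilization formula, the threshold value, or the direction of the comparison, so everything here is an assumption made for illustration: the `Task` fields, the definition of utilization as worst-case execution time divided by the release-to-deadline window, the rule that tasks at or above the threshold count as critical, and the function name `partition_by_criticality`. It is not the authors' algorithm.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Task:
    """Hypothetical aperiodic task model (not taken from the paper)."""
    name: str
    wcet: float      # worst-case execution time
    release: float   # release time
    deadline: float  # absolute deadline

    @property
    def utilization(self) -> float:
        # Assumed definition: execution demand over the available time window.
        return self.wcet / (self.deadline - self.release)


def partition_by_criticality(
    tasks: List[Task], threshold: float
) -> Tuple[List[Task], List[Task]]:
    """Split ready tasks into (critical, noncritical) lists.

    Assumption: a task whose utilization is at or above the threshold
    has little slack and is therefore treated as critical.
    """
    critical = [t for t in tasks if t.utilization >= threshold]
    noncritical = [t for t in tasks if t.utilization < threshold]
    return critical, noncritical
```

Under these assumptions, a task with little slack before its deadline ends up in the critical list, and a scheduler could then select a different fault-tolerance policy for each list.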
Pages: 88-97
Page count: 10