A scheduling heuristics for distributed real-time embedded systems tolerant to processor and communication media failures

被引:6
作者
Girault, A
Kalla, H
Sorel, Y
机构
[1] ZIRST, INRIA, Rhone Alpes Res Unit, F-38334 Montbonnot St Martin, Saint Ismier, France
[2] Inst Natl Rech Informat & Automat, Rocquencourt Res Unit, F-78153 Le Chesnay, France
关键词
D O I
10.1080/00207540410001705248
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Hardware fault tolerance is an important consideration in critical distributed real-time embedded systems and has been extensively researched. In these systems, critical real-time constraints must be satisfied even in the presence of hardware component failures. Our goal is to propose a solution to automatically produce a fault-tolerant distributed schedule of a given algorithm onto a given distributed architecture, according to real-time constraints. The distributed architectures we consider have bidirectional point-to-point communication links. Our solution is a list scheduling heuristics, based on disjoint paths to tolerate a fixed number of arbitrary processor and communication link failures. Because of the resource limitation in embedded systems, our heuristics implements a software solution based on the active replication technique, where each operation of the algorithm is replicated on different processors. With a detailed example, we show the techniques used to satisfy the real-time constraints and tolerate the failure of processor and communication links. Simulations show the efficiency of our method compared with other heuristics found in the literature.
引用
收藏
页码:2877 / 2898
页数:22
相关论文
共 48 条
[1]   On exploiting task duplication in parallel program scheduling [J].
Ahmad, I ;
Kwok, YK .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1998, 9 (09) :872-892
[2]   Fault-tolerant real-time scheduling using passive replicas [J].
Ahn, KD ;
Kim, J ;
Hong, SJ .
PACIFIC RIM INTERNATIONAL SYMPOSIUM ON FAULT-TOLERANT SYSTEMS, PROCEEDINGS, 1997, :98-103
[3]  
ALOMARI R, 2001, P 15 INT PAR DISTR P
[4]  
[Anonymous], 2000, Software Fault Tolerance: A Tutorial (tech. rep.)
[5]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[6]  
AVIZIENIS A, 2000, P 3 INF SURV WORKSH, P7
[7]  
BANERJEA A, 1994, P GLOBECOM 94 SAN FR, P162
[8]   THE SYNCHRONOUS APPROACH TO REACTIVE AND REAL-TIME SYSTEMS [J].
BENVENISTE, A ;
BERRY, G .
PROCEEDINGS OF THE IEEE, 1991, 79 (09) :1270-1282
[9]   SCHEDULING ALGORITHMS FOR FAULT-TOLERANCE IN HARD-REAL-TIME SYSTEMS [J].
BERTOSSI, AA ;
MANCINI, LV .
REAL-TIME SYSTEMS, 1994, 7 (03) :229-245
[10]  
BOPPANA R, 1995, P 1995 INT C PAR PRO, P118