EVALUATION AND COMPARISON OF FAULT-TOLERANT SOFTWARE TECHNIQUES

被引:16
作者
HUDAK, J [1 ]
SUH, BH [1 ]
SIEWIOREK, D [1 ]
SEGALL, Z [1 ]
机构
[1] CARNEGIE MELLON UNIV,DEPT ELECT & COMP ENGN,PITTSBURGH,PA 15213
关键词
FAULT-TOLERANT SOFTWARE; N-VERSION PROGRAMMING; RECOVERY BLOCK; CONCURRENT ERROR-DETECTION; ALGORITHM-BASED FAULT TOLERANCE; MARKOV MODEL; FAULT-INJECTION TESTING;
D O I
10.1109/24.229487
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Various fault-tolerant software techniques have been proposed in order to meet the reliability requirements of critical systems. This paper evaluates 4 implementations of fault-tolerant software techniques with respect to hardware and design faults. Project participants were divided into 4 groups, each of which developed fault-tolerant software based on a common specification. Each group applied one of the following techniques: n-version programming, recovery block, concurrent error-detection, and algorithm-based fault tolerance. Independent testing and modeling groups within the project then thoroughly analyzed the fault-tolerant software. Using fault-injection tools, the testing group subjected the fault-tolerant software to simulated design and hardware faults. Simulated design-faults included control flow, array boundary, computational, and post/pre increment/decrement software mutations. Simulated hardware-faults included code and data corruption. Data collected from the fault-injection experiment were then mapped into a discrete-time Markov model developed by the modeling group. Based on this model, the effectiveness of each implementation of the fault-tolerant software technique with respect to availability, correctness, and time to failure given an error, is contrasted with measured data. Finally, the model is analyzed with respect to additional figures of merit identified during the modeling process, and the techniques are ranked using an application taxonomy.
引用
收藏
页码:190 / 204
页数:15
相关论文
共 22 条
[1]   SOFTWARE FAULT TOLERANCE - AN EVALUATION [J].
ANDERSON, T ;
BARRETT, PA ;
HALLIWELL, DN ;
MOULDING, MR .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1985, 11 (12) :1502-1510
[2]   THE N-VERSION APPROACH TO FAULT-TOLERANT SOFTWARE [J].
AVIZIENIS, A .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1985, 11 (12) :1491-1501
[3]  
BARTON S, 1988, 18THE IEEE P INT S F
[4]   AN EXPERIMENTAL EVALUATION OF SOFTWARE REDUNDANCY AS A STRATEGY FOR IMPROVING RELIABILITY [J].
ECKHARDT, DE ;
CAGLAYAN, AK ;
KNIGHT, JC ;
LEE, LD ;
MCALLISTER, DF ;
VOUK, MA ;
KELLY, JPJ .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1991, 17 (07) :692-702
[5]   A CENSUS OF TANDEM SYSTEM AVAILABILITY BETWEEN 1985 AND 1990 [J].
GRAY, J .
IEEE TRANSACTIONS ON RELIABILITY, 1990, 39 (04) :409-418
[6]   ALGORITHM-BASED FAULT TOLERANCE FOR MATRIX OPERATIONS [J].
HUANG, KH ;
ABRAHAM, JA .
IEEE TRANSACTIONS ON COMPUTERS, 1984, 33 (06) :518-528
[7]  
KLEINROCK LR, 1977, QUEUING SYSTEMS
[8]   AN EXPERIMENTAL EVALUATION OF THE ASSUMPTION OF INDEPENDENCE IN MULTIVERSION PROGRAMMING [J].
KNIGHT, JC ;
LEVESON, NG .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1986, 12 (01) :96-109
[9]  
Larson H. J., 1979, PROBABILISTIC MODELS, VII
[10]  
Larson H.M., 1979, PROBABILISTIC MODELS, V1