OPTIMAL-DESIGN OF FAULT-TOLERANT DISTRIBUTED SYSTEMS BASED ON A RECURSIVE ALGORITHM

被引:4
作者
PHAM, H [1 ]
UPADHYAYA, SJ [1 ]
机构
[1] SUNY BUFFALO,DEPT ELECT & COMP ENGN,BUFFALO,NY 14260
关键词
DISTRIBUTED SYSTEM; FAULT TOLERANCE; OPTIMIZATION; SYSTEM RELIABILITY;
D O I
10.1109/24.85460
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the issue of optimal design (in terms of the number of processors) of a distributed system and is based on a recursive algorithm for fault tolerance (RAFT). The reliability and performance of the system using RAFT are determined as a function of reliability of individual processors and the number of fault modes in a processor. Also discussed are how to determine the design policies when the objective is to minimize the average system cost given the cost of each processor and the cost of the system failure. Several numerical examples illustrate the results.
引用
收藏
页码:375 / 379
页数:5
相关论文
共 4 条
[1]   FAULT TOLERANCE IN MULTIPROCESSOR SYSTEMS WITHOUT DEDICATED REDUNDANCY [J].
AGRAWAL, P .
IEEE TRANSACTIONS ON COMPUTERS, 1988, 37 (03) :358-362
[2]  
AGRAWAL P, 1985, AUG P INT C PAR PROC, P814
[3]  
KONEMAN B, 1979, CHERRY HILL TEST C, P37
[4]   RELIABILITY-ANALYSIS OF A CLASS OF FAULT-TOLERANT SYSTEMS [J].
PHAM, H ;
UPADHYAYA, SJ .
IEEE TRANSACTIONS ON RELIABILITY, 1989, 38 (03) :333-337