PROBABILISTIC EVALUATION OF ONLINE CHECKS IN FAULT-TOLERANT MULTIPROCESSOR SYSTEMS

Cited by: 4
Authors
NAIR, VSS [1]
HOSKOTE, YV [1]
ABRAHAM, JA [1]
Affiliations
[1] UNIV TEXAS, COMP ENGN RES CTR, AUSTIN, TX 78758 USA
Keywords
ALGORITHM-BASED FAULT TOLERANCE; CONCURRENT ERROR DETECTION; FAULT COVERAGE; LOCATABILITY; PROBABILISTIC TECHNIQUES
DOI
10.1109/12.142679
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The analysis of fault-tolerant multiprocessor systems that use concurrent error detection (CED) schemes is much more difficult than the analysis of conventional fault-tolerant architectures. Various analytical techniques have been proposed to evaluate CED schemes deterministically. However, these approaches are based on worst-case assumptions related to the failure of system components. Often, the evaluation results do not reflect the actual fault tolerance capabilities of the system. In this paper, we develop a probabilistic approach to evaluate the fault detecting and locating capabilities of on-line checks in a system. The various probabilities associated with the checking schemes are identified and used in the framework of the matrix-based model [1]. Based on these probabilistic matrices, estimates for the fault tolerance capabilities of various systems are derived analytically.
Pages: 532-541
Number of pages: 10