EFFICIENT DIAGNOSIS OF MULTIPROCESSOR SYSTEMS UNDER PROBABILISTIC MODELS

被引:43
|
作者
BLOUGH, DM [1 ]
SULLIVAN, GF [1 ]
MASSON, GM [1 ]
机构
[1] JOHNS HOPKINS UNIV, DEPT COMP SCI, BALTIMORE, MD 21218 USA
关键词
ALGORITHMS; FAULT DIAGNOSIS; HYPERCUBE; MULTIPROCESSOR SYSTEMS; PERMANENT FAULTS; PROBABILISTIC MODELS;
D O I
10.1109/12.165394
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the problem of fault diagnosis in multiprocessor systems is considered under a probabilistic fault model. This work focuses on minimizing the number of tests that must be conducted in order to correctly diagnose the state of every processor in the system with high probability. A diagnosis algorithm that can correctly diagnose the state of every processor with probability approaching one in a class of systems performing slightly greater than a linear number of tests is presented. A nearly matching lower bound on the number of tests required to achieve correct diagnosis in arbitrary systems is also proven. Lower and upper bounds on the number of tests required for regular systems are also presented. A class of regular systems which includes hypercubes is shown to be correctly diagnosable with high probability. In all cases, the number of tests required under this probabilistic model is shown to be significantly less than under a bounded-size fault set model. Because the number of tests that must be conducted is a measure of the diagnosis overhead, these results represent a dramatic improvement in the performance of system-level diagnosis techniques.
引用
收藏
页码:1126 / 1136
页数:11
相关论文
共 50 条
  • [1] PROBABILISTIC DIAGNOSIS OF MULTIPROCESSOR SYSTEMS
    LEE, SG
    SHIN, KG
    ACM COMPUTING SURVEYS, 1994, 26 (01) : 121 - 139
  • [2] PROBABILISTIC DIAGNOSIS IN MULTIPROCESSOR SYSTEMS
    NARRAWAY, JJ
    MA, W
    MICROPROCESSING AND MICROPROGRAMMING, 1990, 28 (1-5): : 75 - 78
  • [3] Probabilistic cluster fault diagnosis for multiprocessor systems
    Niu, Baohua
    Zhou, Shuming
    Zhang, Hong
    Zhang, Qifan
    THEORETICAL COMPUTER SCIENCE, 2024, 1020
  • [4] ON PROBABILISTIC DIAGNOSIS OF MULTIPROCESSOR SYSTEMS USING MULTIPLE SYNDROMES
    LEE, S
    SHIN, KG
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1994, 5 (06) : 630 - 638
  • [5] Probabilistic Fault Diagnosis of Clustered Faults for Multiprocessor Systems
    Sun, Xue-Li
    Fan, Jian-Xi
    Cheng, Bao-Lei
    Wang, Yan
    Zhang, Li
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 38 (04) : 821 - 833
  • [6] Probabilistic Fault Diagnosis of Clustered Faults for Multiprocessor Systems
    Xue-Li Sun
    Jian-Xi Fan
    Bao-Lei Cheng
    Yan Wang
    Li Zhang
    Journal of Computer Science and Technology, 2023, 38 : 821 - 833
  • [7] Local Diagnosis Algorithms for Multiprocessor Systems Under the Comparison Diagnosis Model
    Lin, Cheng-Kuan
    Teng, Yuan-Hsiang
    Tan, Jimmy J. M.
    Hsu, Lih-Hsing
    IEEE TRANSACTIONS ON RELIABILITY, 2013, 62 (04) : 800 - 810
  • [8] Conditional-Fault Diagnosability of Multiprocessor Systems with an Efficient Local Diagnosis Algorithm under the PMC Model
    Lin, Cheng-Kuan
    Kung, Tzu-Liang
    Tan, Jimmy J. M.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (10) : 1669 - 1680
  • [9] MODELS FOR DIAGNOSABLE SYSTEMS AND PROBABILISTIC FAULT DIAGNOSIS
    MAHESHWARI, SN
    HAKIMI, SL
    IEEE TRANSACTIONS ON COMPUTERS, 1976, 25 (03) : 228 - 236
  • [10] A PARALLEL PROBABILISTIC SYSTEM-LEVEL FAULT DIAGNOSIS APPROACH FOR LARGE MULTIPROCESSOR SYSTEMS
    Elhadef, Mourad
    Abrougui, Kaouther
    Das, Shantanu
    Nayak, Amiya
    PARALLEL PROCESSING LETTERS, 2006, 16 (01) : 63 - 79