EFFICIENT DIAGNOSIS OF MULTIPROCESSOR SYSTEMS UNDER PROBABILISTIC MODELS

被引:43
作者
BLOUGH, DM [1 ]
SULLIVAN, GF [1 ]
MASSON, GM [1 ]
机构
[1] JOHNS HOPKINS UNIV, DEPT COMP SCI, BALTIMORE, MD 21218 USA
关键词
ALGORITHMS; FAULT DIAGNOSIS; HYPERCUBE; MULTIPROCESSOR SYSTEMS; PERMANENT FAULTS; PROBABILISTIC MODELS;
D O I
10.1109/12.165394
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the problem of fault diagnosis in multiprocessor systems is considered under a probabilistic fault model. This work focuses on minimizing the number of tests that must be conducted in order to correctly diagnose the state of every processor in the system with high probability. A diagnosis algorithm that can correctly diagnose the state of every processor with probability approaching one in a class of systems performing slightly greater than a linear number of tests is presented. A nearly matching lower bound on the number of tests required to achieve correct diagnosis in arbitrary systems is also proven. Lower and upper bounds on the number of tests required for regular systems are also presented. A class of regular systems which includes hypercubes is shown to be correctly diagnosable with high probability. In all cases, the number of tests required under this probabilistic model is shown to be significantly less than under a bounded-size fault set model. Because the number of tests that must be conducted is a measure of the diagnosis overhead, these results represent a dramatic improvement in the performance of system-level diagnosis techniques.
引用
收藏
页码:1126 / 1136
页数:11
相关论文
共 50 条
  • [21] Fault Diagnosis Method for Rolling Bearing Based on Probabilistic Diffusion Models Under Imbalanced Data
    Zhou, Peng
    Wu, Dengshuai
    Xu, Jiacan
    Wang, Zinan
    Ma, Dazhong
    IEEE SENSORS JOURNAL, 2024, 24 (23) : 40059 - 40068
  • [22] G-good-neighbor diagnosability under the modified comparison model for multiprocessor systems
    Wang, Mu-Jiang-Shan
    Xiang, Dong
    Hsieh, Sun-Yuan
    THEORETICAL COMPUTER SCIENCE, 2025, 1028
  • [23] Designing clustered multiprocessor systems under packaging and technological advancements
    Basak, D
    Panda, DK
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1996, 7 (09) : 962 - 978
  • [24] Probabilistic fault diagnosis in discrete event systems
    Wang, X
    Chattopadhyay, I
    Ray, A
    2004 43RD IEEE CONFERENCE ON DECISION AND CONTROL (CDC), VOLS 1-5, 2004, : 4794 - 4799
  • [25] Compiler Techniques for Efficient Communications in Circuit Switched Networks for Multiprocessor Systems
    Shao, Shuyi
    Jones, Alex K.
    Melhem, Rami
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2009, 20 (03) : 331 - 345
  • [26] Reliability Evaluation of Clustered Faults for Regular Networks Under the Probabilistic Diagnosis Model
    Li, Xiao-Yan
    Zhang, Yufang
    Liu, Ximeng
    Wang, Xiangke
    Cheng, Hongju
    COMPUTER JOURNAL, 2023, 66 (02) : 441 - 462
  • [27] FFNLFD: Fault Diagnosis of Multiprocessor Systems at Local Node With Fault-Free Neighbors Under PMC Model and MM* Model
    Lin, Limei
    Huang, Yanze
    Lin, Yuhang
    Hsieh, Sun-Yuan
    Xu, Li
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (07) : 1739 - 1751
  • [28] Fault diagnosability of (K4 - e)-free multiprocessor systems under the PMC and HPMC model
    Xu, Liqiong
    Yu, Lin
    Zhou, Shuming
    Lin, Cheng-Kuan
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2024, 39 (06) : 653 - 668
  • [29] Energy-Efficient Scheduling of Periodic Applications on Safety-Critical Time-Triggered Multiprocessor Systems
    Jiang, Xiaowen
    Huang, Kai
    Zhang, Xiaomeng
    Yan, Rongjie
    Wang, Ke
    Xiong, Dongliang
    Yan, Xiaolang
    ELECTRONICS, 2018, 7 (06):
  • [30] Energy-Efficient Task Allocation Techniques for Asymmetric Multiprocessor Embedded Systems
    Elewi, Abdullah
    Shalan, Mohamed
    Awadalla, Medhat
    Saad, Elsayed M.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2014, 13