EFFICIENT DIAGNOSIS OF MULTIPROCESSOR SYSTEMS UNDER PROBABILISTIC MODELS

被引:44
作者
BLOUGH, DM [1 ]
SULLIVAN, GF [1 ]
MASSON, GM [1 ]
机构
[1] JOHNS HOPKINS UNIV, DEPT COMP SCI, BALTIMORE, MD 21218 USA
关键词
ALGORITHMS; FAULT DIAGNOSIS; HYPERCUBE; MULTIPROCESSOR SYSTEMS; PERMANENT FAULTS; PROBABILISTIC MODELS;
D O I
10.1109/12.165394
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the problem of fault diagnosis in multiprocessor systems is considered under a probabilistic fault model. This work focuses on minimizing the number of tests that must be conducted in order to correctly diagnose the state of every processor in the system with high probability. A diagnosis algorithm that can correctly diagnose the state of every processor with probability approaching one in a class of systems performing slightly greater than a linear number of tests is presented. A nearly matching lower bound on the number of tests required to achieve correct diagnosis in arbitrary systems is also proven. Lower and upper bounds on the number of tests required for regular systems are also presented. A class of regular systems which includes hypercubes is shown to be correctly diagnosable with high probability. In all cases, the number of tests required under this probabilistic model is shown to be significantly less than under a bounded-size fault set model. Because the number of tests that must be conducted is a measure of the diagnosis overhead, these results represent a dramatic improvement in the performance of system-level diagnosis techniques.
引用
收藏
页码:1126 / 1136
页数:11
相关论文
共 50 条
[31]   Energy-Efficient Task Allocation Techniques for Asymmetric Multiprocessor Embedded Systems [J].
Elewi, Abdullah ;
Shalan, Mohamed ;
Awadalla, Medhat ;
Saad, Elsayed M. .
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2014, 13
[32]   SELF-DIAGNOSABILITY OF MULTIPROCESSOR SYSTEMS WITH HYBRID FAULTS - DIAGNOSIS BY COMPARISON APPROACH [J].
SHIN, YC ;
SENGUPTA, A .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-FUNDAMENTAL THEORY AND APPLICATIONS, 1993, 40 (05) :355-358
[33]   A Probabilistic SDG approach to fault diagnosis of industrial systems [J].
Lue, Ning ;
Wang, Xiong .
ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2007, 2 :517-+
[34]   Probabilistic Fault Diagnosis and its Analysis in Multicomputer Systems [J].
Kobayashi, Manabu ;
Matsushima, Toshiyasu ;
Hirasawa, Shigeichi .
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2018, E101A (12) :2072-2081
[35]   Probabilistic Fault Diagnosis and its Analysis in Multicomputer Systems [J].
Kobayashi, Manabu ;
Takabatake, Toshinori ;
Matsushima, Toshiyasu ;
Hirasawa, Shigeichi .
2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, :1205-1211
[36]   Markov nets: Probabilistic models for distributed and concurrent systems [J].
Benveniste, A ;
Fabre, E ;
Haar, S .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (11) :1936-1950
[37]   FAULT DIAGNOSIS OF MULTIPROCESSOR SYSTEMS BASED ON GENETIC AND ESTIMATION OF DISTRIBUTION ALGORITHMS: A PERFORMANCE EVALUATION [J].
Duarte, Elias P., Jr. ;
Pozo, Aurora T. R. ;
Nassu, Bogdan T. .
INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2010, 19 (01) :1-18
[38]   Energy-efficient real-time task scheduling in multiprocessor DVS systems [J].
Chen, Jian-Jia ;
Yang, Chuan-Yue ;
Kuo, Tei-Wei ;
Shih, Chi-Sheng .
PROCEEDINGS OF THE ASP-DAC 2007, 2007, :342-+
[39]   A Path-Based Topology-Agnostic Fault Diagnosis Strategy for Multiprocessor Systems [J].
Chen, Lin ;
Feng, Hao ;
Wu, Jiong .
IEEE TRANSACTIONS ON COMPUTERS, 2025, 74 (06) :1886-1896
[40]   Probabilistic diagnosis of large systems using a parallel genetic approach [J].
Elhadef, M ;
Abrougui, K ;
Das, S ;
Nayak, A .
PDPTA '05: Proceedings of the 2005 International Conference on Parallel and Distributed Processing Techniques and Applications, Vols 1-3, 2005, :1010-1016