A Path-Based Topology-Agnostic Fault Diagnosis Strategy for Multiprocessor Systems

被引:0
作者
Chen, Lin [1 ]
Feng, Hao [1 ]
Wu, Jiong [1 ]
机构
[1] Hainan Univ, Sch Comp Sci & Technol, Haikou 570100, Peoples R China
基金
海南省自然科学基金;
关键词
Fault diagnosis; Program processors; Computational modeling; Multiprocessing systems; Testing; Terminology; Computers; Training; Topology; Stability analysis; System-level fault diagnosis; PMC model; MM model; interconnection networks; CONDITIONAL DIAGNOSABILITY; ALGORITHM; NETWORKS;
D O I
10.1109/TC.2025.3543701
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Fault diagnosis technology is a method for locating faulty processors in multiprocessor systems, and it plays a crucial role in ensuring system stability, security and reliability. A widely used approach in this technology is the system-level strategy, which determines processor status by interpreting the set of test results between adjacent processors. Among them, the PMC and MM models are two commonly employed methods for generating these results. The diversity and complexity of network topologies in systems constrain existing algorithms to specific topologies, while the limitations of fault diagnosis strategies lead to reduced fault tolerance. In this paper, we present a novel path-based method to tackle the fault diagnosis problems in various networks according to the PMC and MM models. Firstly, we introduce the algorithm for partitioning the path into subpaths based on these models. To ensure that at least one subpath is diagnosed as fault-free, we derive the relationship between the fault bound $T$T and the path length $N$N. Then, building on methods for recognizing the subpath states, we have developed fault diagnosis algorithms for both the PMC and MM models. The simulation results show that our proposed algorithms can quickly and accurately diagnose faults in multiprocessor systems.
引用
收藏
页码:1886 / 1896
页数:11
相关论文
共 32 条
[11]   Conditional diagnosability measures for large multiprocessor systems [J].
Lai, PL ;
Tan, JJM ;
Chang, CP ;
Hsu, LH .
IEEE TRANSACTIONS ON COMPUTERS, 2005, 54 (02) :165-175
[12]   A fast fault-identification algorithm for bijective connection graphs using the PMC model [J].
Li, Tseng-Kuei ;
Tsai, Chang-Hsiung ;
Hsu, Hong-Chun .
INFORMATION SCIENCES, 2012, 187 :291-297
[13]   Performance evaluation on hybrid fault diagnosability of regular networks [J].
Lian, Guanqin ;
Zhou, Shuming ;
Hsieh, Sun-Yuan ;
Liu, Jiafei ;
Chen, Gaolin ;
Wang, Yihong .
THEORETICAL COMPUTER SCIENCE, 2019, 796 :147-153
[14]   Intermittent Fault Diagnosability of Interconnection Networks [J].
Liang, Jia-Rong ;
Feng, Hao ;
Du, Xiaojiang .
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 32 (06) :1279-1287
[15]   An evolutionary fault diagnosis algorithm for interconnection networks under the PMC model [J].
Liang, Jiarong ;
Guo, Yang ;
Xie, Yuhao ;
Liang, Xinyu .
JOURNAL OF SUPERCOMPUTING, 2023, 79 (09) :9964-9984
[16]   A Comparison Fault Diagnosis Algorithm for Star Networks [J].
Liang, Jiarong ;
Zhang, Qian ;
Li, Changzhen .
IEEE ACCESS, 2021, 9 :94214-94223
[17]   Conditional-Fault Diagnosability of Multiprocessor Systems with an Efficient Local Diagnosis Algorithm under the PMC Model [J].
Lin, Cheng-Kuan ;
Kung, Tzu-Liang ;
Tan, Jimmy J. M. .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (10) :1669-1680
[18]   FFNLFD: Fault Diagnosis of Multiprocessor Systems at Local Node With Fault-Free Neighbors Under PMC Model and MM* Model [J].
Lin, Limei ;
Huang, Yanze ;
Lin, Yuhang ;
Hsieh, Sun-Yuan ;
Xu, Li .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (07) :1739-1751
[19]   A Pessimistic Fault Diagnosability of Large-Scale Connected Networks via Extra Connectivity [J].
Lin, Limei ;
Huang, Yanze ;
Xu, Li ;
Hsieh, Sun-Yuan .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (02) :415-428
[20]  
Maeng J., 1981, FTCS-11. The Eleventh Annual International Symposium on Fault-Tolerant Computing, P173