FAULT-TOLERANT DISTRIBUTED SUBCUBE MANAGEMENT SCHEME FOR HYPERCUBE MULTICOMPUTER SYSTEMS

被引:0
|
作者
CHEN, YL
LIU, JC
机构
[1] Department of Computer Science, Texas A & M University, College Station
关键词
DISTRIBUTED SUBCUBE MANAGEMENT; FAULT-TOLERANCE; PERCUBE MULTICOMPUTER; RELIABLE BROADCAST;
D O I
10.1109/71.395406
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper proposes a fault tolerant distributed subcube management scheme for hypercube multicomputer systems. Gracefully degradable subcube management is supported by a data structure, called the distributed subcube table (DST), and a fault tolerant broadcast protocol, called the reliably synchronized broadcast (RSB). In an n-dimensional hypercube, DST is the collection of 2(n) local subcube tables (LSTs), DST = {LST(0), LST(1), ..., LST(2-1)(n)}, where LST(x) is a bit mapped table assigned to N-x, a fault free node whose address is x. LST(x), For All x, is n + 1 bits long, and it records the status (free/busy) of certain subcubes adjacent to N-x. The RSB diagnoses and avoids faults during interprocessor communication to prevent faulty nodes from being allocated for job execution. In addition to possessing a fault-tolerant design, our scheme can also achieve comparable or better performance than existing centralized schemes, as verified by extensive simulation.
引用
收藏
页码:766 / 772
页数:7
相关论文
共 50 条
  • [1] Highly fault-tolerant hypercube multicomputer
    Izadi, BA
    Özgüner, F
    Acan, A
    IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1999, 146 (02): : 77 - 82
  • [2] Subcube reliability of a modular fault-tolerant hypercube architecture
    AbdElBarr, MH
    Benten, MS
    Hai, MA
    KUWAIT JOURNAL OF SCIENCE & ENGINEERING, 1996, : 7 - 25
  • [3] Real-time fault-tolerant hypercube multicomputer
    Izadi, BA
    Özgüner, F
    IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 2002, 149 (05): : 197 - 202
  • [4] A fault-tolerant tree communication scheme for hypercube systems
    Leu, YR
    Kuo, SY
    IEEE TRANSACTIONS ON COMPUTERS, 1996, 45 (06) : 641 - 650
  • [5] Fault tolerant subcube allocation in hypercube
    Hashimoto, H
    Masuyama, H
    Sasama, T
    SECOND INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS, AND NETWORKS (I-SPAN '96), PROCEEDINGS, 1996, : 401 - 407
  • [6] A Novel Fault-Tolerant Scheme for Distributed Systems
    Zhang, Xiaoqin
    Wei, Zhidong
    Zhang, Fenggui
    Liu, Guoliang
    CEIS 2011, 2011, 15
  • [7] A FAULT-TOLERANT COMMUNICATION SCHEME FOR HYPERCUBE COMPUTERS
    LEE, TC
    HAYES, JP
    IEEE TRANSACTIONS ON COMPUTERS, 1992, 41 (10) : 1242 - 1256
  • [8] AN ADAPTIVE DEPENDABLE FAULT-TOLERANT SCHEME FOR DISTRIBUTED SYSTEMS
    Liu, Guoliang
    Chen, Shuyu
    THIRD INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND TECHNOLOGY (ICCET 2011), 2011, : 697 - 702
  • [9] Optimal Fault-Tolerant Routing Scheme for generalized hypercube
    Tian, SH
    Lu, YP
    Zhang, DF
    11TH PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING, PROCEEDINGS, 2005, : 91 - 98
  • [10] Efficient fault-tolerant multicast scheme for hypercube multicomputers
    Natl Taiwan Univ of Science and, Technology, Taipei, Taiwan
    IEEE Trans Parallel Distrib Syst, 10 (952-962):