FAULT-TOLERANT DISTRIBUTED SUBCUBE MANAGEMENT SCHEME FOR HYPERCUBE MULTICOMPUTER SYSTEMS

Authors
CHEN, YL
LIU, JC
Affiliation
[1] Department of Computer Science, Texas A&M University, College Station
Keywords
DISTRIBUTED SUBCUBE MANAGEMENT; FAULT-TOLERANCE; HYPERCUBE MULTICOMPUTER; RELIABLE BROADCAST
DOI
10.1109/71.395406
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
This paper proposes a fault-tolerant distributed subcube management scheme for hypercube multicomputer systems. Gracefully degradable subcube management is supported by a data structure, called the distributed subcube table (DST), and a fault-tolerant broadcast protocol, called the reliably synchronized broadcast (RSB). In an $n$-dimensional hypercube, the DST is the collection of $2^n$ local subcube tables (LSTs), $\mathrm{DST} = \{\mathrm{LST}(0), \mathrm{LST}(1), \ldots, \mathrm{LST}(2^n - 1)\}$, where $\mathrm{LST}(x)$ is a bit-mapped table assigned to $N_x$, a fault-free node whose address is $x$. For every $x$, $\mathrm{LST}(x)$ is $n + 1$ bits long and records the status (free/busy) of certain subcubes adjacent to $N_x$. The RSB diagnoses and avoids faults during interprocessor communication to prevent faulty nodes from being allocated for job execution. In addition to possessing a fault-tolerant design, the scheme achieves performance comparable to or better than that of existing centralized schemes, as verified by extensive simulation.
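The table layout described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say which subcube each of the n + 1 bits refers to, so the entry index `k` here is an uninterpreted slot. The names `LocalSubcubeTable`, `build_dst`, and `neighbors`, and the choice to give faulty nodes no table at all, are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class LocalSubcubeTable:
    """Hypothetical LST(x): an (n + 1)-bit free/busy bitmap held by node x."""
    n: int          # hypercube dimension
    bits: int = 0   # bit k set => entry k is busy (what entry k denotes is assumed)

    def _check(self, k: int) -> None:
        # Valid entries are 0..n, giving n + 1 bits in total.
        if not 0 <= k <= self.n:
            raise IndexError(f"entry {k} outside 0..{self.n}")

    def set_busy(self, k: int) -> None:
        self._check(k)
        self.bits |= 1 << k

    def set_free(self, k: int) -> None:
        self._check(k)
        self.bits &= ~(1 << k)

    def is_free(self, k: int) -> bool:
        self._check(k)
        return not (self.bits >> k) & 1


def neighbors(x: int, n: int) -> list[int]:
    """Addresses adjacent to x in an n-cube: flip one address bit at a time."""
    return [x ^ (1 << i) for i in range(n)]


def build_dst(n: int, faulty: set[int]) -> dict[int, LocalSubcubeTable]:
    """DST as one LST per fault-free node address (assumption: faulty nodes hold none)."""
    return {x: LocalSubcubeTable(n) for x in range(2 ** n) if x not in faulty}
```

For example, `build_dst(3, {5})` yields seven tables of four bits each, and `neighbors(0, 3)` lists the three nodes adjacent to address 0. Keeping each table as a single integer means the whole DST costs n + 1 bits per node, which matches the size stated in the abstract.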
Pages: 766-772
Page count: 7