Fault-Tolerant Adaptive Routing in Dragonfly Networks

被引:21
|
作者
Xiang, Dong [1 ]
Li, Bing [2 ]
Fu, Yi [1 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
基金
美国国家科学基金会;
关键词
Dragonfly networks; flow-control scheme; deadlock-free adaptive fault-tolerant routing; SCHEME; ALGORITHM; IMMUNET; MESHES; DCELL;
D O I
10.1109/TDSC.2017.2693372
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Dragonfly networks have been widely used in the current high-performance computers or high-end servers. Fault-tolerant routing in dragonfly networks is essential. The rich interconnects provide good fault-tolerance ability for the network. A new deadlock-free adaptive fault-tolerant routing algorithm based on a new two-layer safety information model, is proposed by mapping routers in a group, and groups of the dragonfly network into two separate hypercubes. The new fault-tolerant routing algorithm tolerates static and dynamic faults. Our method can determine whether a packet can reach the destination at the source by using the new safety information model, which avoids dead-ends and aimless misrouting. Sufficient simulation results show that the proposed fault-tolerant routing algorithm even outperforms the previous minimal routing algorithm in fault-free networks in many cases.
引用
收藏
页码:259 / 271
页数:13
相关论文
共 50 条
  • [1] Adaptive fault-tolerant wormhole routing for torus networks
    Shih, JD
    1998 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1998, : 558 - 565
  • [2] Adaptive Stochastic Routing in Fault-tolerant On-chip Networks
    Song, Wei
    Edwards, Doug
    Nunez-Yanez, Jose Luis
    Dasgupta, Sohini
    2009 3RD ACM/IEEE INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP, 2009, : 32 - +
  • [3] A new adaptive fault-tolerant routing methodology for direct networks
    Gómez, ME
    Duato, J
    Flich, J
    López, P
    Robles, A
    Nordbotten, NA
    Skeie, T
    Lysne, O
    HIGH PERFORMANCE COMPUTING - HIPC 2004, 2004, 3296 : 462 - 473
  • [4] Compressionless routing: A framework for adaptive and fault-tolerant routing
    Kim, JH
    Liu, ZQ
    Chien, AA
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1997, 8 (03) : 229 - 244
  • [5] Deadlock-free adaptive routing in fault-tolerant mesh networks
    Xiang, Dong
    Zhang, Yue-Li
    Jisuanji Xuebao/Chinese Journal of Computers, 2007, 30 (11): : 1954 - 1962
  • [6] An Adaptive Learning Approach for Fault-Tolerant Routing in Ad Hoc Networks
    Misra, Sudip
    Krishna, P. Venkata
    Bhiwal, Akhil
    Chawla, Amardeep Singh
    Wolfinger, Bernd E.
    E-TECHNOLOGIES AND NETWORKS FOR DEVELOPMENT, 2011, 171 : 15 - 25
  • [7] Adaptive message routing in a class of fault-tolerant multistage interconnection networks
    Sch. of Elec. and Electron. Eng., Nanyang Technological University, Singapore 639798, Singapore
    不详
    不详
    Computers and Electrical Engineering, 1997, 23 (04): : 239 - 247
  • [8] Adaptive message routing in a class of fault-tolerant multistage interconnection networks
    Zhou, YQ
    Min, YH
    COMPUTERS & ELECTRICAL ENGINEERING, 1997, 23 (04) : 239 - 247
  • [9] An adaptive and fault-tolerant routing algorithm for meshes
    Shamaei, A.
    Sarbazi-Azad, H.
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2008, PT 1, PROCEEDINGS, 2008, 5072 : 1235 - +
  • [10] ADAPTIVE FAULT-TOLERANT ROUTING IN HYPERCUBE MULTICOMPUTERS
    CHEN, MS
    SHIN, KG
    IEEE TRANSACTIONS ON COMPUTERS, 1990, 39 (12) : 1406 - 1416