Fault-Tolerant Adaptive Routing in Dragonfly Networks

Cited by: 21
Authors
Xiang, Dong [1]
Li, Bing [2]
Fu, Yi [1]
Affiliations
[1] Tsinghua Univ, Sch Software, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
Funding
U.S. National Science Foundation
Keywords
Dragonfly networks; flow-control scheme; deadlock-free adaptive fault-tolerant routing; SCHEME; ALGORITHM; IMMUNET; MESHES; DCELL;
DOI
10.1109/TDSC.2017.2693372
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Dragonfly networks are widely used in current high-performance computers and high-end servers. Fault-tolerant routing in dragonfly networks is essential, and the rich interconnects give the network good fault-tolerance ability. A new deadlock-free adaptive fault-tolerant routing algorithm, based on a new two-layer safety information model, is proposed by mapping the routers in a group, and the groups of the dragonfly network, into two separate hypercubes. The new algorithm tolerates both static and dynamic faults. Using the new safety information model, the method can determine at the source whether a packet can reach its destination, which avoids dead ends and aimless misrouting. Extensive simulation results show that, in many cases, the proposed fault-tolerant routing algorithm outperforms the previous minimal routing algorithm even in fault-free networks.
Pages: 259-271
Page count: 13
Related papers
50 in total
  • [41] Fault-tolerant routing algorithms for hypercube interconnection networks
    Kaneko, K
    Ito, H
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (01): : 121 - 128
  • [42] FAULT-TOLERANT ROUTING IN DEBRUIJN COMMUNICATION-NETWORKS
    ESFAHANIAN, AH
    HAKIMI, SL
    IEEE TRANSACTIONS ON COMPUTERS, 1985, 34 (09) : 777 - 788
  • [43] Fault-tolerant routing in dual-cube networks
    Jiang, Z
    Wu, J
    PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2003, : 389 - 392
  • [44] Adaptive routing strategies for fault-tolerant on-chip networks in dynamically reconfigurable systems
    Nunez-Yanez, J. L.
    Edwards, D.
    Coppola, A. M.
    IET COMPUTERS AND DIGITAL TECHNIQUES, 2008, 2 (03): : 184 - 198
  • [45] A fault-tolerant fully adaptive routing algorithm for collaborative computing in wireless mesh networks
    Liang, Cao
    Huang, Xin-Ming
    Ma, Jing
    INDEPENDENT COMPONENT ANALYSES, WAVELETS, UNSUPERVISED NANO-BIOMIMETIC SENSORS, AND NEURAL NETWORKS V, 2007, 6576
  • [46] A self-adaptive and fault-tolerant routing algorithm for wireless sensor networks in microgrids
    Rui, Lanlan
    Wang, Xiaotong
    Zhang, Yao
    Wang, Xiaomei
    Qiu, Xuesong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 100 : 35 - 45
  • [47] Optimal fault-tolerant routing algorithm and fault-tolerant diameter in directed double-loop networks
    Chen, Yebin
    Li, Ying
    Chen, Tao
    THEORETICAL COMPUTER SCIENCE, 2013, 468 : 50 - 58
  • [48] Shortest path routing and fault-tolerant routing on de Bruijn networks
    Mao, JW
    Yang, CB
    NETWORKS, 2000, 35 (03) : 207 - 215
  • [49] A Congestion-adaptive Fault-tolerant Routing Algorithm on HNoC
    Fang, Juan
    Cheng, Yanjin
    Zhao, Hui
    2019 9TH IEEE ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2019), 2019, : 911 - 916
  • [50] Performance of an adaptive fault-tolerant routing algorithm for multicast communications
    Borella, A
    Cancellieri, G
    ATM, NETWORKS AND LANS - NOC '96-II, 1996, : 295 - 296