All-to-All Broadcast Algorithm in Galaxyfly Networks

Authors
Zhuang, Hongbin [1]
Chang, Jou-Ming [2]
Li, Xiao-Yan [1]
Song, Fangying [3]
Lin, Qinying [1]
Affiliations
[1] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350108, Peoples R China
[2] Natl Taipei Univ Business, Inst Informat & Decis Sci, Taipei 10051, Taiwan
[3] Fuzhou Univ, Sch Math & Stat, Fuzhou 350108, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Galaxyfly network; all-to-all broadcast; interconnection network; algorithm; multicast communication; architecture
DOI
10.3390/math11112459
Chinese Library Classification number
O1 [Mathematics]
Discipline classification codes
0701; 070101
Abstract
The design of interconnection networks is a fundamental aspect of high-performance computing (HPC) systems. Among the available topologies, the Galaxyfly network stands out as a low-diameter, flexible-radix network for HPC applications. Because collective communication is critical to HPC performance, this paper presents two all-to-all broadcast algorithms for the Galaxyfly network, one following the supernode-first rule and the other following the router-first rule. The performance evaluation validates their effectiveness and shows that the first algorithm achieves higher utilization of network channels, while the second significantly reduces the average time routers need to collect packets from the supernode.
Pages: 14