Load Balancing With Multi-Level Signals for Lossless Datacenter Networks

被引:18
|
作者
Hu, Jinbin [1 ,2 ]
Zeng, Chaoliang [3 ]
Wang, Zilong [3 ]
Zhang, Junxue [3 ]
Guo, Kun [4 ]
Xu, Hong [5 ]
Huang, Jiawei [6 ]
Chen, Kai [3 ]
机构
[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410076, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] Hong Kong Univ Sci & Technol, Comp Sci & Engn Dept, Hong Kong, Peoples R China
[4] Fuzhou Univ, Comp Sci & Technol & Management Dept, Fuzhou 350108, Peoples R China
[5] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[6] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
基金
中国国家自然科学基金;
关键词
Load management; Switches; Delays; Receivers; Load modeling; Computer science; Transport protocols; Datacenter; lossless networks; load balancing;
D O I
10.1109/TNET.2024.3366336
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Various datacenter network (DCN) load balancing schemes have been proposed in the past decade. Unfortunately, most of these solutions designed for lossy DCNs do not work well for Priority Flow Control (PFC) enabled lossless DCNs, primarily due to the reason that the individual congestion signals used in these solutions, e.g., link load, queue length, Round Trip Time (RTT) and Explicit Congestion Notification (ECN), may not be able to correctly or timely reflect the hop-by-hop PFC pausing. This paper first reveals the above problems via extensive experiments, and then based on the insights learned, we present Proteus, a PFC-aware load balancing scheme that is resilient to PFC pausing by exploring a combination of multi-level congestion signals. At its heart, Proteus leverages RTT-level signals (i.e., RTT and link utilization) to detect path status for initial routing decision, and exploits sub-RTT level signal (i.e., cumulative sojourn time) to reflect instantaneous PFC pausing and make timely rerouting choices based on the idea of better-late-than-never. We have implemented Proteus in the hardware programmable switch. Our testbed experiments as well as large-scale simulations show that Proteus can effectively handle PFC pausing under realistic workloads and achieve up to 35%, 31%, 28%, 22% and 46%, 42%, 34%, 29% better average FCT and 99(th) percentile FCT than CONGA, DRILL, Hermes and MP-RDMA, respectively.
引用
收藏
页码:2736 / 2748
页数:13
相关论文
共 50 条
  • [1] RLB: Reordering-Robust Load Balancing in Lossless Datacenter Networks
    Hu, Jinbin
    He, Yi
    Wang, Jin
    Luo, Wangqing
    Huang, Jiawei
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 576 - 584
  • [2] A Novel Load Balancing Scheme Based on PFC Prediction in Lossless Datacenter Networks
    Wang, Jin
    He, Yi
    Luo, Wangqing
    Rao, Shuying
    Hu, Jinbin
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2024, 14
  • [3] MULTI-LEVEL LOAD BALANCING FOR PARALLEL PARTICLE SIMULATIONS
    Sutmann, Godehard
    VI INTERNATIONAL CONFERENCE ON PARTICLE-BASED METHODS (PARTICLES 2019): FUNDAMENTALS AND APPLICATIONS, 2019, : 80 - 92
  • [5] Multi-level Load Balancing with an Integrated Runtime Approach
    Bak, Seonmyeong
    Menon, Harshitha
    White, Sam
    Diener, Matthias
    Kale, Laxmikant
    2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018, : 31 - 40
  • [6] Load balancing for heterogeneous traffic in datacenter networks
    Wang, Jin
    Rao, Shuying
    Liu, Ying
    Sharma, Pradip Kumar
    Hu, Jinbin
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2023, 217
  • [7] Meet: Rack-Level Pooling Based Load Balancing in Datacenter Networks
    Dong, Jiaqing
    Tan, Lijuan
    Tian, Chen
    Zhou, Yuhang
    Wang, Yi
    Dou, Wanchun
    Chen, Guihai
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (12) : 3628 - 3639
  • [8] Cooperative Fog Communications using A Multi-Level Load Balancing
    Mostafa, Nour
    2019 FOURTH INTERNATIONAL CONFERENCE ON FOG AND MOBILE EDGE COMPUTING (FMEC), 2019, : 45 - 51
  • [9] Lyapunov Stability Analysis of Load Balancing in Datacenter Networks
    Dhananjayan, Amrith
    Seow, Kiam Tian
    Foh, Chuan Heng
    2013 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2013, : 912 - 916
  • [10] Topology-Aware Load Balancing in Datacenter Networks
    Khan, Tahir Abbas
    Khan, Muhammad Saeed
    Abbas, Sagheer
    Janjua, Jamshaid Iqbal
    Muhammad, Syed Shah
    Asif, Muhammad
    2021 IEEE ASIA PACIFIC CONFERENCE ON WIRELESS AND MOBILE (APWIMOB), 2021, : 220 - 225