HULA: Scalable Load Balancing Using Programmable Data Planes

被引:236
作者
Katta, Naga [1 ]
Hira, Mukesh [2 ]
Kim, Changhoon [3 ]
Sivaraman, Anirudh [4 ]
Rexford, Jennifer [1 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
[2] VMware, Palo Alto, CA USA
[3] Barefoot Networks, Palo Alto, CA USA
[4] MIT CSAIL, Cambridge, MA USA
来源
SYMPOSIUM ON SOFTWARE DEFINED NETWORKING (SDN) RESEARCH (SOSR'16) | 2016年
基金
美国国家科学基金会;
关键词
In-Network Load Balancing; Programmable Switches; Network Congestion; Scalability;
D O I
10.1145/2890955.2890968
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Datacenter networks employ multi-rooted topologies (e.g., Leaf-Spine, Fat-Tree) to provide large bisection bandwidth. These topologies use a large degree of multipathing, and need a data-plane load-balancing mechanism to effectively utilize their bisection bandwidth. The canonical load-balancing mechanism is equal-cost multipath routing (ECMP), which spreads traffic uniformly across multiple paths. Motivated by ECMP's shortcomings, congestion-aware load-balancing techniques such as CONGA have been developed. These techniques have two limitations. First, because switch memory is limited, they can only maintain a small amount of congestion-tracking state at the edge switches, and do not scale to large topologies. Second, because they are implemented in custom hardware, they cannot be modified in the field. This paper presents HULA, a data-plane load-balancing algorithm that overcomes both limitations. First, instead of having the leaf switches track congestion on all paths to a destination, each HULA switch tracks congestion for the best path to a destination through a neighboring switch. Second, we design HULA for emerging programmable switches and program it in P4 to demonstrate that HULA could be run on such programmable chipsets, without requiring custom hardware. We evaluate HULA extensively in simulation, showing that it outperforms a scalable extension to CONGA in average flow completion time (1.6x at 50% load, 3x at 90% load).
引用
收藏
页数:12
相关论文
共 47 条
  • [41] Performance of Scalable Off-The-Shelf Hardware for Data-intensive Parallel Processing using MapReduce
    Fadzil, Ahmad Firdaus Ahmad
    Khalid, Noor Elaiza Abdul
    Manaf, Mazani
    2012 7TH INTERNATIONAL CONFERENCE ON COMPUTING AND CONVERGENCE TECHNOLOGY (ICCCT2012), 2012, : 379 - 384
  • [42] STG-MTL: scalable task grouping for multi-task learning using data maps
    Sherif, Ammar
    Abid, Abubakar
    Elattar, Mustafa
    Elhelw, Mohamed
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (02):
  • [43] Flexible Interconnection of Scalable Systems Integrated Using Optical Networks (FISSION) Data-Center-Concepts and Demonstration
    Kushwaha, Aniruddha
    Gumaste, Ashwin
    Das, Tamal
    Hote, Saurabh
    Wen, Yonggang
    JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2017, 9 (07) : 585 - 600
  • [44] Scalable High Speed Serial Interface for Data Converters Using The JESD204B Industry Standard
    Saheb, Hakim
    Haider, Syed
    2014 9TH INTERNATIONAL DESIGN & TEST SYMPOSIUM (IDT), 2014, : 6 - 11
  • [45] Scalable distributed collaborative editing for 3D models using conflict-free data structure
    Imae, Kengo
    Hayashibara, Naohiro
    INTERNATIONAL JOURNAL OF SPACE-BASED AND SITUATED COMPUTING, 2019, 9 (01) : 11 - 21
  • [46] Scalable Lossless Coding of Dynamic Medical CT Data Using Motion Compensated Wavelet Lifting with Denoised Prediction and Update
    Lanz, Daniela
    Schilling, Franz
    Kaup, Andr
    2019 PICTURE CODING SYMPOSIUM (PCS), 2019,
  • [47] Criso: An Incremental Scalable and Cost-Effective Data Center Interconnection by Using 2-port servers and low-end switches
    Feng, Hao
    Deng, Yuhui
    Zhao, Yufan
    2018 IEEE 26TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS), 2018, : 123 - 130