HULA: Scalable Load Balancing Using Programmable Data Planes

Cited by: 236

Authors:

Katta, Naga [1]

Hira, Mukesh [2]

Kim, Changhoon [3]

Sivaraman, Anirudh [4]

Rexford, Jennifer [1]

Affiliations:

[1] Princeton Univ, Princeton, NJ 08544 USA

[2] VMware, Palo Alto, CA USA

[3] Barefoot Networks, Palo Alto, CA USA

[4] MIT CSAIL, Cambridge, MA USA

Source:

SYMPOSIUM ON SOFTWARE DEFINED NETWORKING (SDN) RESEARCH (SOSR'16) | 2016

Funding:

US National Science Foundation;

Keywords:

In-Network Load Balancing; Programmable Switches; Network Congestion; Scalability;

DOI:

10.1145/2890955.2890968

Chinese Library Classification number:

TP31 [Computer software];

Discipline classification code:

081202; 0835;

Abstract:

Datacenter networks employ multi-rooted topologies (e.g., Leaf-Spine, Fat-Tree) to provide large bisection bandwidth. These topologies use a large degree of multipathing, and need a data-plane load-balancing mechanism to effectively utilize their bisection bandwidth. The canonical load-balancing mechanism is equal-cost multipath routing (ECMP), which spreads traffic uniformly across multiple paths. Motivated by ECMP's shortcomings, congestion-aware load-balancing techniques such as CONGA have been developed. These techniques have two limitations. First, because switch memory is limited, they can only maintain a small amount of congestion-tracking state at the edge switches, and do not scale to large topologies. Second, because they are implemented in custom hardware, they cannot be modified in the field. This paper presents HULA, a data-plane load-balancing algorithm that overcomes both limitations. First, instead of having the leaf switches track congestion on all paths to a destination, each HULA switch tracks congestion for the best path to a destination through a neighboring switch. Second, we design HULA for emerging programmable switches and program it in P4 to demonstrate that HULA could be run on such programmable chipsets, without requiring custom hardware. We evaluate HULA extensively in simulation, showing that it outperforms a scalable extension to CONGA in average flow completion time (1.6x at 50% load, 3x at 90% load).
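The abstract's first design point, keeping only the best path per destination through a neighboring switch, can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the paper's P4 program: the class `HulaLikeSwitch`, the method `on_report`, and the switch names in the usage note are invented for illustration. The sketch also assumes that neighbors report a path utilization value, that a path's congestion is the utilization of its most loaded link, and that a report from the current best hop always refreshes the stored value.

```python
from dataclasses import dataclass, field


@dataclass
class BestHop:
    next_hop: str      # neighbor through which the best known path runs
    path_util: float   # utilization of the most loaded link on that path


@dataclass
class HulaLikeSwitch:
    name: str
    # One entry per destination, not one entry per path (hypothetical layout).
    best: dict = field(default_factory=dict)

    def on_report(self, dst, via, path_util, local_link_util):
        """Process a congestion report for `dst` received from neighbor `via`."""
        # Assumption: path congestion is the bottleneck (maximum) utilization,
        # including the local link toward the reporting neighbor.
        util = max(path_util, local_link_util)
        current = self.best.get(dst)
        # Switch to a less congested path, or refresh the stored value when
        # the report comes from the neighbor that is already the best hop.
        if current is None or util < current.path_util or via == current.next_hop:
            self.best[dst] = BestHop(via, util)
        return self.best[dst]

    def next_hop(self, dst):
        """Return the neighbor currently considered best for `dst`."""
        return self.best[dst].next_hop
```

Under these assumptions, a switch that receives reports for destination `"torB"` from `"spine1"` and `"spine2"` stores a single `BestHop` entry for `"torB"`, so its state grows with the number of destinations rather than with the number of paths.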


Pages: 12
