TEL: Low-Latency Failover Traffic Engineering in Data Plane

被引:10
作者
Mostafaei, Habib [1 ]
Shojafar, Mohammad [2 ]
Conti, Mauro [3 ]
机构
[1] Tech Univ Berlin, Dept Telecommun Syst, D-10587 Berlin, Germany
[2] Univ Surrey, 5G & 6G Innovat Ctr, Inst Commun Syst, Guildford GU2 7XH, Surrey, England
[3] Univ Padua, Dept Math, I-35121 Padua, Italy
来源
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2021年 / 18卷 / 04期
关键词
Traffic engineering; network monitoring; programmable data plane; low-latency; link failure; reinforcement algorithm; ALGORITHMS;
D O I
10.1109/TNSM.2021.3099620
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modern network applications demand low-latency traffic engineering in the presence of network failure, while preserving the quality of service constraints like delay and capacity. Fast Re-Route (FRR) mechanisms are widely used for traffic re-routing purposes in failure scenarios. Control plane FRR typically computes the backup forwarding rules to detour the traffic in the data plane when the failure occurs. This mechanism could be computed in the data plane with the emergence of programmable data planes. In this paper, we propose a system (called TEL) that contains two FRR mechanisms, namely, TEL-C and TEL-D. The first one computes backup forwarding rules in the control plane, satisfying max-min fair allocation. The second mechanism provides FRR in the data plane. Both algorithms require minimal memory on programmable data planes and are well-suited with modern line rate match-action forwarding architectures (e.g., PISA). We implement both mechanisms on P4 programmable software switches (e.g., BMv2 and Tofino) and measure their performance on various topologies. The obtained results from a datacenter topology show that our FRR mechanism can improve the flow completion time up to 4.6x-7.3x (i.e., small flows) and 3.1x-12x (i.e., large flows) compared to recirculation-based mechanisms, such as F10, respectively.
引用
收藏
页码:4697 / 4710
页数:14
相关论文
共 57 条
  • [1] Data Center TCP (DCTCP)
    Alizadeh, Mohammad
    Greenberg, Albert
    Maltz, David A.
    Padhye, Jitendra
    Patel, Parveen
    Prabhakar, Balaji
    Sengupta, Sudipta
    Sridharan, Murari
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2010, 40 (04) : 63 - 74
  • [2] [Anonymous], 2021, DATA TRANSFERRED OUT
  • [3] [Anonymous], 2020, APACHE FLINK
  • [4] [Anonymous], 2020, TEL SOURCE CODE
  • [5] [Anonymous], 2020, 8021D2004 IEEE
  • [6] [Anonymous], 2000, FORW ERR CORR SUBM S
  • [7] A DAG-Based Forwarding Paradigm for Large Scale Software Defined Networks
    Avallone, Stefano
    Ashraf, Usman
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2020, 17 (01): : 577 - 591
  • [8] Bai W, 2016, 13TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION (NSDI '16), P537
  • [9] Barth, 2017, REP
  • [10] PINT: Probabilistic In-band Network Telemetry
    Ben Basat, Ran
    Ramanathan, Sivaramakrishnan
    Li, Yuliang
    Antichi, Gianni
    Yu, Minlan
    Mitzenmacher, Michael
    [J]. SIGCOMM '20: PROCEEDINGS OF THE 2020 ANNUAL CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION ON THE APPLICATIONS, TECHNOLOGIES, ARCHITECTURES, AND PROTOCOLS FOR COMPUTER COMMUNICATION, 2020, : 662 - 680