One More Config is Enough: Saving (DC)TCP for High-Speed Extremely Shallow-Buffered Datacenters

被引:31
作者
Bai, Wei [1 ,2 ]
Hu, Shuihai [1 ,3 ]
Chen, Kai [4 ]
Tan, Kun [5 ]
Xiong, Yongqiang [6 ]
机构
[1] Hong Kong Univ Sci & Technol, iSING Lab, Hong Kong, Peoples R China
[2] Microsoft Res Lab, Redmond, WA 98052 USA
[3] Clustar Technol, Shenzhen 518052, Peoples R China
[4] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, iSING Lab, Hong Kong, Peoples R China
[5] Huawei Technol, Shenzhen 518129, Peoples R China
[6] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
Datacenter networks; congestion control; ECN; TAIL;
D O I
10.1109/TNET.2020.3032999
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The link speed in production datacenters is growing fast, from 1 Gbps to 40 Gbps or even 100 Gbps. However, the buffer size of commodity switches increases slowly, e.g., from 4 MB at 1 Gbps to 16 MB at 100 Gbps, thus significantly outpaced by the link speed. In such extremely shallow-buffered networks, today's TCP/ECN solutions, such as DCTCP, suffer from either excessive packet losses or significant throughput degradation. Motivated by this, we introduce BCC,(1) a simple yet effective solution that requires only one more ECN configuration (i.e., shared buffer ECN/RED) at commodity switches. BCC operates upon real-time global shared buffer utilization. When available buffer space suffices, BCC delivers both high throughput and low packet loss rate as prior work; When it gets insufficient, BCC automatically triggers the shared buffer ECN to prevent packet loss at the cost of sacrificing a small amount of throughput. BCC is readily deployable with existing commodity switches. We validate BCC's efficacy in a 100G testbed and evaluate its performance using extensive simulations. Our results show that BCC maintains low packet loss rate persistently while only slightly degrading throughput when the buffer becomes insufficient. For example, compared to current practice, BCC achieves up to 94.4% lower 99th percentile flow completion time (FCT) for small flows while only degrading average FCT for large flows by up to 3%.(1) BCC: Buffer-aware Active Queue Management (AQM) scheme for Congestion Control in extremely shallow-buffered datacenters.
引用
收藏
页码:489 / 502
页数:14
相关论文
共 54 条
  • [1] Alizadeh M, 2012, P 9 USENIX C NETW SY
  • [2] Alizadeh M., 2011, P ACM SIGMETRICS JOI, V39, P73, DOI DOI 10.1145/2007116.2007125
  • [3] CONGA: Distributed Congestion-Aware Load Balancing for Datacenters
    Alizadeh, Mohammad
    Edsall, Tom
    Dharmapurikar, Sarang
    Vaidyanathan, Ramanan
    Chu, Kevin
    Fingerhut, Andy
    Lam, Vinh The
    Matus, Francis
    Pan, Rong
    Yadav, Navindra
    Varghese, George
    [J]. SIGCOMM'14: PROCEEDINGS OF THE 2014 ACM CONFERENCE ON SPECIAL INTEREST GROUP ON DATA COMMUNICATION, 2014, : 503 - 514
  • [4] pFabric: Minimal Near-Optimal Datacenter Transport
    Alizadeh, Mohammad
    Yang, Shuang
    Sharif, Milad
    Katti, Sachin
    McKeown, Nick
    Prabhakar, Balaji
    Shenker, Scott
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2013, 43 (04) : 435 - 446
  • [5] Data Center TCP (DCTCP)
    Alizadeh, Mohammad
    Greenberg, Albert
    Maltz, David A.
    Padhye, Jitendra
    Patel, Parveen
    Prabhakar, Balaji
    Sengupta, Sudipta
    Sridharan, Murari
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2010, 40 (04) : 63 - 74
  • [6] [Anonymous], 2014, 11 USENIX S NETWORKE
  • [7] [Anonymous], 2015, ACM CONEXT
  • [8] Sizing router buffers
    Appenzeller, G
    Keslassy, I
    McKeown, N
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2004, 34 (04) : 281 - 292
  • [9] Bai W, 2020, IEEE INFOCOM SER, P2007, DOI [10.1109/INFOCOM41043.2020.9155280, 10.1109/infocom41043.2020.9155280]
  • [10] Enabling ECN over Generic Packet Scheduling
    Bai, Wei
    Chen, Kai
    Chen, Li
    Kim, Changhoon
    Wu, Haitao
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON EMERGING NETWORKING EXPERIMENTS AND TECHNOLOGIES (CONEXT'16), 2016, : 191 - 204