Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities

被引:285
作者
Montazeri, Behnam [1 ]
Li, Yilong [1 ]
Alizadeh, Mohammad [2 ]
Ousterhout, John [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] MIT, Cambridge, MA 02139 USA
来源
PROCEEDINGS OF THE 2018 CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION (SIGCOMM '18) | 2018年
关键词
Data centers; low latency; network stacks; transport protocols;
D O I
10.1145/3230543.3230564
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Homa is a new transport protocol for datacenter networks. It provides exceptionally low latency, especially for workloads with a high volume of very short messages, and it also supports large messages and high network utilization. Homa uses in-network priority queues to ensure low latency for short messages; priority allocation is managed dynamically by each receiver and integrated with a receiver-driven flow control mechanism. Homa also uses controlled overcommitment of receiver downlinks to ensure efficient bandwidth utilization at high load. Our implementation of Homa delivers 99th percentile round-trip times less than 15 mu s for short messages on a 10 Gbps network running at 80% load. These latencies are almost 100x lower than the best published measurements of an implementation. In simulations, Homa's latency is roughly equal to pFabric and significantly better than pHost, PIAS, and NDP for almost all message sizes and workloads. Homa can also sustain higher network loads than pFabric, pHost, or PIAS.
引用
收藏
页码:221 / 235
页数:15
相关论文
共 31 条
[1]   CONGA: Distributed Congestion-Aware Load Balancing for Datacenters [J].
Alizadeh, Mohammad ;
Edsall, Tom ;
Dharmapurikar, Sarang ;
Vaidyanathan, Ramanan ;
Chu, Kevin ;
Fingerhut, Andy ;
Vinh The Lam ;
Matus, Francis ;
Pan, Rong ;
Yadav, Navindra ;
Varghese, George .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2014, 44 (04) :503-514
[2]   pFabric: Minimal Near-Optimal Datacenter Transport [J].
Alizadeh, Mohammad ;
Yang, Shuang ;
Sharif, Milad ;
Katti, Sachin ;
McKeown, Nick ;
Prabhakar, Balaji ;
Shenker, Scott .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2013, 43 (04) :435-446
[3]  
Alizadeh M, 2012, IEEE CONF OPEN SYST, P196
[4]  
[Anonymous], 2010, SIGCOMM Comput. Commun. Rev., DOI DOI 10.1145/1851275.1851192
[5]  
[Anonymous], CORR
[6]  
[Anonymous], 2013, 10 USENIX S NETW SYS
[7]  
[Anonymous], [No title captured]
[8]  
[Anonymous], 2014, P 11 USENIX C NETW S
[9]  
[Anonymous], 2015, P 2015 ACM C EM NETW
[10]  
[Anonymous], 2015, P 12 USENIX S NETWOR