Efficient Coflow Scheduling Without Prior Knowledge

Cited by: 50
Authors
Chowdhury, Mosharaf [1]
Stoica, Ion [1]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
Source
SIGCOMM'15: PROCEEDINGS OF THE 2015 ACM CONFERENCE ON SPECIAL INTEREST GROUP ON DATA COMMUNICATION | 2015
Keywords
Coflow; data-intensive applications; datacenter networks
DOI
10.1145/2785956.2787480
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Inter-coflow scheduling improves application-level communication performance in data-parallel clusters. However, existing efficient schedulers require a priori coflow information and ignore cluster dynamics like pipelining, task failures, and speculative executions, which limits their applicability. Schedulers without prior knowledge compromise on performance to avoid head-of-line blocking. In this paper, we present Aalo, which strikes a balance and efficiently schedules coflows without prior knowledge. Aalo employs Discretized Coflow-Aware Least-Attained Service (D-CLAS) to separate coflows into a small number of priority queues based on how much they have already sent across the cluster. By performing prioritization across queues and by scheduling coflows in FIFO order within each queue, Aalo's non-clairvoyant scheduler reduces coflow completion times while guaranteeing starvation freedom. EC2 deployments and trace-driven simulations show that communication stages complete 1.93x faster on average and 3.59x faster at the 95th percentile using Aalo in comparison to per-flow mechanisms. Aalo's performance is comparable to that of solutions using prior knowledge, and Aalo outperforms them in the presence of cluster dynamics.
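The queueing idea in the abstract can be illustrated with a minimal Python sketch. This is not Aalo's implementation: the threshold values, the `Coflow` record, and the function names `queue_index` and `schedule_order` are all hypothetical and chosen only for illustration. The sketch also simplifies "prioritization across queues" to a strict priority ordering, so it does not model the starvation-freedom guarantee.

```python
from bisect import bisect_right
from dataclasses import dataclass

# Hypothetical queue boundaries in bytes (assumed for illustration only).
THRESHOLDS = [10 * 2**20, 100 * 2**20, 1000 * 2**20]


@dataclass
class Coflow:
    name: str
    arrival: int     # arrival order (smaller means earlier)
    bytes_sent: int  # total bytes already sent across the cluster


def queue_index(bytes_sent, thresholds=THRESHOLDS):
    """Return 0 for the highest-priority queue; larger means lower priority."""
    # A coflow moves to a lower-priority queue once it has sent
    # at least as many bytes as the next boundary.
    return bisect_right(thresholds, bytes_sent)


def schedule_order(coflows, thresholds=THRESHOLDS):
    """Order coflows by queue first, then FIFO (arrival order) within a queue."""
    return sorted(
        coflows,
        key=lambda c: (queue_index(c.bytes_sent, thresholds), c.arrival),
    )
```

Because the queue is derived only from bytes already sent, the ordering needs no advance knowledge of coflow sizes, which is the non-clairvoyant property the abstract describes.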
Pages: 393-406
Page count: 14