Multi-Resource Packing for Cluster Schedulers

被引:306
作者
Grandl, Robert [1 ,2 ]
Ananthanarayanan, Ganesh [1 ,3 ]
Kandula, Srikanth [1 ]
Rao, Sriram [1 ]
Akella, Aditya [1 ,2 ]
机构
[1] Microsoft, Redmond, WA 98052 USA
[2] Univ Wisconsin, Madison, WI 53706 USA
[3] Univ Calif Berkeley, Berkeley, CA 94720 USA
关键词
Cluster schedulers; multi-dimensional packing; makespan; completion time; fairness;
D O I
10.1145/2740070.2626334
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Tasks in modern data-parallel clusters have highly diverse resource requirements along CPU, memory, disk and network. We present Tetris, a multi-resource cluster scheduler that packs tasks to machines based on their requirements of all resource types. Doing so avoids resource fragmentation as well as over-allocation of the resources that are not explicitly allocated, both of which are drawbacks of current schedulers. Tetris adapts heuristics for the multidimensional bin packing problem to the context of cluster schedulers wherein task arrivals and machine availability change in an online manner and wherein task's resource needs change with time and with the machine that the task is placed at. In addition, Tetris improves average job completion time by preferentially serving jobs that have less remaining work. We observe that fair allocations do not offer the best performance and the above heuristics are compatible with a large class of fairness policies; hence, we show how to simultaneously achieve good performance and fairness. Trace-driven simulations and deployment of our Apache YARN prototype on a 250 node cluster show gains of over 30% in makespan and job completion time while achieving nearly perfect fairness.
引用
收藏
页码:455 / 466
页数:12
相关论文
共 24 条
[1]  
Agarwal Sameer., 2012, NSDI
[2]  
Al-Fares M., 2008, SIGCOMM
[3]  
[Anonymous], 2001, Approximation algorithms
[4]  
[Anonymous], 2009, SIGCOMM
[5]  
[Anonymous], P VLDB ENDOW
[6]  
Azar Yossi., 2013, STOC
[7]  
Chowdhury M., 2013, SIGCOMM
[8]  
Chowdhury Mosharaf., 2011, SIGCOMM
[9]  
Gulwani Sumit., 2009, POPL
[10]  
Guo C., 2009, SIGCOMM