Phurti: Application and Network-Aware Flow Scheduling for Multi-Tenant MapReduce Clusters

被引:10
作者
Cai, Chris X. [1 ]
Saeed, Shayan [1 ]
Gupta, Indranil [1 ]
Campbell, Roy H. [1 ]
Le, Franck [2 ]
机构
[1] Univ Illinois, Dept Comp Sci, Champaign, IL 61801 USA
[2] IBM Corp, Res TJ Watson, Armonk, NY USA
来源
PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E) | 2016年
关键词
D O I
10.1109/IC2E.2016.21
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Traffic for a typical MapReduce job in a data center consists of multiple network flows. Traditionally, network resources have been allocated to optimize network-level metrics such as flow completion time or throughput. Some recent schemes propose using application-aware scheduling which can shorten the average job completion time. However, most of them treat the core network as a black box with sufficient capacity. Even if only one network link in the core network becomes a bottleneck, it can hurt application performance. We design and implement a centralized flow-scheduling framework called Phurti with the goal of improving the completion time for jobs in a cluster shared among multiple Hadoop jobs (multi-tenant). Phurti communicates both with the Hadoop framework to retrieve job-level network traffic information and the OpenFlow-based switches to learn about the network topology. Phurti implements a novel heuristic called Smallest Maximum Sequential-traffic First (SMSF) that uses collected application and network information to perform traffic scheduling for MapReduce jobs. Our evaluation with real Hadoop workloads shows that compared to application and network-agnostic scheduling strategies, Phurti improves job completion time for 95% of the jobs, decreases average job completion time by 20%, tail job completion time by 13% and scales well with the cluster size and number of jobs.
引用
收藏
页码:161 / 170
页数:10
相关论文
共 25 条
[1]  
Agarwal Sameer., 2012, P 9 USENIX C NETWORK, P21
[2]  
AL-FARES M., 2010, USENIX NSDI, V10
[3]   A scalable, commodity data center network architecture [J].
Al-Fares, Mohammad ;
Loukissas, Alexander ;
Vahdat, Amin .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2008, 38 (04) :63-74
[4]   pFabric: Minimal Near-Optimal Datacenter Transport [J].
Alizadeh, Mohammad ;
Yang, Shuang ;
Sharif, Milad ;
Katti, Sachin ;
McKeown, Nick ;
Prabhakar, Balaji ;
Shenker, Scott .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2013, 43 (04) :435-446
[5]   Data Center TCP (DCTCP) [J].
Alizadeh, Mohammad ;
Greenberg, Albert ;
Maltz, David A. ;
Padhye, Jitendra ;
Patel, Parveen ;
Prabhakar, Balaji ;
Sengupta, Sudipta ;
Sridharan, Murari .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2010, 40 (04) :63-74
[6]  
ALKAFF H., 2015, CLOUD ENG IC2E
[7]  
Ananthanarayanan Ganesh., 2012, NSDI, P267
[8]  
Chen Yanpei., 2011, Modeling, Analysis Simulation of Computer and Telecommunication Systems (MAS- COTS), 2011 IEEE 19th International Symposium on, P390, DOI DOI 10.1109/MASCOTS.2011.12
[9]  
CHO B., 2013, IEEE SOCC, P6
[10]  
Chowdhury M., 2012, ACM HOTNETS, P31