Performance evaluation of job schedulers on Hadoop YARN

Cited by: 13
Authors
Lin, Jia-Chun [1]
Lee, Ming-Chang [1]
Affiliations
[1] Univ Oslo, Dept Informat, Oslo, Norway
Keywords
Hadoop; YARN; capacity scheduler; fair scheduler; queue structure; performance evaluation; MapReduce
DOI
10.1002/cpe.3736
Chinese Library Classification number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
To address Hadoop's limitations in scalability, resource sharing, and application support, the open-source community proposed Yet Another Resource Negotiator (YARN), the next generation of Hadoop's compute platform, which separates resource management functions from the programming model. This separation enables various application types to run on YARN in parallel. To achieve fair resource sharing and high resource utilization, YARN provides the capacity scheduler and the fair scheduler. However, the performance impact of the two schedulers is unclear when mixed applications run on a YARN cluster. In this paper, we therefore study four scheduling-policy combinations (SPCs) derived from the two schedulers and evaluate them in extensive scenarios that consider not only four application types but also three different queue structures for organizing applications. The experimental results help YARN managers understand how different SPCs and queue structures influence mixed applications, and help them select a proper SPC and an appropriate queue structure to achieve better application execution performance. Copyright (c) 2016 John Wiley & Sons, Ltd.
Pages: 2711-2728
Page count: 18