Resource Aware Scheduling in Hadoop for Heterogeneous Workloads based on Load Estimation

被引:0
作者
Kapil, Sutariya B. [1 ]
Kamath, Sowmya S. [1 ]
机构
[1] Natl Inst Technol, Dept Informat Technol, Surathkal, Karnataka, India
来源
2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT) | 2013年
关键词
Hadoop; heterogeneity; Job Scheduling; Resource aware; Load balancing; MAPREDUCE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Currently, most cloud based applications require large scale data processing capability. Data to be processed is growing at a rate much faster than available computing power. Hadoop is used to enable distributed processing on large clusters of commodity hardware. In large clusters, the workloads may be heterogeneous in nature, that is, I/O bound, CPU bound or network intensive jobs that demand different types of resources requirement so as to run simultaneously on large cluster. Hadoops job scheduling is based on FIFO where, parallelization based on types of job has not been taken into account for scheduling. In this paper, we propose a new scheduling algorithm for Hadoop based distributed system, based on the classification of workloads to assign a specific category to a particular cluster according to current load of the cluster. The proposed scheduler increases the performance of both CPU and I/O resources in a cluster under heterogeneous workloads, by approximately 12% when compared to Hadoops FIFO scheduler.
引用
收藏
页数:5
相关论文
共 13 条
[1]  
[Anonymous], 2008, 8 USENIX S OP SYST D
[2]  
[Anonymous], 2003, P 19 ACM S OP SYST P, DOI [10.1145/1165389.945450, DOI 10.1145/1165389.945450]
[3]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[4]  
Dhok Jaideep, 2010, USING PATTERN CLASSI
[5]   GANG SCHEDULING PERFORMANCE BENEFITS FOR FINE-GRAIN SYNCHRONIZATION [J].
FEITELSON, DG ;
RUDOLPH, L .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1992, 16 (04) :306-318
[6]  
Kc K., 2010, Proceedings of the 2010 IEEE 2nd International Conference on Cloud Computing Technology and Science (CloudCom 2010), P388, DOI 10.1109/CloudCom.2010.97
[7]   Google's MapReduce programming model -: Revisited [J].
Laemmel, Ralf .
SCIENCE OF COMPUTER PROGRAMMING, 2008, 70 (01) :1-30
[8]  
Ousterhout J. K., 1982, Proceedings of the 3rd International Conference on Distributed Computing Systems, P22
[9]  
Page A. J., 2005, PAR DISTR PROC S 200, p189a
[10]  
Sandholm T, 2010, LECT NOTES COMPUT SC, V6253, P110, DOI 10.1007/978-3-642-16505-4_7