A Review of Scheduling Algorithms in Hadoop

被引:4
作者
Sharma, Anil [1 ]
Singh, Gurwinder [1 ]
机构
[1] Lovely Profess Univ, Sch Comp Applicat, Phagwara, Punjab, India
来源
PROCEEDINGS OF RECENT INNOVATIONS IN COMPUTING, ICRIC 2019 | 2020年 / 597卷
关键词
Big data; Hadoop; TaskTracker; JobTracker; Scheduling; MapReduce; BIG DATA; RESOURCE;
D O I
10.1007/978-3-030-29407-6_11
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this epoch of data surge, big data is one of the significant areas of research being widely pondered over by computer science research community, and Hadoop is the broadly used tool to store and process it. Hadoop is fabricated to work effectively for the clusters having homogeneous environment but when the cluster environment is heterogeneous then its performance decreases which result in various challenges surfacing in the areas like query execution time, data movement cost, selection of best Cluster and Racks for data placement, preserving privacy, load distribution: imbalance in input splits, computations, partition sizes and heterogeneous hardware, and scheduling. The epicenter of Hadoop is scheduling and all incoming jobs are multiplexed on existing resources by the schedulers. Enhancing the performance of schedulers in Hadoop is very vigorous. Keeping this idea in mind as inspiration, this paper introduces the concept of big data, market share of popular vendors for big data, various tools in Hadoop ecosystem and emphasizing to study various scheduling algorithms for MapReduce model in Hadoop and make a comparison based on varied parameters.
引用
收藏
页码:125 / 135
页数:11
相关论文
共 41 条
[1]  
Al-Janabi S. T. F., 2011, Proceedings of the 2011 4th International Conference on Developments in e-systems Engineering (DeSE 2011), P209, DOI 10.1109/DeSE.2011.16
[2]  
[Anonymous], 2015, Hadoop-The Definitive Guide: Storage and Analysis at Internet Scale
[3]  
[Anonymous], 1997, ACM SIGGRAPH
[4]  
[Anonymous], 2011, BIG DATA NEXT FRONTI
[5]  
[Anonymous], 2013, Harness the Power of Big Data: The IBM Big Data Platform
[6]  
Berman JJ, 2013, PRINCIPLES OF BIG DATA: PREPARING, SHARING, AND ANALYZING COMPLEX INFORMATION, P1
[7]  
Bhosale H.S., 2014, Int J Sci Res, V4, P1
[8]   Big Data: A Survey [J].
Chen, Min ;
Mao, Shiwen ;
Liu, Yunhao .
MOBILE NETWORKS & APPLICATIONS, 2014, 19 (02) :171-209
[9]   Resource and Deadline-aware Job Scheduling in Dynamic Hadoop Clusters [J].
Cheng, Dazhao ;
Rao, Jia ;
Jiang, Changjun ;
Zhou, Xiaobo .
2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, :956-965
[10]  
Dai X., 2016, IEEE ICC 2016 SAC CL, P3627