Locality Premised Reducer Scheduling in Hadoop

被引:0
作者
Fatma, Nusrat [1 ]
Singh, Remant Kr. [2 ]
Ahmad, Shafeeq [2 ]
Srivastava, Prachi [1 ]
机构
[1] Azad IET, Dept Comp Sci & Engn, Lucknow, Uttar Pradesh, India
[2] Azad IET, Dept Comp Applicat, Lucknow, Uttar Pradesh, India
来源
PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON SYSTEM MODELING & ADVANCEMENT IN RESEARCH TRENDS (SMART-2016) | 2016年
关键词
Hadoop; Map Reduce; Data Locality; Reduce Task Scheduling; Job Tracker; Task Tracker;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data is generated in high speed and volume called Big Data. Hadoop has ability to handle Big Data. Programs get executed in parallel manner in Hadoop. In Hadoop the problem gets divide in two parts namely, map task and reduce task. In the existing Hadoop version map task scheduling premise is the locality of input data lowers the network traffic and hence improve performance of the mappers. But reduce task get scheduled without any consideration of data locality, resulting to poor performance at requesting node. This paper propose a modified reduce task scheduling algorithm on the basis of data locality that will minimize data-local traffic. In the evaluation of algorithm it is observed that up to 80 % of bytes shuffling has reduced in Hadoop clusters.
引用
收藏
页码:222 / 224
页数:3
相关论文
共 7 条
[1]  
[Anonymous], 2009, Hadoop: The Definitive Guide
[2]  
[Anonymous], 2008, 8 USENIX S OP SYST D
[3]  
Chen PC, 2010, LECT NOTES COMPUT SC, V6104, P234, DOI 10.1007/978-3-642-13067-0_27
[4]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[5]  
Hammoud M., 2011, Proceedings of the 2011 IEEE 3rd International Conference on Cloud Computing Technology and Science (CloudCom 2011), P570, DOI 10.1109/CloudCom.2011.87
[6]  
Seo Sangwon., 2009, P 11 IEEE INT C CLUS, P16
[7]  
Zaharia M, 2010, EUROSYS'10: PROCEEDINGS OF THE EUROSYS 2010 CONFERENCE, P265