Optimizing Task Assignment in Hadoop Using an Efficient Job Size-Based Scheduler

Cited by: 0
Authors
Mirza, Masarrat [1]
Nagori, M. B. [2]
Affiliations
[1] GECA, Dept Comp Sci & Engn, Aurangabad, Maharashtra, India
[2] GECA, CSE Dept, Aurangabad, Maharashtra, India
Source
2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS) | 2017
Keywords
Hadoop; MapReduce; scheduling; different workloads; user priority
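DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract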
Hadoop is a distributed master-slave platform that comprises two main components: the Hadoop Distributed File System (HDFS) and MapReduce. HDFS provides distributed storage, whereas MapReduce handles computational processing. When a MapReduce cluster receives multiple jobs simultaneously, overall system performance can deteriorate seriously because of poor job response times. Efficient job scheduling is therefore a real challenge in the MapReduce world. Nevertheless, the traditional scheduling algorithms that work with Hadoop do not always ensure good average job response times under different workloads. To address this problem, we put forward an efficient Hadoop scheduler that collects information about workload patterns and distributes jobs according to our hybrid scheduling technique. The experimental results show that our scheduler improves the average job response time for MapReduce systems with different workload patterns.
Pages: 1287-1292
Page count: 6