Optimization of Task Assignment Strategy for Map-Reduce

被引:0
作者
Jin, Songchang [1 ]
Yang, Shuqiang [1 ]
Jia, Yan [1 ]
机构
[1] Natl Univ Def Technol, Sch Comp, Changsha, Hunan, Peoples R China
来源
PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012) | 2012年
关键词
Map-Reduce; task assignment; FIFO; Hadoop; replica selection; MAPREDUCE;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the coming of this big data age, parallel processing is essential to processing a massive volume of data in a timely manner. Map-Reduce, which has been popularized, is a scalable and fault-tolerant data processing framework. It enables to process a massive volume of data in parallel way with many low-end computing nodes. As an important part of the framework, map task assignment has a significant impact on the performance of Map-Reduce. But in the allocation of the input files for map tasks, Map-Reduce framework does not take into account the distribution of the input data blocks in the file system and the load of the computing nodes themselves, which leading to increase the amount of network data transfer and system load when running map tasks. Especially when the framework uses the FIFO job scheduling strategy to deal with a large number of small jobs, the performance of the framework will be very low. In this paper, we design and implement a new task assignment strategy to increase the performance and efficiency of the Map-Reduce framework.
引用
收藏
页码:57 / 61
页数:5
相关论文
共 18 条
[1]  
Anand A., 2008, Scaling Hadoop to 4000 nodes at Yahoo!
[2]  
Anderson E., 2010, ACM SIGOPS OPERATING, V44, P40
[3]  
[Anonymous], 2011, BIG DATA NEXT FRONTI
[4]  
[Anonymous], 2004, OSDI 04
[5]  
Apache, 2011, HDFS ARCH GUID
[6]  
Apache Hadoop, 2009, CAP SCHED
[7]  
Apache Hadoop, 2010, FAIR SCHED
[8]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[9]   Mapreduce: Simplified data processing on large clusters [J].
Dean, Jeffrey ;
Ghemawat, Sanjay .
COMMUNICATIONS OF THE ACM, 2008, 51 (01) :107-113
[10]  
DeWitt D, 2008, MAPREDUCE MAJOR STEP, V1