Scheduling Data Intensive Workloads through Virtualization on MapReduce based Clouds

被引:0
作者
Rao, B. Thirumala [1 ]
Reddy, L. S. S. [2 ]
机构
[1] Lakireddy Bali Reddy Coll Engn, Dept Comp Sci & Engn, Mylavaram, India
[2] Lakireddy Bali Reddy Coll Engn, Mylavaram, India
来源
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY | 2013年 / 13卷 / 06期
关键词
Cloud Computing; Data Locality; MapReduce; Virtualization; Hadoop;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
MapReduce has become a popular programming model for running data intensive applications on cloud. Completion time goals or deadlines of MapReduce jobs set by users are becoming crucial in existing cloud-based data processing environments like Hadoop. There is a conflict between the scheduling MR jobs to meet deadlines and "data locality" (assigning tasks to nodes that contain their input data). To meet the deadline a task may be scheduled on a node without local input data for that task causing expensive data transfer from a remote node. In this paper, a novel scheduler is proposed to address the above problem which is primarily based on dynamic resource reconfiguration approach. It has two components: 1) Resource Predictor: which dynamically determines the required number of Map/Reduce slots for every job to meet completion time guarantee; 2) Resource Reconfigurator: that adjusts the CPU resources while not violating completion time goals of the users by dynamically increasing or decreasing individual VMs to maximize data locality and also to maximize the use of resources within the system among the active jobs. The proposed scheduler has been evaluated against Fair Scheduler on virtual cluster built on a physical cluster of 9 machines. The results demonstrate a gain of about 12% increase in through put of Jobs.
引用
收藏
页码:105 / 112
页数:8
相关论文
共 15 条
[1]  
[Anonymous], [No title captured]
[2]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[3]  
Isard M., 2007, ACM SIGOPS OS REV, V41, P72
[4]  
Isard M., 2009, P 22 ACM S SOP SYST
[5]  
Kang H, 2011, P 20 INT S HIGH PERF
[6]  
Kc K., 2010, Proceedings of the 2010 IEEE 2nd International Conference on Cloud Computing Technology and Science (CloudCom 2010), P388, DOI 10.1109/CloudCom.2010.97
[7]  
Palanisamy B, 2011, P 2011 INT C HIGH PE
[8]  
Phan L., 2010, MSCIS1032 UPENN
[9]  
Polo J., 2010, 12 IEEE IFIP NETW OP
[10]  
Rao B, 2011, INT J COMPUT APPL, V34, P29