Cloud-Based Mapreduce Workflow Execution Platform

被引:3
作者
Jung, In-Yong [1 ]
Han, Byong-John [1 ]
Jeong, Chang-Sung [1 ]
Rho, Seungmin [2 ]
机构
[1] Korea Univ, Dept Elect Engn, Seoul, South Korea
[2] Sungkyul Univ, Dept Multimedia, Anyang Si, South Korea
来源
JOURNAL OF INTERNET TECHNOLOGY | 2014年 / 15卷 / 06期
基金
新加坡国家研究基金会;
关键词
Cloud computing; PaaS; Mapreduce workflow; Job scheduling; MANAGEMENT; SYSTEM; TASK;
D O I
10.6138/JIT.2014.15.6.17
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With increasing demand of data-intensive applications, mapreduce technologies have become useful tools to develop large scale applications efficiently by integrating various existing mapreduce jobs. However, there are few existing researches of workflow systems which can integrates mapreduce jobs with on-demand cloud resource provisioning. In this paper, we present a new cloud-based mapreduce workflow execution platform named DIVE-CWM (Distributed-parallel Virtual Environment on Cloud computing for Workflow for launching Mapreduce jobs) which integrates multiple mapreduce jobs and legacy programs into a single workflow. It provides a transparent and selective job scheduling by estimating the execution time in advance for workflow to execute all its jobs. Also, it supports automatic resource provisioning scheme which offers on-demand VM resources automatically to launch a workflow onto cloud. Furthermore, it provides an agent based resource management for automatic job deployment and execution of workflow on mapreduce clusters. Additionally, service oriented architecture based on web service API and graphical user interface offers high accessibility and convenience to user and other systems. We show the experimental results which compares the different scheduling schemes for various workflows.
引用
收藏
页码:1059 / 1067
页数:9
相关论文
共 22 条
[1]  
[Anonymous], P 2 INT WORKSH SOFTW
[2]  
[Anonymous], INT J MULTIMEDIA UBI
[3]  
Backman N., 2012, Proceedings of third international workshop on MapReduce and its Applications Date, P1
[4]  
Bittencourt L. F., 2010, NOMS 2010 - 2010 IEEE/IFIP Network Operations and Management Symposium Workshops, P343, DOI 10.1109/NOMSW.2010.5486553
[5]   Cost optimized provisioning of elastic resources for application workflows [J].
Byun, Eun-Kyu ;
Kee, Yang-Suk ;
Kim, Jin-Soo ;
Maeng, Seungryoul .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2011, 27 (08) :1011-1026
[6]   BTS: Resource capacity estimate for time-targeted science workflows [J].
Byun, Eun-Kyu ;
Kee, Yang-Suk ;
Kim, Jin-Soo ;
Deelman, Ewa ;
Maeng, Seungryoul .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2011, 71 (06) :848-862
[7]  
Çelik D, 2011, J INTERNET TECHNOL, V12, P153
[8]  
Kee Y.S., 2008, P 3 WORKSH WORKFL SU, P1, DOI DOI 10.1109/IPDPS.2008.4536167
[9]   Grid Management System and Information System for Semantic Grid Middleware [J].
Kim, Hyeong-Rae ;
Han, Byong-John ;
Jeong, In-Yong ;
Jeong, Chang-Sung .
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2010, 4 (06) :1080-1097
[10]   A Hadoop-based Multimedia Transcoding System for Processing Social Media in the PaaS Platform of SMCCSE [J].
Kim, Myoungjin ;
Han, Seungho ;
Cui, Yun ;
Lee, Hanku ;
Jeong, Changsung .
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2012, 6 (11) :2827-2848