Scheduling Big Data Workflows in the Cloud under Deadline Constraints

Cited by: 8
Authors
Ebrahimi, Mahdi [1]
Mohan, Aravind [2]
Lu, Shiyong [3]
Affiliations
[1] Lawrence Technol Univ, Dept Math & Comp Sci, Southfield, MI 48075 USA
[2] Allegheny Coll, Dept Comp Sci, Meadville, PA 16335 USA
[3] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA
Source
2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2018) | 2018
Funding
National Science Foundation (USA);
Keywords
big data workflows; big data; scheduling; BORRIS;
DOI
10.1109/BigDataService.2018.00014
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
With the advent of cloud computing, an unbound number of compute resources can be leased from the cloud providers. In such an environment, the number of assigned resources to a workflow can be elastically scaled in and out on a demand basis using the added Quality of Service (QoS) constraints such as the budget and the deadline. The heterogeneous nature of the cloud resources makes the decision of selecting resource type for each workflow a challenging problem. Although there are several existing research studies that propose both static and dynamic scheduling algorithms for both homogeneous and heterogeneous cloud resource types, they do not take advantage of the data dependency information that is part of the workflow structure during the scheduling process. There is still room for improvement, since the scheduling problem is an NP-hard problem. In this paper we propose a new Big data wOrkflow scheduleR undeR deadlIne conStraint (BORRIS) that is used to minimize the execution cost of the workflow under a provided deadline constraint in a heterogeneous cloud computing environment. We have implemented the proposed algorithm in our big data workflow system called DATAVIEW and the experimental results show the competitive advantage of our approach.
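The problem the abstract describes (choose a resource type for each workflow task so that total cost is minimal while the workflow still meets its deadline) can be illustrated with a minimal sketch. This is not the BORRIS algorithm, whose details this record does not give; it is a plain exhaustive search over a toy instance. The VM types, prices, speeds, task names, and work units are all hypothetical, each task is assumed to run on its own VM instance, and data transfer time is ignored.

```python
from itertools import product

# Hypothetical VM types: name -> (speed factor, price per time unit).
VM_TYPES = {"small": (1.0, 1.0), "medium": (2.0, 2.5), "large": (4.0, 6.0)}

# Hypothetical workflow DAG: task -> (work units, parent tasks).
# Tasks are listed in topological order (parents before children).
WORKFLOW = {
    "A": (8.0, []),
    "B": (4.0, ["A"]),
    "C": (8.0, ["A"]),
    "D": (4.0, ["B", "C"]),
}


def evaluate(assignment):
    """Return (makespan, cost) for a task -> VM type assignment."""
    finish = {}
    cost = 0.0
    for task, (work, parents) in WORKFLOW.items():
        speed, price = VM_TYPES[assignment[task]]
        runtime = work / speed
        # A task starts once all of its parents have finished.
        start = max((finish[p] for p in parents), default=0.0)
        finish[task] = start + runtime
        cost += runtime * price
    return max(finish.values()), cost


def cheapest_schedule(deadline):
    """Try every assignment; return (assignment, cost, makespan) or None."""
    best = None
    tasks = list(WORKFLOW)
    for combo in product(VM_TYPES, repeat=len(tasks)):
        assignment = dict(zip(tasks, combo))
        makespan, cost = evaluate(assignment)
        if makespan <= deadline and (best is None or cost < best[1]):
            best = (assignment, cost, makespan)
    return best


if __name__ == "__main__":
    for deadline in (20.0, 5.0, 4.0):
        print(deadline, cheapest_schedule(deadline))
```

In this toy instance, a loose deadline lets every task use the cheapest type, while a tight deadline forces the critical path (A, C, D) onto the fastest type and leaves only the off-path task B on a cheaper one. The search visits every combination of VM types, so its cost grows exponentially with the number of tasks, which is why heuristic schedulers are used in practice.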
Pages: 33-40
Page count: 8