An integrated task computation and data management scheduling strategy for workflow applications in cloud environments

被引:38
作者
Zeng, Lingfang [1 ]
Veeravalli, Bharadwaj [2 ]
Zomaya, Albert Y. [3 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp, Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China
[2] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117576, Singapore
[3] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
关键词
Cloud computing; Workflow; Scheduling; Makespan; Utilization; ALGORITHM; RESOURCES;
D O I
10.1016/j.jnca.2015.01.001
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A workflow is a systematic computation or a data-intensive application that has a regular computation and data access patterns. It is a key to design scalable scheduling algorithms in Cloud environments to address these runtime regularities effectively. While existing researches ignore to join the tasks scheduling and the optimization of data management for workflow, little attention has been paid so far to understand the combination between the two. The proposed scheme indicates that the coordination between task computation and data management can improve the scheduling performance. Our model considers data management to obtain satisfactory makespan on multiple datacenters. At the same time, our adaptive data-dependency analysis can reveal parallelization opportunities. In this paper, we introduce an adaptive data-aware scheduling (ADAS) strategy for workflow applications. It consist of a set-up stage which builds the clusters for the workflow tasks and datasets, and a run-time stage which makes the overlapped execution for the workflows. Through rigorous performance evaluation studies, we demonstrate that our strategy can effectively improve the workflow completion time and utilization of resources in a Cloud environment. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:39 / 48
页数:10
相关论文
共 47 条
  • [1] Security Policies in Distributed CSCW and Workflow Systems
    Ahmed, Tanvir
    Tripathi, Anand R.
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2010, 40 (06): : 1220 - 1231
  • [2] Dynamic QoS-aware data replication in grid environments based on data "importance"
    Andronikou, Vassiliki
    Mamouras, Konstantinos
    Tserpes, Konstantinos
    Kyriazis, Dimosthenis
    Varvarigou, Theodora
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2012, 28 (03): : 544 - 553
  • [3] [Anonymous], 2008, P 3 WORKSH WORKFL SU
  • [4] [Anonymous], P 10 INT S PERV SYST
  • [5] [Anonymous], STORAGE MANAGEMENT D
  • [6] A View of Cloud Computing
    Armbrust, Michael
    Fox, Armando
    Griffith, Rean
    Joseph, Anthony D.
    Katz, Randy
    Konwinski, Andy
    Lee, Gunho
    Patterson, David
    Rabkin, Ariel
    Stoica, Ion
    Zaharia, Matei
    [J]. COMMUNICATIONS OF THE ACM, 2010, 53 (04) : 50 - 58
  • [7] Cost optimized provisioning of elastic resources for application workflows
    Byun, Eun-Kyu
    Kee, Yang-Suk
    Kim, Jin-Soo
    Maeng, Seungryoul
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2011, 27 (08): : 1011 - 1026
  • [8] Chakaravarthy V. T., 2011, Proceedings of the 25th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2011), P14, DOI 10.1109/IPDPS.2011.12
  • [9] Deelman E, 2008, IEEE ACM INT SYMP, P687, DOI 10.1109/CCGRID.2008.24
  • [10] Di S, 2013, P INT C HIGH PERF CO