Scheduling of scientific workflow in non-dedicated heterogeneous multicluster platform

被引:17
作者
Zhang, Jinghui [1 ]
Luo, Junzhou [1 ]
Dong, Fang [1 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Multicluster; Non-dedicated; Scientific workflow; Scheduling; Branch-and-cut; TASK;
D O I
10.1016/j.jss.2012.10.029
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many scientific workflows can be structured as Parallel Task Graphs (PTGs), that is, graphs of data-parallel tasks. Adding data parallelism to a workflow provides opportunities for higher performance and scalability. Workflow tasks are data-parallel and moldable, and clusters are not only heterogeneous but also non-dedicated for workflow execution. Therefore, scheduling such scientific workflow in a multicluster platform becomes a challenging task. To address this problem, we study the scheduling of scientific workflow in a non-dedicated heterogeneous multicluster platform aimed at minimizing the makespan for workflow execution. In this paper, three scheduling algorithms for effective workflow task mapping and resource allocation are proposed, among them MHEFT-RSV and MHEFT-RSV-BD are heuristic algorithms. An exact branch-and-cut scheduling algorithm is implemented, which exploits the intertask precedence and resource constraints thereby accelerating the process of obtaining a feasible schedule with minimized makespan. Detailed simulation experiments show that on average the exact branch-and-cut algorithm obtains shorter makespan for small and medium size workflows, while MHEFT-RSV and MHEFT-RSV-BD achieves better tradeoff between makespan and computation time for large scientific workflows. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:1806 / 1818
页数:13
相关论文
共 31 条
  • [1] Aida K., 2008, P 17 INT S HIGH PERF, P65, DOI DOI 10.1145/1383422.1383432
  • [2] Scheduling mixed-parallel applications with advance reservations
    Aida, Kento
    Casanova, Henri
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2009, 12 (02): : 205 - 220
  • [3] An improved two-step algorithm for task and data parallel scheduling in distributed memory machines
    Bansal, Savina
    Kumar, Padam
    Singh, Kuldip
    [J]. PARALLEL COMPUTING, 2006, 32 (10) : 759 - 774
  • [4] Collaborative e-Science Experiments and Scientific Workflows
    Belloum, Adam
    Inda, Marcia A.
    Vasunin, Dmitry
    Korkhov, Vladimir
    Zhao, Zhiming
    Rauwerda, Han
    Breit, Timo M.
    Bubak, Marian
    Hertzberger, Luis O.
    [J]. IEEE INTERNET COMPUTING, 2011, 15 (04) : 39 - 47
  • [5] Chakrabarti S., 1995, SPAA '95. 7th Annual ACM Symposium on Parallel Algorithms and Architectures, P74, DOI 10.1145/215399.215423
  • [6] Exponential lower bounds on the lengths of some classes of branch-and-cut proofs
    Dash, S
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2005, 30 (03) : 678 - 700
  • [7] Dash S., 2006, LECT NOTES COMPUTER, V2337, P145
  • [8] A collaborative scheduling approach for service-driven scientific workflow execution
    Dou, Wanchun
    Zhao, J. Leon
    Fan, Shaokun
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2010, 76 (06) : 416 - 427
  • [9] Scheduling Parallel Task Graphs on (Almost) Homogeneous Multicluster Platforms
    Dutot, Pierre-Francois
    N'Takpe, Tchimou
    Suter, Frederic
    Casanova, Henri
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2009, 20 (07) : 940 - 952
  • [10] Cooperative negotiation and scheduling of scientific workflows in the collaborative climate community data and processing grid
    Grimme, Christian
    Papaspyrou, Alexander
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2009, 25 (03): : 301 - 307