Resource Allocation and Scheduling of Linear Workflow Applications with Ageing Priorities and Transient Failures

Cited by: 2
Authors
Stavrinides, Georgios L. [1]
Karatza, Helen D. [1]
Affiliations
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki, Greece
Source
2022 IEEE/ACS 19TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA) | 2022
Keywords
linear workflows; resource allocation; scheduling; distributed resources; ageing priorities; transient software failures; NETWORK; FOG;
DOI
10.1109/AICCSA56895.2022.10017928
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In distributed environments, applications are usually complex and computationally demanding, having a linear workflow (LW) structure. Additionally, such LW jobs may also have different priorities for processing. This entails the danger of long delays for low priority jobs. Furthermore, transient software failures may occur during the execution of the workload. Consequently, resource allocation, scheduling and fault tolerance are three crucial aspects that should be efficiently and effectively addressed in such environments, in order to achieve good system performance. To this end, in this paper we investigate the resource allocation and scheduling of LW jobs that arrive dynamically in an environment of distributed resources. We consider that the LW jobs have different priorities and that transient software failures may occur during their execution. A novel scheduling technique is proposed, which takes into account the ageing priorities of the LW jobs, as well as the resulting scheduling overhead. We examine the performance of three routing strategies in this framework, under various load cases and different failure probabilities, taking also into account their implementation complexity. The simulation results reveal how each routing strategy is affected in each of the examined scenarios.
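The idea of ageing priorities mentioned in the abstract can be illustrated with a minimal sketch. This record does not give the paper's actual ageing rule, so the linear ageing formula, the `ageing_rate` parameter, and the names `Job`, `effective_priority`, and `pick_next` are all assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    base_priority: int    # assumption: a larger value means more urgent
    arrival_time: float   # time at which the job entered the queue


def effective_priority(job, now, ageing_rate=0.5):
    """Base priority plus a bonus that grows linearly with waiting time.

    Linear ageing is an assumption; the paper may use a different rule.
    """
    return job.base_priority + ageing_rate * (now - job.arrival_time)


def pick_next(queue, now, ageing_rate=0.5):
    """Remove and return the waiting job with the highest effective priority.

    Ties are broken in favour of the job that arrived earliest.
    """
    best = max(
        queue,
        key=lambda j: (effective_priority(j, now, ageing_rate), -j.arrival_time),
    )
    queue.remove(best)
    return best
```

With a non-zero `ageing_rate`, a low priority job that has waited long enough can overtake a newly arrived high priority job, which is how ageing limits the long delays the abstract warns about. With `ageing_rate=0` the sketch reduces to plain static priority scheduling.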
Pages: 8