Provenance-based fault tolerance technique recommendation for cloud-based scientific workflows: a practical approach

被引:0
|
作者
Thaylon Guedes
Leonardo A. Jesus
Kary A. C. S. Ocaña
Lucia M. A. Drummond
Daniel de Oliveira
机构
[1] Instituto de Computação - Universidade Federal Fluminense,
[2] Laboratório Nacional de Computação Científica,undefined
来源
Cluster Computing | 2020年 / 23卷
关键词
Cloud computing; Scientific workflow; Fault-tolerance; Recommendation;
D O I
暂无
中图分类号
学科分类号
摘要
Scientific workflows are abstractions composed of activities, data and dependencies that model a computer simulation and are managed by complex engines named scientific workflow management system (SWfMS). Many workflows demand many computational resources once their executions may involve a number of different programs processing a massive volume of data. Thus, the use of high-performance computing (HPC) and data-intensive scalable computing environments allied to parallelization techniques provides the necessary support for the execution of such workflows. Clouds are environments that already offer HPC capabilities and workflows can explore them. Although clouds offer advantages such as elasticity and availability, failures are a reality rather than a possibility in this environment. Thus, existing SWfMS must be fault-tolerant. There are several types of fault tolerance techniques used in SWfMS such as Checkpoint/Restart, Re-Execution and Over-provisioning, but it is far from trivial to choose the suitable fault tolerance technique for a workflow execution that is not going to jeopardize the parallel execution. The major problem is that the suitable fault tolerance technique may be different for each workflow, activity or activation since programs associated with activities may present different behaviors. This article aims at analyzing several fault-tolerance techniques in a cloud-based SWfMS named SciCumulus, and recommend the suitable one for user’s workflow activities and activations using machine learning techniques and provenance data, thus aiming at improving resiliency.
引用
收藏
页码:123 / 148
页数:25
相关论文
共 50 条
  • [1] Provenance-based fault tolerance technique recommendation for cloud-based scientific workflows: a practical approach
    Guedes, Thaylon
    Jesus, Leonardo A.
    Ocana, Kary A. C. S.
    Drummond, Lucia M. A.
    de Oliveira, Daniel
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (01): : 123 - 148
  • [2] A Provenance-Based Fault Tolerance Mechanism for Scientific Workflows
    Crawl, Daniel
    Altintas, Ilkay
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, 2008, 5272 : 152 - 159
  • [3] A provenance-based heuristic for preserving results confidentiality in cloud-based scientific workflows
    Guerine, Marcos
    Stockinger, Murilo B.
    Rosseti, Isabel
    Simonetti, Luidi G.
    Ocana, Kary A. C. S.
    Plastino, Alexandre
    de Oliveira, Daniel
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 97 : 697 - 713
  • [4] Evaluating provenance-based trust for scientific workflows
    Rajbhandari, Shrija
    Wootten, Ian
    Ali, Shaikh Ali
    Rana, Omer F.
    SIXTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID: SPANNING THE WORLD AND BEYOND, 2006, : 365 - +
  • [5] SciLightning: A Cloud Provenance-Based Event Notification for Parallel Workflows
    Pintas, Julliano Trindade
    de Oliveira, Daniel
    Ocana, Kary A. C. S.
    Ogasawara, Eduardo
    Mattoso, Marta
    SERVICE-ORIENTED COMPUTING - ICSOC 2013 WORKSHOPS, 2014, 8377 : 352 - 365
  • [6] A Provenance-based Adaptive Scheduling Heuristic for Parallel Scientific Workflows in Clouds
    Daniel de Oliveira
    Kary A. C. S. Ocaña
    Fernanda Baião
    Marta Mattoso
    Journal of Grid Computing, 2012, 10 : 521 - 552
  • [7] A Provenance-based Adaptive Scheduling Heuristic for Parallel Scientific Workflows in Clouds
    de Oliveira, Daniel
    Ocana, Kary A. C. S.
    Baiao, Fernanda
    Mattoso, Marta
    JOURNAL OF GRID COMPUTING, 2012, 10 (03) : 521 - 552
  • [8] A Provenance-Based Approach to Resource Discovery in Distributed Molecular Dynamics Workflows
    da Cruz, Sergio Manuel Serra
    Barros, Patricia M.
    Bisch, Paulo M.
    Campos, Maria Luiza M.
    Mattoso, Marta
    RESOURCE DISCOVERY, 2010, 6162 : 66 - +
  • [9] Provenance-based Scientific Workflow Search
    Abu Jabal, Amani
    Bertino, Elisa
    de Mel, Geeth
    2017 IEEE 13TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), 2017, : 119 - 127
  • [10] A Provenance-based Execution Strategy for Variant GPU-accelerated Scientific Workflows in Clouds
    Stockinger, Murilo B.
    Guerine, Marcos A.
    de Paula, Ubiratam
    Santiago, Filipe
    Frota, Yuri
    Rosseti, Isabel
    Plastino, Alexandre
    de Oliveira, Daniel
    JOURNAL OF GRID COMPUTING, 2022, 20 (04)