Towards an Adaptive and Distributed Architecture for Managing Workflow Provenance Data

被引:3
作者
Costa, Flavio [1 ]
de Oliveira, Daniel [2 ]
Mattoso, Marta [1 ]
机构
[1] Univ Fed Rio de Janeiro, COPPE, Rio De Janeiro, Brazil
[2] Fluminense Fed Univ, Comp Inst, Niteroi, RJ, Brazil
来源
2014 IEEE 10TH INTERNATIONAL CONFERENCE ON ESCIENCE WORKSHOPS (ESCIENCE 2014), VOL 2 | 2014年
关键词
distributed provenance; scientific workflow; scientific workflow management system;
D O I
10.1109/eScience.2014.59
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Workflow provenance data represents the workflow execution behavior, allowing for tracing the generation of the scientific data-flow. Provenance is an important asset to analyze data, identify and handle errors that occurred during the workflow execution through runtime monitoring. The workflow execution engine can also use provenance data to set the initial amount of resources and plan adaptive task scheduling. However, efficiently managing provenance data from distributed workflow execution has several challenges. As the size of workflows increases (in terms of number of activity executions or volume of data to process), the amount of provenance data to be managed also grows, especially in fine grain. Thus, centralized approaches become unviable. In this work we propose an architecture that combines distributed workflow management techniques with distributed provenance data management.
引用
收藏
页码:79 / 82
页数:4
相关论文
共 18 条
  • [1] Allen M., 2011, TAPP 2011
  • [2] [Anonymous], P 2 INT WORKSH CLOUD
  • [3] [Anonymous], 2009, USENIX ANN TECHN C
  • [4] [Anonymous], P 7 IEEE INT C E SCI
  • [5] [Anonymous], P ACM INT S HIGH PER
  • [6] [Anonymous], 2008, P 2008 ACM SIGMOD IN
  • [7] Costa F, 2012, LECT NOTES COMPUT SC, V7525, P229
  • [8] de Oliveira D., 2010, 2010 IEEE 3rd International Conference on Cloud Computing (CLOUD 2010), P378, DOI 10.1109/CLOUD.2010.64
  • [9] A Provenance-based Adaptive Scheduling Heuristic for Parallel Scientific Workflows in Clouds
    de Oliveira, Daniel
    Ocana, Kary A. C. S.
    Baiao, Fernanda
    Mattoso, Marta
    [J]. JOURNAL OF GRID COMPUTING, 2012, 10 (03) : 521 - 552
  • [10] Provenance for computational tasks: A survey
    Freire, Juliana
    Koop, David
    Santos, Emanuele
    Silva, Claudio T.
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2008, 10 (03) : 11 - 21