An Algebraic Approach for Data-Centric Scientific Workflows

被引:0
|
作者
Ogasawara, Eduardo [1 ,2 ]
Dias, Jonas [1 ]
de Oliveira, Daniel [1 ]
Porto, Fabio [3 ]
Valduriez, Patrick [4 ]
Mattoso, Marta [1 ]
机构
[1] Univ Fed Rio de Janeiro, COPPE, Rio de Janeiro, Brazil
[2] CEFET RJ, Rio De Janeiro, Brazil
[3] LNCC, Petropolis, Brazil
[4] INRIA & LIRMM, Montpellier, France
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2011年 / 4卷 / 12期
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific workflows have emerged as a basic abstraction for structuring and executing scientific experiments in computational environments. In many situations, these workflows are computationally and data intensive, thus requiring execution in large-scale parallel computers. However, parallelization of scientific workflows remains low-level, ad-hoc and laborintensive, which makes it hard to exploit optimization opportunities. To address this problem, we propose an algebraic approach (inspired by relational algebra) and a parallel execution model that enable automatic optimization of scientific workflows. We conducted a thorough validation of our approach using both a real oil exploitation application and synthetic data scenarios. The experiments were run in Chiron, a data-centric scientific workflow engine implemented to support our algebraic approach. Our experiments demonstrate performance improvements of up to 226% compared to an ad-hoc workflow implementation.
引用
收藏
页码:1328 / 1339
页数:12
相关论文
共 50 条
  • [41] Reliability evaluation of individual predictions: a data-centric approach
    Shahbazi, Nima
    Asudeh, Abolfazl
    VLDB JOURNAL, 2024, 33 (04): : 1203 - 1230
  • [42] A data-centric approach for scalable state machine replication
    Chockler, G
    Malkhi, D
    Dolev, D
    FUTURE DIRECTIONS IN DISTRIBUTED COMPUTING: RESEARCH AND POSITION PAPERS, 2003, 2584 : 159 - 163
  • [43] Dynamic Load Balancing in Cloud A Data-Centric Approach
    Dasoriya, Rayan
    Kotadiya, Purvi
    Arya, Garima
    Nayak, Priyanshu
    Mistry, Kamal
    2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT), 2017, : 162 - 166
  • [44] Identification of the Barriers to Data-Centric Approach in the Construction Industry
    Karji, Ali
    Messner, John
    Leicht, Robert
    McComb, Christopher
    CONSTRUCTION RESEARCH CONGRESS 2022: PROJECT MANAGEMENT AND DELIVERY, CONTRACTS, AND DESIGN AND MATERIALS, 2022, : 1002 - 1011
  • [45] Cooperative approach for data-centric and node-centric misbehavior detection in VANET
    Sultana, Rukhsar
    Grover, Jyoti
    Tripathi, Meenakshi
    VEHICULAR COMMUNICATIONS, 2024, 50
  • [46] Materials data science using CRADLE: A distributed, data-centric approach
    Ciardi, Thomas G.
    Nihar, Arafath
    Chawla, Rounak
    Akanbi, Olatunde
    Tripathi, Pawan K.
    Wu, Yinghui
    Chaudhary, Vipin
    French, Roger H.
    MRS COMMUNICATIONS, 2024, 14 (04) : 601 - 611
  • [47] Have data, will travel: A data-centric approach to enterprise systems development
    Zumbado, J
    Iller, W
    Naecker, PA
    CONFERENCE XXII - GEOSPATIAL INFORMATION & TECHNOLOGY ASSOCIATION, PROCEEDINGS, 1999, : 121 - 131
  • [48] A Data Mesh Approach for Enabling Data-Centric Applications at the Tactical Edge
    Dahdal, Simon
    Poltronieri, Filippo
    Tortonesi, Mauro
    Stefanelli, Cesare
    Suri, Niranjan
    2023 INTERNATIONAL CONFERENCE ON MILITARY COMMUNICATIONS AND INFORMATION SYSTEMS, ICMCIS, 2023,
  • [49] Data-centric automated data mining
    Campos, MM
    Stengard, PJ
    Milenova, BL
    ICMLA 2005: FOURTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2005, : 97 - 104
  • [50] RDF Data-Centric Storage
    Levandoski, Justin J.
    Mokbel, Mohamed F.
    2009 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, VOLS 1 AND 2, 2009, : 911 - 918