An Algebraic Approach for Data-Centric Scientific Workflows

Authors
Ogasawara, Eduardo [1, 2]
Dias, Jonas [1]
de Oliveira, Daniel [1]
Porto, Fabio [3]
Valduriez, Patrick [4]
Mattoso, Marta [1]
Affiliations
[1] Univ Fed Rio de Janeiro, COPPE, Rio de Janeiro, Brazil
[2] CEFET RJ, Rio De Janeiro, Brazil
[3] LNCC, Petropolis, Brazil
[4] INRIA & LIRMM, Montpellier, France
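Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2011 / Vol. 4 / Issue 12
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Subject classification code
0812;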
Abstract
Scientific workflows have emerged as a basic abstraction for structuring and executing scientific experiments in computational environments. In many situations, these workflows are computationally and data intensive, thus requiring execution in large-scale parallel computers. However, parallelization of scientific workflows remains low-level, ad hoc, and labor-intensive, which makes it hard to exploit optimization opportunities. To address this problem, we propose an algebraic approach (inspired by relational algebra) and a parallel execution model that enable automatic optimization of scientific workflows. We conducted a thorough validation of our approach using both a real oil exploitation application and synthetic data scenarios. The experiments were run in Chiron, a data-centric scientific workflow engine implemented to support our algebraic approach. Our experiments demonstrate performance improvements of up to 226% compared to an ad hoc workflow implementation.
Pages: 1328 - 1339
Page count: 12