Orchestrating Data-Centric Workflows

Authors
Barker, Adam [1]
Weissman, Jon B. [1]
van Hemert, Jano [1]
Affiliations
[1] Univ Edinburgh, NeSC, Edinburgh EH8 9YL, Midlothian, Scotland
Source
CCGRID 2008: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, VOLS 1 AND 2, PROCEEDINGS | 2008
Keywords
Systems architecture; workflow optimisation; Web services; decentralised orchestration;
DOI
Not available
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
When orchestrating data-centric workflows, as are commonly found in the sciences, centralised servers can become a bottleneck to the performance of a workflow: output from service invocations is normally transferred via a centralised orchestration engine, when it should be passed directly to where it is needed, at the next service in the workflow. To address this performance bottleneck, this paper presents a lightweight hybrid workflow architecture and concrete API, based on a model of centralised control flow and distributed data flow. Our architecture maintains the robustness and simplicity of centralised orchestration, but facilitates choreography by allowing services to exchange data directly with one another, reducing the data that needs to be transferred through a centralised server. Furthermore, our architecture is standards compliant, flexible, and non-disruptive; service definitions do not have to be altered prior to enactment.
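The difference between the two data-flow models described in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the API presented in the paper: the `Proxy` class, the `run_centralised` and `run_hybrid` functions, and the `ref0`, `ref1`, ... reference names are all hypothetical, and services are modelled as plain in-process functions rather than Web services. The sketch counts how many bytes pass through the orchestration engine in each model.

```python
from typing import Callable, Dict, List, Tuple

Service = Callable[[bytes], bytes]


def run_centralised(services: List[Service], data: bytes) -> Tuple[bytes, int]:
    """Centralised data flow: every input and output passes through the engine."""
    via_engine = 0
    for service in services:
        via_engine += len(data)   # engine sends the input to the service
        data = service(data)
        via_engine += len(data)   # service returns the output to the engine
    return data, via_engine


class Proxy:
    """Hypothetical proxy placed next to a service; keeps outputs locally."""

    def __init__(self, service: Service) -> None:
        self.service = service
        self.store: Dict[str, bytes] = {}

    def invoke(self, source: "Proxy", in_ref: str, out_ref: str) -> str:
        data = source.store[in_ref]               # pulled directly from the peer proxy
        self.store[out_ref] = self.service(data)  # output stays at this proxy
        return out_ref                            # only a reference goes back to the engine


def run_hybrid(services: List[Service], data: bytes) -> Tuple[bytes, int]:
    """Centralised control flow, distributed data flow (illustrative only)."""
    entry = Proxy(lambda d: d)
    entry.store["ref0"] = data
    via_engine = len(data)        # engine stages the initial input once
    source, ref = entry, "ref0"
    for i, service in enumerate(services, start=1):
        proxy = Proxy(service)
        ref = proxy.invoke(source, ref, f"ref{i}")
        via_engine += len(ref)    # engine only sees the short reference
        source = proxy
    result = source.store[ref]
    via_engine += len(result)     # engine retrieves the final result
    return result, via_engine


if __name__ == "__main__":
    pipeline = [lambda d: d * 2, lambda d: d[::-1]]
    payload = b"x" * 1000
    print(run_centralised(pipeline, payload)[1])
    print(run_hybrid(pipeline, payload)[1])
```

In both functions the engine still decides the order of invocations, so control flow stays centralised; only the intermediate data takes a different route. The real saving depends on the size of intermediate outputs relative to the references exchanged.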
Pages: 210 - 217
Page count: 8
Related papers
50 in total
  • [11] D2WORM: A Management Infrastructure for Distributed Data-centric Workflows
    Jergler, Martin
    Sadoghi, Mohammad
    Jacobsen, Hans-Arno
    SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 1427 - 1432
  • [12] Data-Centric AI
    Malerba, Donato
    Pasquadibisceglie, Vincenzo
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, 62 (06) : 1493 - 1502
  • [13] Safe Distribution and Parallel Execution of Data-centric Workflows over the Publish/Subscribe Abstraction
    Jergler, Martin
    Jacobsen, Hans-Arno
    Sadoghi, Mohammad
    Hull, Richard
    Vaculin, Roman
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1498 - 1499
  • [14] In-memory staging and data-centric task placement for coupled scientific simulation workflows
    Zhang, Fan
    Jin, Tong
    Sun, Qian
    Romanus, Melissa
    Bui, Hoang
    Klasky, Scott
    Parashar, Manish
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (12):
  • [15] Safe Distribution and Parallel Execution of Data-Centric Workflows over the Publish/Subscribe Abstraction
    Sadoghi, Mohammad
    Jergler, Martin
    Jacobsen, Hans-Arno
    Hull, Richard
    Vaculin, Roman
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (10) : 2824 - 2838
  • [16] Data-centric automated data mining
    Campos, MM
    Stengard, PJ
    Milenova, BL
    ICMLA 2005: FOURTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2005, : 97 - 104
  • [17] RDF Data-Centric Storage
    Levandoski, Justin J.
    Mokbel, Mohamed F.
    2009 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, VOLS 1 AND 2, 2009, : 911 - 918
  • [18] The Principles of Data-Centric AI
    Jarrahi, Mohammad Hossein
    Memariani, Ali
    Guha, Shion
    COMMUNICATIONS OF THE ACM, 2023, 66 (08) : 84 - 92
  • [19] Unpacking data-centric geotechnics
    Phoon, Kok-Kwang
    Ching, Jianye
    Cao, Zijun
    UNDERGROUND SPACE, 2022, 7 (06) : 967 - 989
  • [20] Data-centric decision support
    Kulhavy, R
    PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 3395 - 3400