The many faces of data-centric workflow optimization: a survey

被引:24
|
作者
Kougka G. [1 ]
Gounaris A. [1 ]
Simitsis A. [2 ]
机构
[1] Aristotle University of Thessaloniki, Thessaloníki
[2] HP Labs, Palo Alto
关键词
Data analysis; Data flows; Data science; Workflow management systems; Workflow optimization;
D O I
10.1007/s41060-018-0107-0
中图分类号
学科分类号
摘要
Workflow technology is rapidly evolving and, rather than being limited to modeling the control flow in business processes, is becoming a key mechanism to perform advanced data management, such as big data analytics. This survey focuses on data-centric workflows (or workflows for data analytics or data flows), where a key aspect is data passing through and getting manipulated by a sequence of steps. The large volume and variety of data, the complexity of operations performed, and the long time such workflows take to compute give rise to the need for optimization. In general, data-centric workflow optimization is a technology in evolution. This survey focuses on techniques applicable to workflows comprising arbitrary types of data manipulation steps and semantic inter-dependencies between such steps. Further, it serves a twofold purpose: firstly, to present the main dimensions of the relevant optimization problems and the types of optimizations that occur before flow execution and secondly, to provide a concise overview of the existing approaches with a view to highlighting key observations and areas deserving more attention from the community. © 2018, Springer International Publishing AG, part of Springer Nature.
引用
收藏
页码:81 / 107
页数:26
相关论文
共 50 条
  • [1] WAD Workflow System: Data-Centric Workflow System
    Robinson, Andrew J.
    Rahayu, Wenny J.
    Dillon, Tharam
    ASWEC 2009: 20TH AUSTRALIAN SOFTWARE ENGINEERING CONFERENCE, PROCEEDINGS, 2009, : 337 - +
  • [2] Data-Centric Graph Learning: A Survey
    Guo, Yuxin
    Bo, Deyu
    Yang, Cheng
    Lu, Zhiyuan
    Zhang, Zhongjian
    Liu, Jixi
    Peng, Yufei
    Shi, Chuan
    IEEE TRANSACTIONS ON BIG DATA, 2025, 11 (01) : 1 - 20
  • [3] Data-centric Artificial Intelligence: A Survey
    Zha, Daochen
    Bhat, Zaid Pervaiz
    Lai, Kwei-Herng
    Yang, Fan
    Jiang, Zhimeng
    Zhong, Shaochen
    Hu, Xia
    ACM COMPUTING SURVEYS, 2025, 57 (05)
  • [4] Survey of data-centric Smart City
    Wang, Jingyuan
    Li, Chao
    Xiong, Zhang
    Shan, Zhiguang
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2014, 51 (02): : 239 - 259
  • [5] Bridging Control-Centric and Data-Centric Optimization
    Ben-Nun, Tal
    Ates, Berke
    Calotoiu, Alexandru
    Hoefler, Torsten
    PROCEEDINGS OF THE 21ST ACM/IEEE INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, CGO 2023, 2023, : 173 - 185
  • [6] DLProv: A Data-Centric Support for Deep Learning Workflow Analyses
    Pina, Debora
    Chapman, Adriane
    Kunstmann, Liliane
    de Oliveira, Daniel
    Mattoso, Marta
    PROCEEDINGS OF THE 8TH WORKSHOP ON DATA MANAGEMENT FOR END-TO-END MACHINE LEARNING, DEEM 2024, 2024,
  • [7] Enabling effective workflow model reuse: A data-centric approach
    Liu, Zhiyong
    Fan, Shaokun
    Wang, Harry Jiannan
    Zhao, J. Leon
    DECISION SUPPORT SYSTEMS, 2017, 93 : 11 - 25
  • [8] Data-Centric Green Artificial Intelligence: A Survey
    Salehi S.
    Schmeink A.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (05): : 1973 - 1989
  • [9] Data-centric Combinatorial Optimization of Parallel Code
    Luo, Hao
    Chen, Guoyang
    Li, Pengcheng
    Ding, Chen
    Shen, Xipeng
    ACM SIGPLAN NOTICES, 2016, 51 (08) : 379 - 380
  • [10] Pansharpening using data-centric optimization approach
    Devi, Mutum Bidyarani
    Devanathan, R.
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2019, 40 (20) : 7784 - 7804