A characterization of workflow management systems for extreme-scale applications

被引:79
作者
da Silva, Rafael Ferreira [1 ]
Filgueira, Rosa [2 ,3 ]
Pietri, Ilia [4 ]
Jiang, Ming [5 ]
Sakellariou, Rizos [6 ]
Deelman, Ewa [1 ]
机构
[1] Univ Southern Calif, Informat Sci Inst, Marina Del Rey, CA 90292 USA
[2] British Geol Survey, Lyell Ctr, Edinburgh, Midlothian, Scotland
[3] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[4] Univ Athens, Dept Informat & Telecommun, Athens, Greece
[5] Lawrence Livermore Natl Lab, Livermore, CA USA
[6] Univ Manchester, Sch Comp Sci, Manchester, Lancs, England
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2017年 / 75卷
关键词
Scientific workflows; Workflow management systems; Extreme-scale computing; in situ processing; TAVERNA; TOOL; VISUALIZATION; SCIENCE; SUITE;
D O I
10.1016/j.future.2017.02.026
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Automation of the execution of computational tasks is at the heart of improving scientific productivity. Over the last years, scientific workflows have been established as an important abstraction that captures data processing and computation of large and complex scientific applications. By allowing scientists to model and express entire data processing steps and their dependencies, workflow management systems relieve scientists from the details of an application and manage its execution on a computational infrastructure. As the resource requirements of today's computational and data science applications that process vast amounts of data keep increasing, there is a compelling case for a new generation of advances in high-performance computing, commonly termed as extreme-scale computing, which will bring forth multiple challenges for the design of workflow applications and management systems. This paper presents a novel characterization of workflow management systems using features commonly associated with extreme-scale computing applications. We classify 15 popular workflow management systems in terms of workflow execution models, heterogeneous computing environments, and data access methods. The paper also surveys workflow applications and identifies gaps for future research on the road to extreme-scale workflows and management systems. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:228 / 238
页数:11
相关论文
共 82 条
  • [31] de Oliveira D., 2010, 2010 IEEE 3rd International Conference on Cloud Computing (CLOUD 2010), P378, DOI 10.1109/CLOUD.2010.64
  • [32] Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
  • [33] Deelman E, 2004, LECT NOTES COMPUT SC, V3165, P11
  • [34] PANORAMA: An approach to performance modeling and diagnosis of extreme-scale workflows
    Deelman, Ewa
    Carothers, Christopher
    Mandal, Anirban
    Tierney, Brian
    Vetter, Jeffrey S.
    Baldin, Ilya
    Castillo, Claris
    Juve, Gideon
    Krol, Dariusz
    Lynch, Vickie
    Mayer, Ben
    Meredith, Jeremy
    Proffen, Thomas
    Ruth, Paul
    da Silva, Rafael Ferreira
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2017, 31 (01) : 4 - 18
  • [35] Pegasus, a workflow management system for science automation
    Deelman, Ewa
    Vahi, Karan
    Juve, Gideon
    Rynge, Mats
    Callaghan, Scott
    Maechling, Philip J.
    Mayani, Rajiv
    Chen, Weiwei
    da Silva, Rafael Ferreira
    Livny, Miron
    Wenger, Kent
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 46 : 17 - 35
  • [36] Workflows and e-Science: An overview of workflow system features and capabilities
    Deelman, Ewa
    Gannon, Dennis
    Shields, Matthew
    Taylor, Ian
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2009, 25 (05): : 528 - 540
  • [37] Docan C., 2010, P 19 ACM INT S HIGH, P25
  • [38] With Extreme Scale Computing the Rules Have Changed
    Dongarra, Jack
    [J]. MATHEMATICAL SOFTWARE, ICMS 2016, 2016, 9725 : 3 - 6
  • [39] Dun N., 2010, P 19 ACM INT S HIGH, P37
  • [40] Concurrent visualization in a production supercomputing environment
    Ellsworth, David
    Green, Bryan
    Henze, Chris
    Moran, Patrick
    Sandstrom, Timothy
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2006, 12 (05) : 997 - 1004