A characterization of workflow management systems for extreme-scale applications

Cited by: 79
Authors
da Silva, Rafael Ferreira [1]
Filgueira, Rosa [2, 3]
Pietri, Ilia [4]
Jiang, Ming [5]
Sakellariou, Rizos [6]
Deelman, Ewa [1]
Affiliations
[1] Univ Southern Calif, Informat Sci Inst, Marina Del Rey, CA 90292 USA
[2] British Geol Survey, Lyell Ctr, Edinburgh, Midlothian, Scotland
[3] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[4] Univ Athens, Dept Informat & Telecommun, Athens, Greece
[5] Lawrence Livermore Natl Lab, Livermore, CA USA
[6] Univ Manchester, Sch Comp Sci, Manchester, Lancs, England
Source
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2017 / Vol. 75
Keywords
Scientific workflows; Workflow management systems; Extreme-scale computing; in situ processing; TAVERNA; TOOL; VISUALIZATION; SCIENCE; SUITE
DOI
10.1016/j.future.2017.02.026
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Automation of the execution of computational tasks is at the heart of improving scientific productivity. In recent years, scientific workflows have been established as an important abstraction that captures the data processing and computation of large and complex scientific applications. By allowing scientists to model and express entire data processing steps and their dependencies, workflow management systems relieve scientists from the details of an application and manage its execution on a computational infrastructure. As the resource requirements of today's computational and data science applications that process vast amounts of data keep increasing, there is a compelling case for a new generation of advances in high-performance computing, commonly termed extreme-scale computing, which will bring forth multiple challenges for the design of workflow applications and management systems. This paper presents a novel characterization of workflow management systems using features commonly associated with extreme-scale computing applications. We classify 15 popular workflow management systems in terms of workflow execution models, heterogeneous computing environments, and data access methods. The paper also surveys workflow applications and identifies gaps for future research on the road to extreme-scale workflows and management systems. (C) 2017 Elsevier B.V. All rights reserved.
Pages: 228-238
Page count: 11