Developing accurate and scalable simulators of production workflow management systems with WRENCH

被引:35
作者
Casanova, Henri [1 ]
Silva, Rafael Ferreira da [2 ,3 ]
Tanaka, Ryan [1 ,2 ]
Pandey, Suraj [1 ]
Jethwani, Gautam [3 ]
Koch, William [1 ]
Albrecht, Spencer [3 ]
Oeth, James [3 ]
Suter, Frederic [4 ]
机构
[1] Univ Hawaii, Informat & Comp Sci, POST Bldg,Rm 317,1680 East West Rd, Honolulu, HI 96822 USA
[2] Univ Southern Calif, Informat Sci Inst, Marina Del Rey, CA USA
[3] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90007 USA
[4] CNRS, IN2P3 Comp Ctr, Villeurbanne, France
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2020年 / 112卷
基金
美国国家科学基金会;
关键词
Scientific workflows; Workflow management systems; Simulation; Distributed computing; CLOUD; EXECUTION; CLUSTERS; TOOLKIT;
D O I
10.1016/j.future.2020.05.030
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Scientific workflows are used routinely in numerous scientific domains, and Workflow Management Systems (WMSs) have been developed to orchestrate and optimize workflow executions on distributed platforms. WMSs are complex software systems that interact with complex software infrastructures. Most WMS research and development activities rely on empirical experiments conducted with full-fledged software stacks on actual hardware platforms. These experiments, however, are limited to hardware and software infrastructures at hand and can be labor-and/or time-intensive. As a result, relying solely on real-world experiments impedes WMS research and development. An alternative is to conduct experiments in simulation. In this work we present WRENCH, a WMS simulation framework, whose objectives are (i) accurate and scalable simulations; and (ii) easy simulation software development. WRENCH achieves its first objective by building on the SimGrid framework. While SimGrid is recognized for the accuracy and scalability of its simulation models, it only provides low-level simulation abstractions and thus large software development efforts are required when implementing simulators of complex systems. WRENCH thus achieves its second objective by providing high-level and directly re-usable simulation abstractions on top of SimGrid. After describing and giving rationales for WRENCH's software architecture and APIs, we present two case studies in which we apply WRENCH to simulate the Pegasus production WMS and the WorkQueue application execution framework. We report on ease of implementation, simulation accuracy, and simulation scalability so as to determine to which extent WRENCH achieves its objectives. We also draw both qualitative and quantitative comparisons with a previously proposed workflow simulator. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:162 / 175
页数:14
相关论文
共 62 条
[1]  
Albrecht Michael, 2012, P 1 ACM SIGMOD WORKS, DOI DOI 10.1145/2443416.2443417
[2]  
Altintas I, 2004, 16TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, P423
[3]  
[Anonymous], 2010, ARR ARR METH GLOB, DOI DOI 10.1109/ICCCN.2010.5560146
[4]  
[Anonymous], 2002, TECH REP
[5]   Parallel simulation of large-scale parallel applications [J].
Bagrodia, R ;
Deelman, E ;
Phan, T .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2001, 15 (01) :3-12
[6]   OverSim: A flexible overlay network simulation framework [J].
Baumgart, Ingmar ;
Heep, Bernhard ;
Krause, Stephan .
2007 IEEE GLOBAL INTERNET SYMPOSIUM, 2007, :79-84
[7]   Toward Better Simulation of MPI Applications on Ethernet/TCP Networks [J].
Bedaride, Paul ;
Degomme, Augustin ;
Genaud, Stephane ;
Legrand, Arnaud ;
Markomanolis, George S. ;
Quinson, Martin ;
Stillwell, Mark ;
Suter, Frederic ;
Videau, Brice .
HIGH PERFORMANCE COMPUTING SYSTEMS: PERFORMANCE MODELING, BENCHMARKING AND SIMULATION, 2014, 8551 :158-181
[8]   OptorSim: A grid simulator for studying dynamic data replication strategies [J].
Bell, WH ;
Cameron, DG ;
Capozza, L ;
Millar, AP ;
Stockinger, K ;
Zini, F .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2003, 17 (04) :403-416
[9]  
Benoit A., 2008, 2008 IEEE INT S PARA, P1
[10]  
Beranek Jakub, 2019, INT C HIGH PERFORMAN