Managing workflows on top of a cloud computing orchestrator for using heterogeneous environments on e-Science

Cited by: 4
Authors
Carrion, Abel [1]
Caballer, Miguel [1]
Blanquer, Ignacio [1]
Kotowski, Nelson [2]
Jardim, Rodrigo [2]
Rivera Davila, Alberto Martin [2]
Affiliations
[1] Univ Politecn Valencia, Ctr Mixto CSIC, GRyCAP Grp DeaGrid & Computac Altas Prestac, CIEMAT,I3M, Camino Vera S-N, E-46022 Valencia, Spain
[2] Oswaldo Cruz Inst, Computat & Syst Biol Lab, BR-21040360 Rio De Janeiro, RJ, Brazil
Keywords
cloud computing; cloud orchestrator; comparative genomics; e-Science; multi-platform; workflow; workflow management systems; SEQUENCE; MANAGEMENT; PEGASUS; GENOME; WEB
DOI
10.1504/IJWGS.2017.087326
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Scientific workflows (SWFs) are widely used to model processes in e-Science. SWFs are executed by means of workflow management systems (WMSs), which orchestrate the workload on top of computing infrastructures. The advent of cloud computing has opened the door to using on-demand infrastructures to complement or even replace local ones. However, new issues have arisen, such as the integration of hybrid resources and the trade-off between infrastructure reutilisation and elasticity. In this article, we present an ad hoc solution for managing workflows that exploits the capabilities of cloud orchestrators to deploy resources on demand according to the workload, and to combine heterogeneous cloud providers (such as on-premise clouds and public clouds) with traditional infrastructures (clusters) in order to minimise costs and response time. The work does not propose yet another WMS but demonstrates the benefits of integrating cloud orchestration when running complex workflows. The article presents several configuration experiments with a realistic comparative genomics workflow called Orthosearch, in which the memory-intensive workload is migrated to public infrastructures while the other blocks of the experiment keep running locally. It reports the running time and cost of each configuration and suggests best practices.
Pages: 375-402
Page count: 28