Life science data analysis workflow development using the BioExtract Server leveraging the iPlant Collaborative cyberinfrastructure

Cited by: 7
Authors
Lushbough, Carol M. [1]
Gnimpieba, Etienne Z. [2]
Dooley, Rion [3, 4]
Affiliations
[1] Univ S Dakota, Vermillion, SD 57069 USA
[2] Univ S Dakota, Dept Comp Sci, Vermillion, SD 57069 USA
[3] Univ Texas Austin, Web & Cloud Serv Grp, Austin, TX 78712 USA
[4] Univ Texas Austin, Texas Adv Comp Ctr, Austin, TX 78712 USA
Funding
National Science Foundation (USA)
Keywords
BioExtract server; iPlant data store; iPlant foundation API; bioinformatic workflows; WEB; GENERATION; TAVERNA
DOI
10.1002/cpe.3237
Chinese Library Classification number
TP31 [Computer Software]
Subject classification codes
081202; 0835
Abstract
To handle the vast quantities of biological data generated by high-throughput experimental technologies, the BioExtract Server (bioextract.org) has leveraged iPlant Collaborative functionality to help address big data storage and analysis issues in the bioinformatics field. The BioExtract Server is a Web-based, workflow-enabling system that offers researchers a flexible environment for analyzing genomic data. It lets researchers save a series of BioExtract Server tasks (e.g., query a data source, save a data extract, and execute an analytic tool) as a workflow and share their data extracts, analytic tools, and workflows with collaborators. The iPlant Collaborative is a community of researchers, educators, and students working to enrich science through the development of cyberinfrastructure: the physical computing resources, collaborative environment, virtual machine resources, and interoperable analysis software and data services that are essential components of modern biology. The iPlant AGAVE Advanced Programming Interface, developed through the iPlant Collaborative, is a hosted, Software-as-a-Service resource providing access to a collection of high performance computing and cloud resources. Leveraging AGAVE, the BioExtract Server gives researchers easy access to multiple high performance computers and delivers computation and storage as dynamically allocated resources via the Internet. © 2014 The Authors. Concurrency and Computation: Practice and Experience published by John Wiley & Sons Ltd.
Pages: 408-419
Number of pages: 12