CyVerse: Cyberinfrastructure for open science

被引:12
作者
Swetnam, Tyson L. [1 ]
Antin, Parker B. [1 ]
Bartelme, Ryan [1 ,2 ]
Bucksch, Alexander [1 ]
Camhy, David [3 ]
Chism, Greg [1 ]
Choi, Illyoung [1 ]
Cooksey, Amanda M. [1 ]
Cosi, Michele [1 ]
Cowen, Cindy [1 ]
Culshaw-Maurer, Michael [1 ,4 ]
Davey, Robert [4 ,5 ]
Davey, Sean [1 ]
Devisetty, Upendra [1 ,6 ]
Edgin, Tony [1 ]
Edmonds, Andy [1 ]
Fedorov, Dmitry [7 ]
Frady, Jeremy [1 ]
Fonner, John [8 ]
Gillan, Jeffrey K. [1 ]
Hossain, Iqbal [1 ]
Joyce, Blake [1 ]
Lang, Konrad [9 ]
Lee, Tina [1 ]
Littin, Shelley [1 ]
Mcewen, Ian [1 ]
Merchant, Nirav [1 ]
Micklos, David [10 ]
Nelson, Andrew [11 ]
Ramsey, Ashley [1 ]
Roberts, Sarah [1 ]
Sarando, Paul [1 ]
Skidmore, Edwin [1 ]
Song, Jawon [8 ]
Sprinkle, Mary Margaret [1 ]
Srinivasan, Sriram [1 ]
Stanzione, Dan [8 ]
Strootman, Jonathan D. [1 ]
Stryeck, Sarah [3 ,9 ]
Tuteja, Reetu [1 ,6 ]
Vaughn, Matthew [8 ]
Wali, Mojib [3 ]
Wall, Mariah [1 ]
Walls, Ramona [1 ,12 ]
Wang, Liya [10 ]
Wickizer, Todd [1 ]
Williams, Jason [10 ]
Wregglesworth, John [1 ]
Lyons, Eric [1 ]
机构
[1] Univ Arizona, Tucson, AZ 85721 USA
[2] Pivot Bio, Berkeley, CA USA
[3] Graz Univ Technol, Graz, Austria
[4] The Carpentries, Oakland, CA USA
[5] Earlham Inst, Norwich, England
[6] Greenlight Biosci, Durham, NC USA
[7] Viqi Inc, SantaBarbara, CA USA
[8] Texas Adv Comp Ctr, Austin, TX USA
[9] Know Ctr GmbH, Graz, Austria
[10] Cold Spring Harbor Lab, DNA Learning Ctr, Long Isl City, NY USA
[11] Boyce Thompson Inst Plant Res, Ithaca, NY USA
[12] Crit Path Inst, Tucson, AZ USA
基金
英国生物技术与生命科学研究理事会; 美国国家科学基金会;
关键词
DATA-INTENSIVE SCIENCE; CLOUD;
D O I
10.1371/journal.pcbi.1011270
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
CyVerse, the largest publicly-funded open-source research cyberinfrastructure for life sciences, has played a crucial role in advancing data-driven research since the 2010s. As the technology landscape evolved with the emergence of cloud computing platforms, machine learning and artificial intelligence (AI) applications, CyVerse has enabled access by providing interfaces, Software as a Service (SaaS), and cloud-native Infrastructure as Code (IaC) to leverage new technologies. CyVerse services enable researchers to integrate institutional and private computational resources, custom software, perform analyses, and publish data in accordance with open science principles. Over the past 13 years, CyVerse has registered more than 124,000 verified accounts from 160 countries and was used for over 1,600 peer-reviewed publications. Since 2011, 45,000 students and researchers have been trained to use CyVerse. The platform has been replicated and deployed in three countries outside the US, with additional private deployments on commercial clouds for US government agencies and multinational corporations. In this manuscript, we present a strategic blueprint for creating and managing SaaS cyberinfrastructure and IaC as free and open-source software.
引用
收藏
页数:16
相关论文
共 89 条
[1]   Cloud-Native Repositories for Big Scientific Data [J].
Abernathey, Ryan P. ;
Blackmon-Luca, Charles C. ;
Crone, Timothy J. ;
Henderson, Naomi ;
Lepore, Chiara ;
Augspurger, Tom ;
Banihirwe, Anderson ;
Gentemann, Chelle L. ;
Hamman, Joseph J. ;
Henderson, Naomi ;
Lepore, Chiara ;
McCaie, Theo A. ;
Robinson, Niall H. ;
Signell, Richard P. .
COMPUTING IN SCIENCE & ENGINEERING, 2021, 23 (02) :26-35
[2]  
[Anonymous], 2003, NIH Data Sharing Policy and Implementation Guidance
[3]  
[Anonymous], 2020, Austrian DataLAB and Services-Cluster Forschungsdaten
[4]  
[Anonymous], 2015, RStudio: integrated development for R
[5]  
Ansible R.H., Ansible is simple it automation
[6]  
Atkins Daniel E., 2003, Report on the National Science Foundation Blue-Ribbon Advisory Panel on Cyberinfrastructure
[7]   Unmet needs for analyzing biological big data: A survey of 704 NSF principal investigators [J].
Barone, Lindsay ;
Williams, Jason ;
Micklos, David .
PLOS COMPUTATIONAL BIOLOGY, 2017, 13 (10)
[8]   The Internet2 distributed storage infrastructure project: an architecture for Internet content channels [J].
Beck, M ;
Moore, T .
COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (22-23) :2141-2148
[9]  
Belhajjame K, 2012, SePublica@ ESWC, P1
[10]   Containers and Cloud: From LXC to Docker to Kubernetes [J].
Bernstein, David .
IEEE CLOUD COMPUTING, 2014, 1 (03) :81-84