CloudMan as a platform for tool, data, and analysis distribution

被引:30
作者
Afgan, Enis [1 ,3 ,4 ]
Chapman, Brad [2 ]
Taylor, James [3 ,4 ]
机构
[1] Rudjer Boskovic Inst, Ctr Informat & Comp CIR, Zagreb, Croatia
[2] Harvard Univ, Sch Publ Hlth, Bioinformat Core, Boston, MA 02115 USA
[3] Emory Univ, Dept Biol, Atlanta, GA 30322 USA
[4] Emory Univ, Dept Math & Comp Sci, Atlanta, GA 30322 USA
关键词
Cloud computing; Service customization; Reproducibility; Accessibility; Galaxy; GALAXY;
D O I
10.1186/1471-2105-13-315
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Cloud computing provides an infrastructure that facilitates large scale computational analysis in a scalable, democratized fashion, However, in this context it is difficult to ensure sharing of an analysis environment and associated data in a scalable and precisely reproducible way. Results: CloudMan (usecloudman.org) enables individual researchers to easily deploy, customize, and share their entire cloud analysis environment, including data, tools, and configurations. Conclusions: With the enabled customization and sharing of instances, CloudMan can be used as a platform for collaboration. The presented solution improves accessibility of cloud resources, tools, and data to the level of an individual researcher and contributes toward reproducibility and transparency of research solutions.
引用
收藏
页数:7
相关论文
共 9 条
[1]   A reference model for deploying applications in virtualized environments [J].
Afgan, Enis ;
Baker, Dannon ;
Nekrutenko, Anton ;
Taylor, James .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2012, 24 (12) :1349-1361
[2]   Harnessing cloud computing with Galaxy Cloud [J].
Afgan, Enis ;
Baker, Dannon ;
Coraor, Nate ;
Goto, Hiroki ;
Paul, Ian M. ;
Makova, Kateryna D. ;
Nekrutenko, Anton ;
Taylor, James .
NATURE BIOTECHNOLOGY, 2011, 29 (11) :972-974
[3]  
Afgan E, 2011, COMPUT COMMUN NETW S, P145, DOI 10.1007/978-0-85729-439-5_6
[4]   Galaxy CloudMan: delivering cloud compute clusters [J].
Afgan, Enis ;
Baker, Dannon ;
Coraor, Nate ;
Chapman, Brad ;
Nekrutenko, Anton ;
Taylor, James .
BMC BIOINFORMATICS, 2010, 11
[5]   A framework for variation discovery and genotyping using next-generation DNA sequencing data [J].
DePristo, Mark A. ;
Banks, Eric ;
Poplin, Ryan ;
Garimella, Kiran V. ;
Maguire, Jared R. ;
Hartl, Christopher ;
Philippakis, Anthony A. ;
del Angel, Guillermo ;
Rivas, Manuel A. ;
Hanna, Matt ;
McKenna, Aaron ;
Fennell, Tim J. ;
Kernytsky, Andrew M. ;
Sivachenko, Andrey Y. ;
Cibulskis, Kristian ;
Gabriel, Stacey B. ;
Altshuler, David ;
Daly, Mark J. .
NATURE GENETICS, 2011, 43 (05) :491-+
[6]   Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences [J].
Goecks, Jeremy ;
Nekrutenko, Anton ;
Taylor, James .
GENOME BIOLOGY, 2010, 11 (08)
[7]   Searching for SNPs with cloud computing [J].
Langmead, Ben ;
Schatz, Michael C. ;
Lin, Jimmy ;
Pop, Mihai ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2009, 10 (11)
[8]   Cloud computing and the DNA data race [J].
Schatz, Michael C. ;
Langmead, Ben ;
Salzberg, Steven L. .
NATURE BIOTECHNOLOGY, 2010, 28 (07) :691-693
[9]   CloudBurst: highly sensitive read mapping with MapReduce [J].
Schatz, Michael C. .
BIOINFORMATICS, 2009, 25 (11) :1363-1369