Advancing analytical algorithms and pipelines for billions of microbial sequences

被引:33
作者
Gonzalez, Antonio [2 ]
Knight, Rob [1 ,3 ]
机构
[1] Univ Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
[2] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
[3] Univ Colorado, Howard Hughes Med Inst, Boulder, CO 80309 USA
基金
美国国家卫生研究院;
关键词
COMMUNITIES; MACHINE; UNIFRAC;
D O I
10.1016/j.copbio.2011.11.028
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The vast number of microbial sequences resulting from sequencing efforts using new technologies require us to reassess currently available analysis methodologies and tools. Here we describe trends in the development and distribution of software for analyzing microbial sequence data. We then focus on one widely used set of methods, dimensionality reduction techniques, which allow users to summarize and compare these vast datasets. We conclude by emphasizing the utility of formal software engineering methods for the development of computational biology tools, and the need for new algorithms for comparing microbial communities. Such large-scale comparisons will allow us to fulfill the dream of rapid integration and comparison of microbial sequence data sets, in a replicable analytical environment, in order to describe the microbial world we inhabit.
引用
收藏
页码:64 / 71
页数:8
相关论文
共 43 条
[1]  
Abrahamsson P., 2002, VTT publications, P167
[2]   CloVR: A virtual machine for automated and portable sequence analysis from the desktop using cloud computing [J].
Angiuoli, Samuel V. ;
Matalka, Malcolm ;
Gussman, Aaron ;
Galens, Kevin ;
Vangala, Mahesh ;
Riley, David R. ;
Arze, Cesar ;
White, James R. ;
White, Owen ;
Fricke, W. Florian .
BMC BIOINFORMATICS, 2011, 12
[3]  
[Anonymous], 2007, Dimension reduction
[4]  
[Anonymous], 2010, v7.10.0 (R2010a)
[5]   SmashCommunity: a metagenomic annotation and analysis tool [J].
Arumugam, Manimozhiyan ;
Harrington, Eoghan D. ;
Foerstner, Konrad U. ;
Raes, Jeroen ;
Bork, Peer .
BIOINFORMATICS, 2010, 26 (23) :2977-2978
[6]   Host-bacterial mutualism in the human intestine [J].
Bäckhed, F ;
Ley, RE ;
Sonnenburg, JL ;
Peterson, DA ;
Gordon, JI .
SCIENCE, 2005, 307 (5717) :1915-1920
[7]   Generation of Multimillion-Sequence 16S rRNA Gene Libraries from Complex Microbial Communities by Assembling Paired-End Illumina Reads [J].
Bartram, Andrea K. ;
Lynch, Michael D. J. ;
Stearns, Jennifer C. ;
Moreno-Hagelsieb, Gabriel ;
Neufeld, Josh D. .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2011, 77 (11) :3846-3852
[8]   Moving pictures of the human microbiome [J].
Caporaso, J. Gregory ;
Lauber, Christian L. ;
Costello, Elizabeth K. ;
Berg-Lyons, Donna ;
Gonzalez, Antonio ;
Stombaugh, Jesse ;
Knights, Dan ;
Gajer, Pawel ;
Ravel, Jacques ;
Fierer, Noah ;
Gordon, Jeffrey I. ;
Knight, Rob .
GENOME BIOLOGY, 2011, 12 (05)
[9]   Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample [J].
Caporaso, J. Gregory ;
Lauber, Christian L. ;
Walters, William A. ;
Berg-Lyons, Donna ;
Lozupone, Catherine A. ;
Turnbaugh, Peter J. ;
Fierer, Noah ;
Knight, Rob .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 :4516-4522
[10]   QIIME allows analysis of high-throughput community sequencing data [J].
Caporaso, J. Gregory ;
Kuczynski, Justin ;
Stombaugh, Jesse ;
Bittinger, Kyle ;
Bushman, Frederic D. ;
Costello, Elizabeth K. ;
Fierer, Noah ;
Pena, Antonio Gonzalez ;
Goodrich, Julia K. ;
Gordon, Jeffrey I. ;
Huttley, Gavin A. ;
Kelley, Scott T. ;
Knights, Dan ;
Koenig, Jeremy E. ;
Ley, Ruth E. ;
Lozupone, Catherine A. ;
McDonald, Daniel ;
Muegge, Brian D. ;
Pirrung, Meg ;
Reeder, Jens ;
Sevinsky, Joel R. ;
Tumbaugh, Peter J. ;
Walters, William A. ;
Widmann, Jeremy ;
Yatsunenko, Tanya ;
Zaneveld, Jesse ;
Knight, Rob .
NATURE METHODS, 2010, 7 (05) :335-336