Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3

Cited by: 1208
Authors
Beghini, Francesco [1]
McIver, Lauren J. [2]
Blanco-Miguez, Aitor [1]
Dubois, Leonard [1]
Asnicar, Francesco [1]
Maharjan, Sagun [2,3]
Mailyan, Ana [2,3]
Manghi, Paolo [1]
Scholz, Matthias [4]
Thomas, Andrew Maltez [1]
Valles-Colomer, Mireia [1]
Weingart, George [2,3]
Zhang, Yancong [2,3]
Zolfo, Moreno [1]
Huttenhower, Curtis [2,3]
Franzosa, Eric A. [2,3]
Segata, Nicola [1,5]
Affiliations
[1] Univ Trento, Dept CIBIO, Trento, Italy
[2] Harvard TH Chan Sch Publ Hlth, Boston, MA 02115 USA
[3] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
[4] Edmund Mach Fdn, Res & Innovat Ctr, Dept Food Qual & Nutr, San Michele All Adige, Italy
[5] European Inst Oncol IRCCS, IEO, Milan, Italy
Funding
National Institutes of Health (USA); European Research Council; European Union Horizon 2020
Keywords
BACTERIAL TRANSMISSION; METAGENOMICS; ALIGNMENT; GENOMES; METABOLISM; BENCHMARKING; DEGRADATION; INFERENCE; PATTERNS; REVEALS
DOI
10.7554/eLife.65088
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Culture-independent analyses of microbial communities have progressed dramatically in the last decade, particularly due to advances in methods for biological profiling via shotgun metagenomics. Opportunities for improvement continue to accelerate, with greater access to multi-omics, microbial reference genomes, and strain-level diversity. To leverage these, we present bioBakery 3, a set of integrated, improved methods for taxonomic, strain-level, functional, and phylogenetic profiling of metagenomes, newly developed to build on the largest set of reference sequences now available. Compared to current alternatives, MetaPhlAn 3 increases the accuracy of taxonomic profiling, and HUMAnN 3 improves that of functional potential and activity. These methods detected novel disease-microbiome links in applications to colorectal cancer (CRC; 1262 metagenomes) and inflammatory bowel disease (IBD; 1635 metagenomes and 817 metatranscriptomes). Strain-level profiling of an additional 4077 metagenomes with StrainPhlAn 3 and PanPhlAn 3 unraveled the phylogenetic and functional structure of the common gut microbe Ruminococcus bromii, previously described by only 15 isolate genomes. With open-source implementations and cloud-deployable reproducible workflows, the bioBakery 3 platform can help researchers deepen the resolution, scale, and accuracy of multi-omic profiling for microbial community studies.
Pages: 42