Utilization of defined microbial communities enables effective evaluation of meta-genomic assemblies

被引:18
作者
Greenwald, William W. [1 ]
Klitgord, Niels [2 ]
Seguritan, Victor [2 ]
Yooseph, Shibu [4 ]
Venter, J. Craig [2 ,3 ]
Garner, Chad [2 ]
Nelson, Karen E. [2 ,3 ]
Li, Weizhong [2 ,3 ]
机构
[1] Univ Calif San Diego, Bioinformat & Syst Biol, La Jolla, CA 92093 USA
[2] Human Longevity Inc, San Diego, CA 92121 USA
[3] J Craig Venter Inst, La Jolla, CA 92037 USA
[4] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA
来源
BMC GENOMICS | 2017年 / 18卷
关键词
DE-NOVO ASSEMBLER; GUT; SEQUENCES; PROJECT; DISEASE; READS;
D O I
10.1186/s12864-017-3679-5
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Metagenomics is the study of the microbial genomes isolated from communities found on our bodies or in our environment. By correctly determining the relation between human health and the human associated microbial communities, novel mechanisms of health and disease can be found, thus enabling the development of novel diagnostics and therapeutics. Due to the diversity of the microbial communities, strategies developed for aligning human genomes cannot be utilized, and genomes of the microbial species in the community must be assembled de novo. However, in order to obtain the best metagenomic assemblies, it is important to choose the proper assembler. Due to the rapidly evolving nature of metagenomics, new assemblers are constantly created, and the field has not yet agreed on a standardized process. Furthermore, the truth sets used to compare these methods are either too simple (computationally derived diverse communities) or complex (microbial communities of unknown composition), yielding results that are hard to interpret. In this analysis, we interrogate the strengths and weaknesses of five popular assemblers through the use of defined biological samples of known genomic composition and abundance. We assessed the performance of each assembler on their ability to reassemble genomes, call taxonomic abundances, and recreate open reading frames (ORFs). Results: We tested five metagenomic assemblers: Omega, metaSPAdes, IDBA-UD, metaVelvet and MEGAHIT on known and synthetic metagenomic data sets. MetaSPAdes excelled in diverse sets, IDBA-UD performed well all around, metaVelvet had high accuracy in high abundance organisms, and MEGAHIT was able to accurately differentiate similar organisms within a community. At the ORF level, metaSPAdes and MEGAHIT had the least number of missing ORFs within diverse and similar communities respectively. Conclusions: Depending on the metagenomics question asked, the correct assembler for the task at hand will differ. It is important to choose the appropriate assembler, and thus clearly define the biological problem of an experiment, as different assemblers will give different answers to the same question.
引用
收藏
页数:11
相关论文
共 25 条
[1]  
[Anonymous], ARXIV160403071
[2]   Ray Meta: scalable de novo metagenome assembly and profiling [J].
Boisvert, Sebastien ;
Raymond, Frederic ;
Godzaridis, Elenie ;
Laviolette, Francois ;
Corbeil, Jacques .
GENOME BIOLOGY, 2012, 13 (12)
[3]   Trimmomatic: a flexible trimmer for Illumina sequence data [J].
Bolger, Anthony M. ;
Lohse, Marc ;
Usadel, Bjoern .
BIOINFORMATICS, 2014, 30 (15) :2114-2120
[4]  
Cho I, 2012, HUMAN MICROBIOME INT
[5]   Gut-liver axis: The impact of gut microbiota on non alcoholic fatty liver disease [J].
Compare, D. ;
Coccoli, P. ;
Rocco, A. ;
Nardone, O. M. ;
De Maria, S. ;
Carteni, M. ;
Nardone, G. .
NUTRITION METABOLISM AND CARDIOVASCULAR DISEASES, 2012, 22 (06) :471-476
[6]   Molecular-phylogenetic characterization of microbial community imbalances in human inflammatory bowel diseases [J].
Frank, Daniel N. ;
Amand, Allison L. St. ;
Feldman, Robert A. ;
Boedeker, Edgar C. ;
Harpaz, Noam ;
Pace, Norman R. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (34) :13780-13785
[7]   Metagenomic analysis of the human distal gut microbiome [J].
Gill, Steven R. ;
Pop, Mihai ;
DeBoy, Robert T. ;
Eckburg, Paul B. ;
Turnbaugh, Peter J. ;
Samuel, Buck S. ;
Gordon, Jeffrey I. ;
Relman, David A. ;
Fraser-Liggett, Claire M. ;
Nelson, Karen E. .
SCIENCE, 2006, 312 (5778) :1355-1359
[8]   Omega: an Overlap-graph de novo Assembler for Metagenomics [J].
Haider, Bahlul ;
Ahn, Tae-Hyuk ;
Bushnell, Brian ;
Chai, Juanjuan ;
Copeland, Alex ;
Pan, Chongle .
BIOINFORMATICS, 2014, 30 (19) :2717-2722
[9]   Microbiota Modulate Behavioral and Physiological Abnormalities Associated with Neurodevelopmental Disorders [J].
Hsiao, Elaine Y. ;
McBride, Sara W. ;
Hsien, Sophia ;
Sharon, Gil ;
Hyde, Embriette R. ;
McCue, Tyler ;
Codelli, Julian A. ;
Chow, Janet ;
Reisman, Sarah E. ;
Petrosino, Joseph F. ;
Patterson, Paul H. ;
Mazmanian, Sarkis K. .
CELL, 2013, 155 (07) :1451-1463
[10]   Library preparation methodology can influence genomic and functional predictions in human microbiome research [J].
Jones, Marcus B. ;
Highlander, Sarah K. ;
Anderson, Ericka L. ;
Li, Weizhong ;
Dayrit, Mark ;
Klitgord, Niels ;
Fabani, Martin M. ;
Seguritan, Victor ;
Green, Jessica ;
Pride, David T. ;
Yooseph, Shibu ;
Biggs, William ;
Nelson, Karen E. ;
Venter, J. Craig .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (45) :14024-14029