Microbial resolution of whole genome shotgun and 16S amplicon metagenomic sequencing using publicly available NEON data

被引:115
作者
Brumfield, Kyle D. [1 ,2 ]
Huq, Anwar [1 ]
Colwell, Rita R. [1 ,2 ,3 ]
Olds, James L. [4 ]
Leddy, Menu B. [5 ]
机构
[1] Univ Maryland, Maryland Pathogen Res Inst, College Pk, MD 20742 USA
[2] Univ Maryland, Inst Adv Comp Studies, College Pk, MD 20742 USA
[3] CosmosID Inc, Rockville, MD USA
[4] George Mason Univ, Schar Sch, Arlington, VA USA
[5] Essential Environm & Engn Syst, Huntington Beach, CA 92649 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
RIBOSOMAL-RNA SEQUENCES; GENE DATABASE; EXTRAPOLATION; RAREFACTION; COMMUNITIES; DIVERSITY; ECOLOGY; DNA; IDENTIFICATION; EXPRESSION;
D O I
10.1371/journal.pone.0228899
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Microorganisms are ubiquitous in the biosphere, playing a crucial role in both biogeochemistry of the planet and human health. However, identifying these microorganisms and defining their function are challenging. Widely used approaches in comparative metagenomics, 16S amplicon sequencing and whole genome shotgun sequencing (WGS), have provided access to DNA sequencing analysis to identify microorganisms and evaluate diversity and abundance in various environments. However, advances in parallel high-throughput DNA sequencing in the past decade have introduced major hurdles, namely standardization of methods, data storage, reproducible interoperability of results, and data sharing. The National Ecological Observatory Network (NEON), established by the National Science Foundation, enables all researchers to address queries on a regional to continental scale around a variety of environmental challenges and provide high-quality, integrated, and standardized data from field sites across the U.S. As the amount of metagenomic data continues to grow, standardized procedures that allow results across projects to be assessed and compared is becoming increasingly important in the field of metagenomics. We demonstrate the feasibility of using publicly available NEON soil metagenomic sequencing datasets in combination with open access Metagenomics Rapid Annotation using the Subsystem Technology (MG-RAST) server to illustrate advantages of WGS compared to 16S amplicon sequencing. Four WGS and four 16S amplicon sequence datasets, from surface soil samples prepared by NEON investigators, were selected for comparison, using standardized protocols collected at the same locations in Colorado between April-July 2014. The dominant bacterial phyla detected across samples agreed between sequencing methodologies. However, WGS yielded greater microbial resolution, increased accuracy, and allowed identification of more genera of bacteria, archaea, viruses, and eukaryota, and putative functional genes that would have gone undetected using 16S amplicon sequencing. NEON open data will be useful for future studies characterizing and quantifying complex ecological processes associated with changing aquatic and terrestrial ecosystems.
引用
收藏
页数:21
相关论文
共 78 条
[1]   Divergence and redundancy of 16S rRNA sequences in genomes with multiple rrn operons [J].
Acinas, SG ;
Marcelino, LA ;
Klepac-Ceraj, V ;
Polz, MF .
JOURNAL OF BACTERIOLOGY, 2004, 186 (09) :2629-2635
[2]   Back to Basics - The Influence of DNA Extraction and Primer Choice on Phylogenetic Analysis of Activated Sludge Communities [J].
Albertsen, Mads ;
Karst, Soren M. ;
Ziegler, Anja S. ;
Kirkegaard, Rasmus H. ;
Nielsen, Per H. .
PLOS ONE, 2015, 10 (07)
[3]  
[Anonymous], DATA STANDARDS OMICS
[4]  
[Anonymous], NEONS SCI DES STAND
[5]  
[Anonymous], FREQ ASK QUEST
[6]  
[Anonymous], PROT IMPL OP ACC DAT
[7]  
BAIROCH A, 1994, NUCLEIC ACIDS RES, V22, P3578
[8]   Generation of Multimillion-Sequence 16S rRNA Gene Libraries from Complex Microbial Communities by Assembling Paired-End Illumina Reads [J].
Bartram, Andrea K. ;
Lynch, Michael D. J. ;
Stearns, Jennifer C. ;
Moreno-Hagelsieb, Gabriel ;
Neufeld, Josh D. .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2011, 77 (11) :3846-3852
[9]  
Benson DA, 2010, NUCLEIC ACIDS RES, V38, pD46, DOI [10.1093/nar/gkw1070, 10.1093/nar/gkp1024, 10.1093/nar/gkl986, 10.1093/nar/gkg057, 10.1093/nar/gks1195, 10.1093/nar/gkx1094, 10.1093/nar/gkn723, 10.1093/nar/gkq1079, 10.1093/nar/gkr1202]
[10]   Metagenomic signatures of the Peru Margin subseafloor biosphere show a genetically distinct environment [J].
Biddle, Jennifer F. ;
Fitz-Gibbon, Sorel ;
Schuster, Stephan C. ;
Brenchley, Jean E. ;
House, Christopher H. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (30) :10583-10588