Correcting for 16S rRNA gene copy numbers in microbiome surveys remains an unsolved problem

被引:226
作者
Louca, Stilianos [1 ,2 ]
Doebeli, Michael [1 ,2 ,3 ]
Parfrey, Laura Wegener [1 ,2 ,4 ]
机构
[1] Univ British Columbia, Biodivers Res Ctr, Vancouver, BC, Canada
[2] Univ British Columbia, Dept Zool, Vancouver, BC, Canada
[3] Univ British Columbia, Dept Math, Vancouver, BC, Canada
[4] Univ British Columbia, Dept Bot, Vancouver, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
16S rRNA; Gene copy number; Microbiome surveys; Phylogenetic reconstruction; COMMUNITY; PROFILES; TAXONOMY; TRAITS; TREES;
D O I
10.1186/s40168-018-0420-9
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
The 16S ribosomal RNA gene is the most widely used marker gene in microbial ecology. Counts of 16S sequence variants, often in PCR amplicons, are used to estimate proportions of bacterial and archaeal taxa in microbial communities. Because different organisms contain different 16S gene copy numbers (GCNs), sequence variant counts are biased towards clades with greater GCNs. Several tools have recently been developed for predicting GCNs using phylogenetic methods and based on sequenced genomes, in order to correct for these biases. However, the accuracy of those predictions has not been independently assessed. Here, we systematically evaluate the predictability of 16S GCNs across bacterial and archaeal clades, based on similar to 6,800 public sequenced genomes and using several phylogenetic methods. Further, we assess the accuracy of GCNs predicted by three recently published tools (PICRUSt, CopyRighter, and PAPRICA) over a wide range of taxa and for 635 microbial communities from varied environments. We find that regardless of the phylogenetic method tested, 16S GCNs could only be accurately predicted for a limited fraction of taxa, namely taxa with closely to moderately related representatives (less than or similar to 15% divergence in the 16S rRNA gene). Consistent with this observation, we find that all considered tools exhibit low predictive accuracy when evaluated against completely sequenced genomes, in some cases explaining less than 10% of the variance. Substantial disagreement was also observed between tools (R-2 < 0.5) for the majority of tested microbial communities. The nearest sequenced taxon index (NSTI) of microbial communities, i.e., the average distance to a sequenced genome, was a strong predictor for the agreement between GCN prediction tools on non-animal-associated samples, but only a moderate predictor for animal-associated samples. We recommend against correcting for 16S GCNs in microbiome surveys by default, unless OTUs are sufficiently closely related to sequenced genomes or unless a need for true OTU proportions warrants the additional noise introduced, so that community profiles remain interpretable and comparable between studies.
引用
收藏
页数:12
相关论文
共 37 条
[1]   CopyRighter: a rapid tool for improving the accuracy of microbial community profiles through lineage-specific gene copy number correction [J].
Angly, Florent E. ;
Dennis, Paul G. ;
Skarshewski, Adam ;
Vanwonterghem, Inka ;
Hugenholtz, Philip ;
Tyson, Gene W. .
MICROBIOME, 2014, 2
[2]  
[Anonymous], 1994, Accuracy trueness and precision of measurement methods and results
[3]   Testing for phylogenetic signal in comparative data: Behavioral traits are more labile [J].
Blomberg, SP ;
Garland, T ;
Ives, AR .
EVOLUTION, 2003, 57 (04) :717-745
[4]   Microbial Communities Can Be Described by Metabolic Structure: A General Framework and Application to a Seasonally Variable, Depth-Stratified Microbial Community from the Coastal West Antarctic Peninsula [J].
Bowman, Jeff S. ;
Ducklow, Hugh W. .
PLOS ONE, 2015, 10 (08)
[5]  
Edgar R.C., 2017, BIORXIV
[6]  
FELSENSTEIN J, 1985, AM NAT, V125, P1, DOI 10.1086/284325
[7]   The Earth Microbiome project: successes and aspirations [J].
Gilbert, Jack A. ;
Jansson, Janet K. ;
Knight, Rob .
BMC BIOLOGY, 2014, 12
[8]   25 years of serving the community with ribosomal RNA gene reference databases and tools [J].
Gloeckner, Frank Oliver ;
Yilmaz, Pelin ;
Quast, Christian ;
Gerken, Jan ;
Beccati, Alan ;
Ciuprina, Andreea ;
Bruns, Gerrit ;
Yarza, Pablo ;
Peplies, Joerg ;
Westram, Ralf ;
Ludwig, Wolfgang .
JOURNAL OF BIOTECHNOLOGY, 2017, 261 :169-176
[9]   Predicting microbial traits with phylogenies [J].
Goberna, Marta ;
Verdu, Miguel .
ISME JOURNAL, 2016, 10 (04) :959-967
[10]   Rfam: an RNA family database [J].
Griffiths-Jones, S ;
Bateman, A ;
Marshall, M ;
Khanna, A ;
Eddy, SR .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :439-441