Improved taxonomic assignment of human intestinal 16S rRNA sequences by a dedicated reference database

被引:110
作者
Ritari, Jarmo [1 ]
Salojaervi, Jarkko [1 ]
Lahti, Leo [1 ,2 ]
de Vos, Willem M. [1 ,2 ,3 ]
机构
[1] Univ Helsinki, Dept Vet Biosci, Helsinki, Finland
[2] Wageningen Univ, Lab Microbiol, NL-6700 AP Wageningen, Netherlands
[3] Univ Helsinki, Dept Bacteriol & Immunol, Helsinki, Finland
基金
欧洲研究理事会; 芬兰科学院;
关键词
Next-generation sequencing; 16S; Ribosomal RNA; Human intestinal microbiota; Bacteria; Archaea; Taxonomy; GUT MICROBIOME; GENE DATABASE; DIVERSITY; CHILDREN; IDENTIFICATION; ACCURATE; CATALOG; TOOLS; SILVA;
D O I
10.1186/s12864-015-2265-y
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Current sequencing technology enables taxonomic profiling of microbial ecosystems at high resolution and depth by using the 16S rRNA gene as a phylogenetic marker. Taxonomic assignation of newly acquired data is based on sequence comparisons with comprehensive reference databases to find consensus taxonomy for representative sequences. Nevertheless, even with well-characterised ecosystems like the human intestinal microbiota it is challenging to assign genus and species level taxonomy to 16S rRNA amplicon reads. A part of the explanation may lie in the sheer size of the search space where competition from a multitude of highly similar sequences may not allow reliable assignation at low taxonomic levels. However, when studying a particular environment such as the human intestine, it can be argued that a reference database comprising only sequences that are native to the environment would be sufficient, effectively reducing the search space. Results: We constructed a 16S rRNA gene database based on high-quality sequences specific for human intestinal microbiota, resulting in curated data set consisting of 2473 unique prokaryotic species-like groups and their taxonomic lineages, and compared its performance against the Greengenes and Silva databases. The results showed that regardless of used assignment algorithm, our database improved taxonomic assignation of 16S rRNA sequencing data by enabling significantly higher species and genus level assignation rate while preserving taxonomic diversity and demanding less computational resources. Conclusion: The curated human intestinal 16S rRNA gene taxonomic database of about 2500 species-like groups described here provides a practical solution for significantly improved taxonomic assignment for phylogenetic studies of the human intestinal microbiota.
引用
收藏
页数:10
相关论文
共 52 条
[1]   PHYLOGENETIC IDENTIFICATION AND IN-SITU DETECTION OF INDIVIDUAL MICROBIAL-CELLS WITHOUT CULTIVATION [J].
AMANN, RI ;
LUDWIG, W ;
SCHLEIFER, KH .
MICROBIOLOGICAL REVIEWS, 1995, 59 (01) :143-169
[2]   The Use of Coded PCR Primers Enables High-Throughput Sequencing of Multiple Homolog Amplification Products by 454 Parallel Sequencing [J].
Binladen, Jonas ;
Gilbert, M. Thomas P. ;
Bollback, Jonathan P. ;
Panitz, Frank ;
Bendixen, Christian ;
Nielsen, Rasmus ;
Willerslev, Eske .
PLOS ONE, 2007, 2 (02)
[3]   Genome Sequence of "Candidatus Methanomassiliicoccus intestinalis" Issoire-Mx1, a Third Thermoplasmatales-Related Methanogenic Archaeon from Human Feces [J].
Borrel, Guillaume ;
Harris, Hugh M. B. ;
Parisot, Nicolas ;
Gaci, Nadia ;
Tottey, William ;
Mihajlovski, Agnes ;
Deane, Jennifer ;
Gribaldo, Simonetta ;
Bardot, Olivier ;
Peyretaillade, Eric ;
Peyret, Pierre ;
O'Toole, Paul W. ;
Brugere, Jean-Francois .
GENOME ANNOUNCEMENTS, 2013, 1 (04)
[4]   Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms [J].
Caporaso, J. Gregory ;
Lauber, Christian L. ;
Walters, William A. ;
Berg-Lyons, Donna ;
Huntley, James ;
Fierer, Noah ;
Owens, Sarah M. ;
Betley, Jason ;
Fraser, Louise ;
Bauer, Markus ;
Gormley, Niall ;
Gilbert, Jack A. ;
Smith, Geoff ;
Knight, Rob .
ISME JOURNAL, 2012, 6 (08) :1621-1624
[5]   QIIME allows analysis of high-throughput community sequencing data [J].
Caporaso, J. Gregory ;
Kuczynski, Justin ;
Stombaugh, Jesse ;
Bittinger, Kyle ;
Bushman, Frederic D. ;
Costello, Elizabeth K. ;
Fierer, Noah ;
Pena, Antonio Gonzalez ;
Goodrich, Julia K. ;
Gordon, Jeffrey I. ;
Huttley, Gavin A. ;
Kelley, Scott T. ;
Knights, Dan ;
Koenig, Jeremy E. ;
Ley, Ruth E. ;
Lozupone, Catherine A. ;
McDonald, Daniel ;
Muegge, Brian D. ;
Pirrung, Meg ;
Reeder, Jens ;
Sevinsky, Joel R. ;
Tumbaugh, Peter J. ;
Walters, William A. ;
Widmann, Jeremy ;
Yatsunenko, Tanya ;
Zaneveld, Jesse ;
Knight, Rob .
NATURE METHODS, 2010, 7 (05) :335-336
[6]   Contribution of the Intestinal Microbiota to Human Health: From Birth to 100 Years of Age [J].
Cheng, Jing ;
Palva, Airi M. ;
de Vos, Willem M. ;
Satokari, Reetta .
BETWEEN PATHOGENICITY AND COMMENSALISM, 2013, 358 :323-346
[7]   Ribosomal Database Project: data and tools for high throughput rRNA analysis [J].
Cole, James R. ;
Wang, Qiong ;
Fish, Jordan A. ;
Chai, Benli ;
McGarrell, Donna M. ;
Sun, Yanni ;
Brown, C. Titus ;
Porras-Alfaro, Andrea ;
Kuske, Cheryl R. ;
Tiedje, James M. .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D633-D642
[8]   Quality-Score Refinement of SSU rRNA Gene Pyrosequencing Differs Across Gene Region for Environmental Samples [J].
De Leon, Kara Bowen ;
Ramsay, Bradley D. ;
Fields, Matthew W. .
MICROBIAL ECOLOGY, 2012, 64 (02) :499-508
[9]   Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB [J].
DeSantis, T. Z. ;
Hugenholtz, P. ;
Larsen, N. ;
Rojas, M. ;
Brodie, E. L. ;
Keller, K. ;
Huber, T. ;
Dalevi, D. ;
Hu, P. ;
Andersen, G. L. .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2006, 72 (07) :5069-5072
[10]   The human gut and groundwater harbor non-photosynthetic bacteria belonging to a new candidate phylum sibling to Cyanobacteria [J].
Di Rienzi, Sara C. ;
Sharon, Itai ;
Wrighton, Kelly C. ;
Koren, Omry ;
Hug, Laura A. ;
Thomas, Brian C. ;
Goodrich, Julia K. ;
Bell, Jordana T. ;
Spector, Timothy D. ;
Banfield, Jillian F. ;
Ley, Ruth E. .
ELIFE, 2013, 2