Accurate, Rapid Taxonomic Classification of Fungal Large-Subunit rRNA Genes

被引:144
作者
Liu, Kuan-Liang [1 ,2 ]
Porras-Alfaro, Andrea [3 ,4 ]
Kuske, Cheryl R. [1 ]
Eichorst, Stephanie A. [1 ]
Xie, Gary [1 ]
机构
[1] Biosci Div, Los Alamos Natl Lab, Los Alamos, NM USA
[2] Natl Cheng Kung Univ, Inst Informat Management, Tainan, Taiwan
[3] Western Illinois Univ, Dept Biol Sci, Macomb, IL 61455 USA
[4] Univ New Mexico, Dept Biol, Albuquerque, NM 87131 USA
基金
美国国家科学基金会;
关键词
KINGDOM FUNGI; SEQUENCE-DATA; COMMUNITIES; PHYLOGENY; DIVERSITY; NETWORKS; IMPACT;
D O I
10.1128/AEM.06826-11
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Taxonomic and phylogenetic fingerprinting based on sequence analysis of gene fragments from the large-subunit rRNA (LSU) gene or the internal transcribed spacer (ITS) region is becoming an integral part of fungal classification. The lack of an accurate and robust classification tool trained by a validated sequence database for taxonomic placement of fungal LSU genes is a severe limitation in taxonomic analysis of fungal isolates or large data sets obtained from environmental surveys. Using a hand-curated set of 8,506 fungal LSU gene fragments, we determined the performance characteristics of a naive Bayesian classifier across multiple taxonomic levels and compared the classifier performance to that of a sequence similarity-based (BLASTN) approach. The naive Bayesian classifier was computationally more rapid (>460-fold with our system) than the BLASTN approach, and it provided equal or superior classification accuracy. Classifier accuracies were compared using sequence fragments of 100 bp and 400 bp and two different PCR primer anchor points to mimic sequence read lengths commonly obtained using current high-throughput sequencing technologies. Accuracy was higher with 400-bp sequence reads than with 100-bp reads. It was also significantly affected by sequence location across the 1,400-bp test region. The highest accuracy was obtained across either the D1 or D2 variable region. The naive Bayesian classifier provides an effective and rapid means to classify fungal LSU sequences from large environmental surveys. The training set and tool are publicly available through the Ribosomal Database Project (http://rdp.cme.msu.edu/classifiericlassifier.jsp).
引用
收藏
页码:1523 / 1533
页数:11
相关论文
共 27 条
[1]   A Phylogenetic Estimation of Trophic Transition Networks for Ascomycetous Fungi: Are Lichens Cradles of Symbiotrophic Fungal Diversification? [J].
Arnold, A. Elizabeth ;
Miadlikowska, Jolanta ;
Higgins, K. Lindsay ;
Sarvate, Snehal D. ;
Gugger, Paul ;
Way, Amanda ;
Hofstetter, Valerie ;
Kauff, Frank ;
Lutzoni, Francois .
SYSTEMATIC BIOLOGY, 2009, 58 (03) :283-297
[2]   Research coordination networks: a phylogeny for kingdom Fungi (Deep Hypha) [J].
Blackwell, Meredith ;
Hibbett, David S. ;
Taylor, John W. ;
Spatafora, Joseph W. .
MYCOLOGIA, 2006, 98 (06) :829-837
[3]   THE FUNGI: 1, 2, 3 ... 5.1 MILLION SPECIES? [J].
Blackwell, Meredith .
AMERICAN JOURNAL OF BOTANY, 2011, 98 (03) :426-438
[4]   454 Pyrosequencing analyses of forest soils reveal an unexpectedly high fungal diversity [J].
Buee, M. ;
Reich, M. ;
Murat, C. ;
Morin, E. ;
Nilsson, R. H. ;
Uroz, S. ;
Martin, F. .
NEW PHYTOLOGIST, 2009, 184 (02) :449-456
[5]   BLAST plus : architecture and applications [J].
Camacho, Christiam ;
Coulouris, George ;
Avagyan, Vahram ;
Ma, Ning ;
Papadopoulos, Jason ;
Bealer, Kevin ;
Madden, Thomas L. .
BMC BIOINFORMATICS, 2009, 10
[6]   Comparative Analysis of Pyrosequencing and a Phylogenetic Microarray for Exploring Microbial Community Structures in the Human Distal Intestine [J].
Claesson, Marcus J. ;
O'Sullivan, Orla ;
Wang, Qiong ;
Nikkilae, Janne ;
Marchesi, Julian R. ;
Smidt, Hauke ;
de Vos, Willem M. ;
Ross, R. Paul ;
O'Toole, Paul W. .
PLOS ONE, 2009, 4 (08)
[7]   The Ribosomal Database Project: improved alignments and new tools for rRNA analysis [J].
Cole, J. R. ;
Wang, Q. ;
Cardenas, E. ;
Fish, J. ;
Chai, B. ;
Farris, R. J. ;
Kulam-Syed-Mohideen, A. S. ;
McGarrell, D. M. ;
Marsh, T. ;
Garrity, G. M. ;
Tiedje, J. M. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D141-D145
[8]   Substantial biases in ultra-short read data sets from high-throughput DNA sequencing [J].
Dohm, Juliane C. ;
Lottaz, Claudio ;
Borodina, Tatiana ;
Himmelbauer, Heinz .
NUCLEIC ACIDS RESEARCH, 2008, 36 (16)
[9]  
GUADET J, 1989, MOL BIOL EVOL, V6, P227
[10]  
Hibbett David S., 2011, Fungal Biology Reviews, V25, P38, DOI 10.1016/j.fbr.2011.01.001