bcgTree: automatized phylogenetic tree building from bacterial core genomes

被引:120
作者
Ankenbrand, Markus J. [1 ]
Keller, Alexander [1 ]
机构
[1] Univ Wurzburg, Dept Anim Ecol & Trop Biol, Wurzburg, Germany
关键词
bacteria; phylogeny; genome; phylogenomics; multi-gene; PROTEIN FAMILIES; HIGH-ACCURACY; DATABASE; RECONSTRUCTION; ALIGNMENTS; DIVERGENT; SEQUENCES; BLOCKS; GENES;
D O I
10.1139/gen-2015-0175
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The need for multi-gene analyses in scientific fields such as phylogenetics and DNA barcoding has increased in recent years. In particular, these approaches are increasingly important for differentiating bacterial species, where reliance on the standard 16S rDNA marker can result in poor resolution. Additionally, the assembly of bacterial genomes has become a standard task due to advances in next-generation sequencing technologies. We created a bioinformatic pipeline, bcgTree, which uses assembled bacterial genomes either from databases or own sequencing results from the user to reconstruct their phylogenetic history. The pipeline automatically extracts 107 essential single-copy core genes, found in a majority of bacteria, using hidden Markov models and performs a partitioned maximum-likelihood analysis. Here, we describe the workflow of bcgTree and, as a proof-of-concept, its usefulness in resolving the phylogeny of 293 publically available bacterial strains of the genus Lactobacillus. We also evaluate its performance in both low-and high-level taxonomy test sets. The tool is freely available at github (https://github.com/iimog/bcgTree) and our institutional homepage (http://www.dna-analytics.biozentrum.uni-wuerzburg.de).
引用
收藏
页码:783 / 791
页数:9
相关论文
共 32 条
[1]  
[Anonymous], 2010, R LANG ENV STAT COMP
[2]  
[Anonymous], BIOINFORMATICS
[3]   A phylogenomics approach for selecting robust sets of phylogenetic markers [J].
Capella-Gutierrez, Salvador ;
Kauff, Frank ;
Gabaldon, Toni .
NUCLEIC ACIDS RESEARCH, 2014, 42 (07) :e54
[4]   Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis [J].
Castresana, J .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (04) :540-552
[5]   KINGDOM PROTOZOA AND ITS 18 PHYLA [J].
CAVALIERSMITH, T .
MICROBIOLOGICAL REVIEWS, 1993, 57 (04) :953-994
[6]   Toward automatic reconstruction of a highly resolved tree of life [J].
Ciccarelli, FD ;
Doerks, T ;
von Mering, C ;
Creevey, CJ ;
Snel, B ;
Bork, P .
SCIENCE, 2006, 311 (5765) :1283-1287
[7]  
De Vos P., 2009, BERGEYS MANUAL SYSTE, V3, DOI [10.1007/978-0-387-68489-5, DOI 10.1007/978-0-387-68489-5]
[8]   Genomic insights to SAR86, an abundant and uncultivated marine bacterial lineage [J].
Dupont, Chris L. ;
Rusch, Douglas B. ;
Yooseph, Shibu ;
Lombardo, Mary-Jane ;
Richter, R. Alexander ;
Valas, Ruben ;
Novotny, Mark ;
Yee-Greenbaum, Joyclyn ;
Selengut, Jeremy D. ;
Haft, Dan H. ;
Halpern, Aaron L. ;
Lasken, Roger S. ;
Nealson, Kenneth ;
Friedman, Robert ;
Venter, J. Craig .
ISME JOURNAL, 2012, 6 (06) :1186-1199
[9]  
Eddy S., 2010, Hmmer3: a new generation of sequence homology search software
[10]   MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797