Mycobacterial Phylogenomics: An Enhanced Method for Gene Turnover Analysis Reveals Uneven Levels of Gene Gain and Loss among Species and Gene Families

被引:14
作者
Librado, Pablo [1 ,2 ]
Vieira, Filipe G. [1 ,2 ,3 ]
Sanchez-Gracia, Alejandro [1 ,2 ]
Kolokotronis, Sergios-Orestis [4 ,5 ]
Rozas, Julio [1 ,2 ]
机构
[1] Univ Barcelona, Dept Genet, Barcelona, Spain
[2] Univ Barcelona, Inst Recerca Biodiversitat IRBio, Barcelona, Spain
[3] Univ Calif Berkeley, Dept Integrat Biol, Berkeley, CA 94720 USA
[4] Fordham Univ, Dept Biol Sci, Bronx, NY 10458 USA
[5] Amer Museum Nat Hist, Sackler Inst Comparat Genom, New York, NY 10024 USA
关键词
gene turnover rates; gene gain and loss; gene families; maximum likelihood; rate heterogeneity; M; tuberculosis; PROTEIN FAMILY; TUBERCULOSIS; EVOLUTION; IDENTIFICATION; VIRULENCE; SEQUENCE; GENOMES; MODEL; RATES; BIOSYNTHESIS;
D O I
10.1093/gbe/evu117
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Species of the genus Mycobacterium differ in several features, from geographic ranges, and degree of pathogenicity, to ecological and host preferences. The recent availability of several fully sequenced genomes for a number of these species enabled the comparative study of the genetic determinants of this wide lifestyle diversity. Here, we applied two complementary phylogenetic-based approaches using information from 19 Mycobacterium genomes to obtain a more comprehensive view of the evolution of this genus. First, we inferred the phylogenetic relationships using two new approaches, one based on a Mycobacterium-specific amino acid substitution matrix and the other on a gene content dissimilarity matrix. Then, we utilized our recently developed gain-and-death stochastic models to study gene turnover dynamics in this genus in a maximum-likelihood framework. We uncovered a scenario that differs markedly from traditional 16S rRNA data and improves upon recent phylogenomic approaches. We also found that the rates of gene gain and death are high and unevenly distributed both across species and across gene families, further supporting the utility of the new models of rate heterogeneity applied in a phylogenetic context. Finally, the functional annotation of the most expanded or contracted gene families revealed that the transposable elements and the fatty acid metabolism-related gene families are the most important drivers of gene content evolution in Mycobacterium.
引用
收藏
页码:1454 / 1465
页数:12
相关论文
共 64 条
[1]   Divergence and redundancy of 16S rRNA sequences in genomes with multiple rrn operons [J].
Acinas, SG ;
Marcelino, LA ;
Klepac-Ceraj, V ;
Polz, MF .
JOURNAL OF BACTERIOLOGY, 2004, 186 (09) :2629-2635
[2]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[3]   T-REX: a web server for inferring, validating and visualizing phylogenetic trees and networks [J].
Alix, Boc ;
Boubacar, Diallo Alpha ;
Vladimir, Makarenkov .
NUCLEIC ACIDS RESEARCH, 2012, 40 (W1) :W573-W579
[4]   Ontologizer 2.0 - a multifunctional tool for GO term enrichment analysis and data exploration [J].
Bauer, Sebastian ;
Grossmann, Steffen ;
Vingron, Martin ;
Robinson, Peter N. .
BIOINFORMATICS, 2008, 24 (14) :1650-1651
[5]   Bacterial fatty acid biosynthesis: Targets for antibacterial drug discovery [J].
Campbell, JW ;
Cronan, JE .
ANNUAL REVIEW OF MICROBIOLOGY, 2001, 55 :305-332
[6]   Human T cell epitopes of Mycobacterium tuberculosis are evolutionarily hyperconserved [J].
Comas, Inaki ;
Chakravartti, Jaidip ;
Small, Peter M. ;
Galagan, James ;
Niemann, Stefan ;
Kremer, Kristin ;
Ernst, Joel D. ;
Gagneux, Sebastien .
NATURE GENETICS, 2010, 42 (06) :498-U41
[7]   Mycobacterium africanum-Review of an Important Cause of Human Tuberculosis in West Africa [J].
de Jong, Bouke C. ;
Antonio, Martin ;
Gagneux, Sebastien .
PLOS NEGLECTED TROPICAL DISEASES, 2010, 4 (09)
[8]   Contribution of the Mycobacterium tuberculosis MmpL protein family to virulence and drug resistance [J].
Domenech, P ;
Reed, MB ;
Barry, CE .
INFECTION AND IMMUNITY, 2005, 73 (06) :3492-3501
[9]   An efficient algorithm for large-scale detection of protein families [J].
Enright, AJ ;
Van Dongen, S ;
Ouzounis, CA .
NUCLEIC ACIDS RESEARCH, 2002, 30 (07) :1575-1584
[10]  
FELSENSTEIN J, 1985, EVOLUTION, V39, P783, DOI 10.1111/j.1558-5646.1985.tb00420.x