Evolution of Pan-Genomes of Escherichia coli, Shigella spp., and Salmonella enterica

Cited by: 84
Authors
Gordienko, Evgeny N. [1]
Kazanov, Marat D. [2]
Gelfand, Mikhail S. [2, 3]
Affiliations
[1] Russian Acad Sci, NI Vavilov Inst Gen Genet, Moscow, Russia
[2] Russian Acad Sci, AA Kharkevich Inst Informat Transmiss Problems, Moscow, Russia
[3] Moscow MV Lomonosov State Univ, Fac Bioengn & Bioinformat, Moscow, Russia
Keywords
BACILLUS-ANTHRACIS; SOIL BACTERIA; GENE; PANGENOME; INSIGHTS; COMMENSAL; SEQUENCE; STRAINS; LOCUS; SIZE;
DOI
10.1128/JB.02285-12
Chinese Library Classification (CLC) number
Q93 [Microbiology];
Discipline classification code
071005 ; 100705 ;
Abstract
Multiple sequencing of genomes belonging to a bacterial species allows one to analyze and compare statistics and dynamics of the gene complements of species, their pan-genomes. Here, we analyzed multiple genomes of Escherichia coli, Shigella spp., and Salmonella enterica. We demonstrate that the distribution of the number of genomes harboring a gene is well approximated by a sum of two power functions, describing frequent genes (present in many strains) and rare genes (present in few strains). The virtual absence of Shigella-specific genes not present in E. coli genomes confirms previous observations that Shigella is not an independent genus. While the pan-genome size is increasing with each new strain, the number of genes present in a fixed fraction of strains stabilizes quickly. For instance, slightly fewer than 4,000 genes are present in at least half of any group of E. coli genomes. Comparison of S. enterica and E. coli pan-genomes revealed the existence of a common periphery, that is, genes present in some but not all strains of both species. Analysis of phylogenetic trees demonstrates that rare genes from the periphery likely evolve under horizontal transfer, whereas frequent periphery genes may have been inherited from the periphery genome of the common ancestor.
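The quantities named in the abstract (the number of genomes harboring each gene, the pan-genome size, and the genes present in a fixed fraction of strains) can be illustrated with a minimal sketch. The gene and strain names below are toy data, and the function names are hypothetical; this is not the authors' pipeline, and it does not attempt the two-power-function fit.

```python
from collections import Counter


def gene_frequency_spectrum(presence):
    """Count how many genes occur in exactly k strains.

    presence: dict mapping a gene name to the set of strains carrying it
    (a hypothetical input format chosen for this sketch).
    Returns a Counter mapping k -> number of genes found in exactly k strains.
    """
    return Counter(len(strains) for strains in presence.values() if strains)


def genes_in_fraction(presence, n_strains, fraction):
    """Return the sorted genes present in at least `fraction` of the strains."""
    threshold = fraction * n_strains
    return sorted(g for g, s in presence.items() if len(s) >= threshold)


# Toy presence/absence data: 5 genes across 4 strains (made up for illustration).
toy = {
    "geneA": {"s1", "s2", "s3", "s4"},
    "geneB": {"s1", "s2", "s3"},
    "geneC": {"s1", "s2"},
    "geneD": {"s4"},
    "geneE": {"s3"},
}
n_strains = 4

spectrum = gene_frequency_spectrum(toy)
pan_genome_size = sum(spectrum.values())          # every gene seen in any strain
core = genes_in_fraction(toy, n_strains, 1.0)     # genes present in all strains
half = genes_in_fraction(toy, n_strains, 0.5)     # genes in at least half

print(dict(spectrum), pan_genome_size, core, half)
```

In this toy set, the genes found in one or two strains play the role of the rare periphery, while the genes found in most or all strains correspond to the frequent component described in the abstract.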
Pages: 2786-2792
Number of pages: 7