Estimating the size of the bacterial pan-genome

Times cited: 225
Authors
Lapierre, Pascal [1]
Gogarten, J. Peter [2]
Affiliations
[1] Univ Connecticut, Ctr Biotechnol, Storrs, CT 06269 USA
[2] Univ Connecticut, Dept Mol & Cell Biol, Storrs, CT 06269 USA
Keywords
PROTEIN FAMILIES; EVOLUTION; TRANSPORTERS; PATTERNS; GENES
DOI
10.1016/j.tig.2008.12.004
Chinese Library Classification
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
The 'pan-genome' denotes the set of all genes present in the genomes of a group of organisms. Here, we extend the pan-genome concept to higher taxonomic units. Using 573 sequenced genomes, we estimate the size of the bacterial pan-genome based on the frequency of occurrences of genes among sampled genomes. Using gene- and genome-centered approaches, we characterize three distinct pools of gene families that comprise the bacterial pan-genome, each evolving under different evolutionary constraints. Our findings indicate that the pan-genome of the bacterial domain is of infinite size (the Bacteria as a whole have an open pan-genome) and that approximately 250 genes per genome belong to the extended bacterial core genome.
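The notions of an open pan-genome and a core genome used in the abstract can be illustrated with a small accumulation-curve calculation. The following is a minimal sketch, not the authors' method: it assumes each genome has already been reduced to a set of gene-family identifiers, and the function name `accumulation_curves`, the genome labels, and the family names are all hypothetical. It averages, over random genome orderings, the number of families seen in at least one genome (pan-genome) and in every genome (core genome) as genomes are added one at a time.

```python
import random
from typing import Dict, List, Set, Tuple


def accumulation_curves(
    genomes: Dict[str, Set[str]], n_orders: int = 100, seed: int = 0
) -> Tuple[List[float], List[float]]:
    """Return mean pan- and core-genome sizes after adding 1..N genomes.

    `genomes` maps a genome label to its set of gene-family identifiers
    (hypothetical input format, assumed for this sketch).
    """
    rng = random.Random(seed)
    names = sorted(genomes)
    n = len(names)
    pan_totals = [0.0] * n
    core_totals = [0.0] * n

    for _ in range(n_orders):
        order = names[:]
        rng.shuffle(order)  # random order in which genomes are "sampled"
        pan: Set[str] = set()
        core = None  # no genome seen yet
        for i, name in enumerate(order):
            families = genomes[name]
            pan |= families  # union: families seen in any genome so far
            # intersection: families present in every genome so far
            core = set(families) if core is None else core & families
            pan_totals[i] += len(pan)
            core_totals[i] += len(core)

    pan_mean = [t / n_orders for t in pan_totals]
    core_mean = [t / n_orders for t in core_totals]
    return pan_mean, core_mean


# Toy data: two shared families plus genome-specific ones (all hypothetical).
toy = {
    "genome_A": {"fam_core1", "fam_core2", "fam_a1"},
    "genome_B": {"fam_core1", "fam_core2", "fam_b1"},
    "genome_C": {"fam_core1", "fam_core2", "fam_c1", "fam_c2"},
}
pan_curve, core_curve = accumulation_curves(toy)
print("pan:", pan_curve)
print("core:", core_curve)
```

In this picture, a pan-genome curve that keeps rising as more genomes are added corresponds to an open pan-genome, while the core curve levels off at the families shared by all sampled genomes. Real analyses would also need a gene-family clustering step, which this sketch takes as given.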
Pages: 107-110
Page count: 4
Related papers
24 in total
[1]   Ab initio gene identification: prokaryote genome annotation with GeneScan and GLIMMER [J].
Aggarwal, G ;
Ramaswamy, R .
JOURNAL OF BIOSCIENCES, 2002, 27 (01) :7-14
[2]   Start-up entities in the origin of new genes [J].
Daubin, V ;
Ochman, H .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2004, 14 (06) :616-619
[3]   The source of laterally transferred genes in bacterial genomes [J].
Daubin, V ;
Lerat, E ;
Perrière, G .
GENOME BIOLOGY, 2003, 4 (09) :R57
[4]   ATP-binding cassette transporters in bacteria [J].
Davidson, AL ;
Chen, J .
ANNUAL REVIEW OF BIOCHEMISTRY, 2004, 73 :241-268
[5]   An efficient algorithm for large-scale detection of protein families [J].
Enright, AJ ;
Van Dongen, S ;
Ouzounis, CA .
NUCLEIC ACIDS RESEARCH, 2002, 30 (07) :1575-1584
[6]   The microbial engines that drive Earth's biogeochemical cycles [J].
Falkowski, Paul G. ;
Fenchel, Tom ;
Delong, Edward F. .
SCIENCE, 2008, 320 (5879) :1034-1039
[7]   Whole-genome random sequencing and assembly of Haemophilus influenzae Rd [J].
Fleischmann, RD ;
Adams, MD ;
White, O ;
Clayton, RA ;
Kirkness, EF ;
Kerlavage, AR ;
Bult, CJ ;
Tomb, JF ;
Dougherty, BA ;
Merrick, JM ;
McKenney, K ;
Sutton, G ;
FitzHugh, W ;
Fields, C ;
Gocayne, JD ;
Scott, J ;
Shirley, R ;
Liu, LI ;
Glodek, A ;
Kelley, JM ;
Weidman, JF ;
Phillips, CA ;
Spriggs, T ;
Hedblom, E ;
Cotton, MD ;
Utterback, TR ;
Hanna, MC ;
Nguyen, DT ;
Saudek, DM ;
Brandon, RC ;
Fine, LD ;
Fritchman, JL ;
Fuhrmann, JL ;
Geoghagen, NSM ;
Gnehm, CL ;
McDonald, LA ;
Small, KV ;
Fraser, CM ;
Smith, HO ;
Venter, JC .
SCIENCE, 1995, 269 (5223) :496-512
[8]   Parallel evolution of ligand specificity between LacI/GalR family repressors and periplasmic sugar-binding proteins [J].
Fukami-Kobayashi, K ;
Tateno, Y ;
Nishikawa, K .
MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (02) :267-277
[9]   Horizontal gene transfer, genome innovation and evolution [J].
Gogarten, JP ;
Townsend, JP .
NATURE REVIEWS MICROBIOLOGY, 2005, 3 (09) :679-687
[10]   A hybrid clustering approach to recognition of protein families in 114 microbial genomes [J].
Harlow, TJ ;
Gogarten, JP ;
Ragan, MA .
BMC BIOINFORMATICS, 2004, 5 (1)