Estimating the size of the bacterial pan-genome

被引:225
作者
Lapierrel, Pascal [1 ]
Gogarten, J. Peter [2 ]
机构
[1] Univ Connecticut, Ctr Biotechnol, Storrs, CT 06269 USA
[2] Univ Connecticut, Dept Mol & Cell Biol, Storrs, CT 06269 USA
关键词
PROTEIN FAMILIES; EVOLUTION; TRANSPORTERS; PATTERNS; GENES;
D O I
10.1016/j.tig.2008.12.004
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The 'pan-genome' denotes the set of all genes present in the genomes of a group of organisms. Here, we extend the pan-genome concept to higher taxonomic units. Using 573 sequenced genomes, we estimate the size of the bacterial pan-genome based on the frequency of occurrences of genes among sampled genomes. Using gene- and genome-centered approaches, we characterize three distinct pools of gene families that comprise the bacterial pan-genome, each evolving under different evolutionary constraints. Our findings indicate that the pan-genome of the bacterial domain is of infinite size (the Bacteria as a whole have an open pan-genome) and that similar to 250 genes per genome belong to the extended bacterial core genome.
引用
收藏
页码:107 / 110
页数:4
相关论文
共 24 条
[21]   Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae:: Implications for the microbial "pan-genome" [J].
Tettelin, H ;
Masignani, V ;
Cieslewicz, MJ ;
Donati, C ;
Medini, D ;
Ward, NL ;
Angiuoli, SV ;
Crabtree, J ;
Jones, AL ;
Durkin, AS ;
DeBoy, RT ;
Davidsen, TM ;
Mora, M ;
Scarselli, M ;
Ros, IMY ;
Peterson, JD ;
Hauser, CR ;
Sundaram, JP ;
Nelson, WC ;
Madupu, R ;
Brinkac, LM ;
Dodson, RJ ;
Rosovitz, MJ ;
Sullivan, SA ;
Daugherty, SC ;
Haft, DH ;
Selengut, J ;
Gwinn, ML ;
Zhou, LW ;
Zafar, N ;
Khouri, H ;
Radune, D ;
Dimitrov, G ;
Watkins, K ;
O'Connor, KJB ;
Smith, S ;
Utterback, TR ;
White, O ;
Rubens, CE ;
Grandi, G ;
Madoff, LC ;
Kasper, DL ;
Telford, JL ;
Wessels, MR ;
Rappuoli, R ;
Fraser, CM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (39) :13950-13955
[22]   Polyketide biosynthesis: understanding and exploiting modularity [J].
Weissman, KJ .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2004, 362 (1825) :2671-2690
[23]   The Sorcerer II Global Ocean Sampling expedition:: Expanding the universe of protein families [J].
Yooseph, Shibu ;
Sutton, Granger ;
Rusch, Douglas B. ;
Halpern, Aaron L. ;
Williamson, Shannon J. ;
Remington, Karin ;
Eisen, Jonathan A. ;
Heidelberg, Karla B. ;
Manning, Gerard ;
Li, Weizhong ;
Jaroszewski, Lukasz ;
Cieplak, Piotr ;
Miller, Christopher S. ;
Li, Huiying ;
Mashiyama, Susan T. ;
Joachimiak, Marcin P. ;
van Belle, Christopher ;
Chandonia, John-Marc ;
Soergel, David A. ;
Zhai, Yufeng ;
Natarajan, Kannan ;
Lee, Shaun ;
Raphael, Benjamin J. ;
Bafna, Vineet ;
Friedman, Robert ;
Brenner, Steven E. ;
Godzik, Adam ;
Eisenberg, David ;
Dixon, Jack E. ;
Taylor, Susan S. ;
Strausberg, Robert L. ;
Frazier, Marvin ;
Venter, J. Craig .
PLOS BIOLOGY, 2007, 5 (03) :432-466
[24]   Protein sequence similarity searches using patterns as seeds [J].
Zhang, Z ;
Schaffer, AA ;
Miller, W ;
Madden, TL ;
Lipman, DJ ;
Koonin, EV ;
Altschul, SF .
NUCLEIC ACIDS RESEARCH, 1998, 26 (17) :3986-3990