Contingency, repeatability, and predictability in the evolution of a prokaryotic pangenome

Cited by: 19
Authors
Beavan, Alan [1]
Domingo-Sananes, Maria Rosa
McInerney, James O. [1]
Affiliations
[1] Univ Nottingham, Sch Life Sci, Nottingham NG7 2UH, England
Funding
Biotechnology and Biological Sciences Research Council (UK);
Keywords
pangenomes; machine learning; evolution; ESCHERICHIA-COLI; GENE; SELECTION; SEQUENCE; TREE; KEGG;
DOI
10.1073/pnas.2304934120
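Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code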
07 ; 0710 ; 09 ;
Abstract
Pangenomes exhibit remarkable variability in many prokaryotic species, much of which is maintained through the processes of horizontal gene transfer and gene loss. Repeated acquisitions of near-identical homologs can easily be observed across pangenomes, leading to the question of whether these parallel events potentiate similar evolutionary trajectories, or whether the remarkably different genetic backgrounds of the recipients mean that postacquisition evolutionary trajectories end up being quite different. In this study, we present a machine learning method that predicts the presence or absence of genes in the Escherichia coli pangenome based on complex patterns of the presence or absence of other accessory genes within a genome. Our analysis leverages the repeated transfer of genes through the E. coli pangenome to observe patterns of repeated evolution following similar events. We find that the presence or absence of a substantial set of genes is highly predictable from other genes alone, indicating that selection potentiates and maintains gene-gene co-occurrence and avoidance relationships deterministically over long-term bacterial evolution and is robust to differences in host evolutionary history. We propose that at least part of the pangenome can be understood as a set of genes with relationships that govern their likely cohabitants, analogous to an ecosystem's set of interacting organisms. Our findings indicate that intragenomic gene fitness effects may be key drivers of prokaryotic evolution, influencing the repeated emergence of complex gene-gene relationships across the pangenome.
Pages: 10