Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins

Cited by: 1648
Authors
Croucher, Nicholas J. [1 ,2 ,3 ]
Page, Andrew J. [1 ]
Connor, Thomas R. [1 ,4 ]
Delaney, Aidan J. [5 ]
Keane, Jacqueline A. [1 ]
Bentley, Stephen D. [1 ,6 ]
Parkhill, Julian [1 ]
Harris, Simon R. [1 ]
Affiliations
[1] Wellcome Trust Sanger Inst, Pathogen Genom, Cambridge CB10 1SA, England
[2] Harvard Univ, Sch Publ Hlth, Ctr Commun Dis Dynam, Boston, MA 02115 USA
[3] Univ London Imperial Coll Sci Technol & Med, Dept Infect Dis Epidemiol, London W2 1PG, England
[4] Cardiff Sch Biosci, Cardiff CF10 3AX, S Glam, Wales
[5] Univ Brighton, Sch Comp Engn & Math, Brighton BN2 4GJ, E Sussex, England
[6] Univ Cambridge, Addenbrookes Hosp, Dept Med, Cambridge CB2 0SP, England
Funding
Wellcome Trust (UK)
Keywords
STAPHYLOCOCCUS-AUREUS; EVOLUTION; TRANSMISSION; CHROMOSOME; INFERENCE; DYNAMICS; CLONE;
DOI
10.1093/nar/gku1196
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification codes
071010 ; 081704 ;
Abstract
The emergence of new sequencing technologies has facilitated the use of bacterial whole genome alignments for evolutionary studies and outbreak analyses. These datasets, of increasing size, often include examples of multiple different mechanisms of horizontal sequence transfer resulting in substantial alterations to prokaryotic chromosomes. The impact of these processes demands rapid and flexible approaches able to account for recombination when reconstructing isolates' recent diversification. Gubbins is an iterative algorithm that uses spatial scanning statistics to identify loci containing elevated densities of base substitutions suggestive of horizontal sequence transfer while concurrently constructing a maximum likelihood phylogeny based on the putative point mutations outside these regions of high sequence diversity. Simulations demonstrate the algorithm generates highly accurate reconstructions under realistically parameterized models of bacterial evolution, and achieves convergence in only a few hours on alignments of hundreds of bacterial genome sequences. Gubbins is appropriate for reconstructing the recent evolutionary history of a variety of haploid genotype alignments, as it makes no assumptions about the underlying mechanism of recombination. The software is freely available for download at github.com/sanger-pathogens/Gubbins, implemented in Python and C and supported on Linux and Mac OS X.
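The abstract's idea of scanning for loci with elevated densities of base substitutions can be illustrated with a minimal sketch. This is not the Gubbins implementation: the function names, the fixed non-overlapping windows, the binomial null model and the Bonferroni correction are all assumptions chosen for illustration, and the iterative tree rebuilding step is omitted.

```python
from math import comb


def binomial_tail(k, n, p):
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def scan_for_dense_windows(snp_positions, genome_length, window=100, alpha=0.05):
    """Flag windows whose substitution count is unexpectedly high.

    Hypothetical helper: the null model assumes substitutions fall
    independently at the genome-wide average per-base rate.
    Returns a list of (start, end, count, p_value) tuples.
    """
    positions = sorted(set(snp_positions))
    background = len(positions) / genome_length  # per-base rate under the null
    n_windows = max(1, genome_length // window)  # non-overlapping windows
    threshold = alpha / n_windows  # Bonferroni correction across windows
    hits = []
    for w in range(n_windows):
        start, end = w * window, (w + 1) * window
        count = sum(start <= pos < end for pos in positions)
        if count == 0:
            continue
        p_value = binomial_tail(count, window, background)
        if p_value < threshold:
            hits.append((start, end, count, p_value))
    return hits


# Toy data: sparse background substitutions plus one dense cluster.
background_snps = list(range(250, 10000, 500))
cluster_snps = list(range(3000, 3030, 2))
hits = scan_for_dense_windows(background_snps + cluster_snps, 10000)
for start, end, count, p_value in hits:
    print(f"window {start}-{end}: {count} substitutions, p = {p_value:.3g}")
```

In a full iterative scheme, flagged regions would be excluded before the phylogeny is re-estimated from the remaining putative point mutations, and the scan would then be repeated on the updated tree.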
Pages: 13