A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination

被引:147
作者
Collins, Caitlin [1 ]
Didelot, Xavier [1 ]
机构
[1] Imperial Coll London, Dept Infect Dis Epidemiol, London, England
基金
英国生物技术与生命科学研究理事会;
关键词
NEISSERIA-MENINGITIDIS; ANTIBIOTIC-RESISTANCE; PENICILLIN RESISTANCE; HEMOGLOBIN RECEPTOR; EVOLUTIONARY TREES; GENETIC-BASIS; INFERENCE; EMERGENCE; PROTEIN; SPREAD;
D O I
10.1371/journal.pcbi.1005958
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Genome-Wide Association Studies (GWAS) in microbial organisms have the potential to vastly improve the way we understand, manage, and treat infectious diseases. Yet, microbial GWAS methods established thus far remain insufficiently able to capitalise on the growing wealth of bacterial and viral genetic sequence data. Facing clonal population structure and homologous recombination, existing GWAS methods struggle to achieve both the precision necessary to reject spurious findings and the power required to detect associations in microbes. In this paper, we introduce a novel phylogenetic approach that has been tailor-made for microbial GWAS, which is applicable to organisms ranging from purely clonal to frequently recombining, and to both binary and continuous phenotypes. Our approach is robust to the confounding effects of both population structure and recombination, while maintaining high statistical power to detect associations. Thorough testing via application to simulated data provides strong support for the power and specificity of our approach and demonstrates the advantages offered over alternative cluster-based and dimension-reduction methods. Two applications to Neisseria meningitidis illustrate the versatility and potential of our method, confirming previously-identified penicillin resistance loci and resulting in the identification of both well-characterised and novel drivers of invasive disease. Our method is implemented as an open-source R package called treeWAS which is freely available at https://github.com/caitiecollins/treeWAS.
引用
收藏
页数:21
相关论文
共 78 条
[31]   TOWARD DEFINING COURSE OF EVOLUTION - MINIMUM CHANGE FOR A SPECIFIC TREE TOPOLOGY [J].
FITCH, WM .
SYSTEMATIC ZOOLOGY, 1971, 20 (04) :406-&
[32]   BIONJ: An improved version of the NJ algorithm based on a simple model of sequence data [J].
Gascuel, O .
MOLECULAR BIOLOGY AND EVOLUTION, 1997, 14 (07) :685-695
[33]   Evolutionary Genomics of Staphylococcus aureus Reveals Insights into the Origin and Molecular Basis of Ruminant Host Adaptation [J].
Guinane, Caitriona M. ;
Ben Zakour, Nouri L. ;
Tormo-Mas, Maria A. ;
Weinert, Lucy A. ;
Lowder, Bethan V. ;
Cartwright, Robyn A. ;
Smyth, Davida S. ;
Smyth, Cyril J. ;
Lindsay, Jodi A. ;
Gould, Katherine A. ;
Witney, Adam ;
Hinds, Jason ;
Bollback, Jonathan P. ;
Rambaut, Andrew ;
Penades, Jose R. ;
Fitzgerald, J. Ross .
GENOME BIOLOGY AND EVOLUTION, 2010, 2 :454-466
[34]   Complement factor H variant increases the risk of age-related macular degeneration [J].
Haines, JL ;
Hauser, MA ;
Schmidt, S ;
Scott, WK ;
Olson, LM ;
Gallins, P ;
Spencer, KL ;
Kwan, SY ;
Noureddine, M ;
Gilbert, JR ;
Schnetz-Boutaud, N ;
Agarwal, A ;
Postel, EA ;
Pericak-Vance, MA .
SCIENCE, 2005, 308 (5720) :419-421
[35]   Epidemiological Evidence for the Role of the Hemoglobin Receptor, HmbR, in Meningococcal Virulence [J].
Harrison, Odile B. ;
Evans, Nicholas J. ;
Blair, Jessica M. ;
Grimes, Holly S. ;
Tinsley, Colin R. ;
Nassif, Xavier ;
Kriz, Paula ;
Ure, Roisin ;
Gray, Steve J. ;
Derrick, Jeremy P. ;
Maiden, Martin C. J. ;
Feavers, Ian M. .
JOURNAL OF INFECTIOUS DISEASES, 2009, 200 (01) :94-98
[36]   Cellular and molecular biology of Neisseria meningitidis colonization and invasive disease [J].
Hill, Darryl J. ;
Griffiths, Natalie J. ;
Borodina, Elena ;
Virji, Mumtaz .
CLINICAL SCIENCE, 2010, 118 (9-10) :547-564
[37]   A genomic portrait of the emergence, evolution, and global spread of a methicillin-resistant Staphylococcus aureus pandemic [J].
Holden, Matthew T. G. ;
Hsu, Li-Yang ;
Kurt, Kevin ;
Weinert, Lucy A. ;
Mather, Alison E. ;
Harris, Simon R. ;
Strommenger, Birgit ;
Layer, Franziska ;
Witte, Wolfgang ;
de Lencastre, Herminia ;
Skov, Robert ;
Westh, Henrik ;
Zemlickova, Helena ;
Coombs, Geoffrey ;
Kearns, Angela M. ;
Hill, Robert L. R. ;
Edgeworth, Jonathan ;
Gould, Ian ;
Gant, Vanya ;
Cooke, Jonathan ;
Edwards, Giles F. ;
McAdam, Paul R. ;
Templeton, Kate E. ;
McCann, Angela ;
Zhou, Zhemin ;
Castillo-Ramirez, Santiago ;
Feil, Edward J. ;
Hudson, Lyndsey O. ;
Enright, Mark C. ;
Balloux, Francois ;
Aanensen, David M. ;
Spratt, Brian G. ;
Fitzgerald, J. Ross ;
Parkhill, Julian ;
Achtman, Mark ;
Bentley, Stephen D. ;
Nuebel, Ulrich .
GENOME RESEARCH, 2013, 23 (04) :653-664
[38]   BIGSdb: Scalable analysis of bacterial genome variation at the population level [J].
Jolley, Keith A. ;
Maiden, Martin C. J. .
BMC BIOINFORMATICS, 2010, 11
[39]   Discriminant analysis of principal components: a new method for the analysis of genetically structured populations [J].
Jombart, Thibaut ;
Devillard, Sebastien ;
Balloux, Francois .
BMC GENETICS, 2010, 11
[40]  
Kiechle FL, 2004, ARCH PATHOL LAB MED, V128, P1337