Genome-Wide Association Studies (GWAS) in microbial organisms have the potential to vastly improve the way we understand, manage, and treat infectious diseases. Yet, microbial GWAS methods established thus far remain insufficiently able to capitalise on the growing wealth of bacterial and viral genetic sequence data. Facing clonal population structure and homologous recombination, existing GWAS methods struggle to achieve both the precision necessary to reject spurious findings and the power required to detect associations in microbes. In this paper, we introduce a novel phylogenetic approach that has been tailor-made for microbial GWAS, which is applicable to organisms ranging from purely clonal to frequently recombining, and to both binary and continuous phenotypes. Our approach is robust to the confounding effects of both population structure and recombination, while maintaining high statistical power to detect associations. Thorough testing via application to simulated data provides strong support for the power and specificity of our approach and demonstrates the advantages offered over alternative cluster-based and dimension-reduction methods. Two applications to Neisseria meningitidis illustrate the versatility and potential of our method, confirming previously-identified penicillin resistance loci and resulting in the identification of both well-characterised and novel drivers of invasive disease. Our method is implemented as an open-source R package called treeWAS which is freely available at https://github.com/caitiecollins/treeWAS.
机构:
Univ Birmingham, Sch Biosci, Birmingham, W Midlands, England
Kingston Univ, Sch Life Sci, Kingston upon Thames KT1 2EE, Surrey, EnglandUniv Birmingham, Sch Biosci, Birmingham, W Midlands, England
Snyder, Lori A. S.
;
Cole, Jeff A.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Birmingham, Sch Biosci, Birmingham, W Midlands, EnglandUniv Birmingham, Sch Biosci, Birmingham, W Midlands, England
Cole, Jeff A.
;
Pallen, Mark J.
论文数: 0引用数: 0
h-index: 0
机构:Univ Birmingham, Sch Biosci, Birmingham, W Midlands, England
机构:
Univ Calif Davis, Dept Biol Chem, Rowe Program Human Genet, Davis, CA 95616 USA
Univ Calif Davis, Dept Med, Davis, CA 95616 USAUniv Calif Davis, Dept Biol Chem, Rowe Program Human Genet, Davis, CA 95616 USA
Tian, Chao
;
Gregersen, Peter K.
论文数: 0引用数: 0
h-index: 0
机构:
N Shore LIJ Hlth Syst, Feinstein Inst Med Res, Robert S Boas Ctr Genom & Human Genet, Manhasset, NY 11030 USAUniv Calif Davis, Dept Biol Chem, Rowe Program Human Genet, Davis, CA 95616 USA
Gregersen, Peter K.
;
Seldin, Michael F.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Davis, Dept Biol Chem, Rowe Program Human Genet, Davis, CA 95616 USA
Univ Calif Davis, Dept Med, Davis, CA 95616 USAUniv Calif Davis, Dept Biol Chem, Rowe Program Human Genet, Davis, CA 95616 USA
机构:
Univ Birmingham, Sch Biosci, Birmingham, W Midlands, England
Kingston Univ, Sch Life Sci, Kingston upon Thames KT1 2EE, Surrey, EnglandUniv Birmingham, Sch Biosci, Birmingham, W Midlands, England
Snyder, Lori A. S.
;
Cole, Jeff A.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Birmingham, Sch Biosci, Birmingham, W Midlands, EnglandUniv Birmingham, Sch Biosci, Birmingham, W Midlands, England
Cole, Jeff A.
;
Pallen, Mark J.
论文数: 0引用数: 0
h-index: 0
机构:Univ Birmingham, Sch Biosci, Birmingham, W Midlands, England
机构:
Univ Calif Davis, Dept Biol Chem, Rowe Program Human Genet, Davis, CA 95616 USA
Univ Calif Davis, Dept Med, Davis, CA 95616 USAUniv Calif Davis, Dept Biol Chem, Rowe Program Human Genet, Davis, CA 95616 USA
Tian, Chao
;
Gregersen, Peter K.
论文数: 0引用数: 0
h-index: 0
机构:
N Shore LIJ Hlth Syst, Feinstein Inst Med Res, Robert S Boas Ctr Genom & Human Genet, Manhasset, NY 11030 USAUniv Calif Davis, Dept Biol Chem, Rowe Program Human Genet, Davis, CA 95616 USA
Gregersen, Peter K.
;
Seldin, Michael F.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Davis, Dept Biol Chem, Rowe Program Human Genet, Davis, CA 95616 USA
Univ Calif Davis, Dept Med, Davis, CA 95616 USAUniv Calif Davis, Dept Biol Chem, Rowe Program Human Genet, Davis, CA 95616 USA