Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches

被引:5
作者
Sassi, Mohamed [1 ]
Bronsard, Julie [1 ]
Pascreau, Gaetan [1 ]
Emily, Mathieu [2 ]
Donnio, Pierre-Yves [1 ,3 ]
Revest, Matthieu [1 ,4 ]
Felden, Brice [1 ]
Wirth, Thierry [5 ,6 ]
Augagneur, Yoann [1 ]
机构
[1] INSERM, BRM Bacterial Regulatory RNAs & Med UMR S 1230, Rennes, France
[2] Univ Rennes, CNRS, Inst Agro, IRMAR Inst Rech Math Rennes UMR 6625, Rennes, France
[3] CHU Rennes, Serv Bacteriol Hyg Hosp, Rennes, France
[4] CHU Rennes, Serv Malad Infect & Reanimat Med, Rennes, France
[5] Univ Paris 06, Univ Antilles, Ecole Prat Hautes Etud, Museum Natl Hist Nat,UMR CNRS 7205,Inst Systemat, Paris, France
[6] PSL Univ, Ecole Prat Hautes Etud EPHE, Paris, France
关键词
Staphylococcus aureus; genomics; GWAS; coding and noncoding regions; random forest; trancriptomics; genetic markers; bacteremia; nasal colonization; METHICILLIN-RESISTANT; REGULATORY RNA; NASAL CARRIAGE; VIRULENCE; BACTEREMIA; EPIDEMIOLOGY; MECHANISMS; EXPRESSION; PATHOPHYSIOLOGY; CONTRIBUTES;
D O I
10.1128/msystems.00378-22
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Staphylococcus aureus is a major human and animal pathogen, colonizing diverse ecological niches within its hosts. Predicting whether an isolate will infect a specific host and its subsequent clinical fate remains unknown. In this study, we investigated the S. aureus pangenome using a curated set of 356 strains, spanning a wide range of hosts, origins, and clinical display and antibiotic resistance profiles. We used genome-wide association study (GWAS) and random forest (RF) algorithms to discriminate strains based on their origins and clinical sources. Here, we show that the presence of sak and scn can discriminate strains based on their host specificity, while other genes such as mecA are often associated with virulent outcomes. Both GWAS and RF indicated the importance of intergenic regions (IGRs) and coding DNA sequence (CDS) but not sRNAs in forecasting an outcome. Additional transcriptomic analyses performed on the most prevalent clonal complex 8 (CC8) clonal types, in media mimicking nasal colonization or bacteremia, indicated three RNAs as potential RNA markers to forecast infection, followed by 30 others that could serve as infection severity predictors. Our report shows that genetic association and transcriptomics are complementary approaches that will be combined in a single analytical framework to improve our understanding of bacterial pathogenesis and ultimately identify potential predictive molecular markers. IMPORTANCE Predicting the outcome of bacterial colonization and infections, based on extensive genomic and transcriptomic data from a given pathogen, would be of substantial help for clinicians in treating and curing patients. In this report, genome-wide association studies and random forest algorithms have defined gene combinations that differentiate human from animal strains, colonization from diseases, and nonsevere from severe diseases, while it revealed the importance of IGRs and CDS, but not small RNAs (sRNAs), in anticipating an outcome. In addition, transcriptomic analyses performed on the most prevalent clonal types, in media mimicking either nasal colonization or bacteremia, revealed significant differences and therefore potent RNA markers. Overall, the use of both genomic and transcriptomic data in a single analytical framework can enhance our understanding of bacterial pathogenesis.
引用
收藏
页数:21
相关论文
共 86 条
  • [11] Staphylococcus aureus infective endocarditis versus bacteremia strains: Subtle genetic differences at stake
    Bouchiat, Coralie
    Moreau, Karen
    Devillard, Sebastien
    Rasigade, Jean-Philippe
    Mosnier, Amandine
    Geissmann, Tom
    Bes, Michele
    Tristan, Anne
    Lina, Gerard
    Laurent, Frederic
    Piroth, Lionel
    Aissa, Nejla
    Duval, Xavier
    Le Moing, Vincent
    Vandenesch, Francois
    [J]. INFECTION GENETICS AND EVOLUTION, 2015, 36 : 524 - 530
  • [12] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [13] sRNA and cis-antisense sRNA identification in Staphylococcus aureus highlights an unusual sRNA gene cluster with one encoding a secreted peptide
    Bronsard, Julie
    Pascreau, Gaetan
    Sassi, Mohamed
    Mauro, Tony
    Augagneur, Yoann
    Felden, Brice
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [14] Rapid scoring of genes in microbial pan-genome-wide association studies with Scoary
    Brynildsrud, Ola
    Bohlin, Jon
    Scheffer, Lonneke
    Eldholm, Vegard
    [J]. GENOME BIOLOGY, 2016, 17
  • [15] Genomic and transcriptomic comparison between Staphylococcus aureus strains associated with high and low within herd prevalence of intra-mammary infection
    Capra, E.
    Cremonesi, P.
    Pietrelli, A.
    Puccio, S.
    Luini, M.
    Stella, A.
    Castiglioni, B.
    [J]. BMC MICROBIOLOGY, 2017, 17 : 1 - 16
  • [16] Genome and pan-genome analysis to classify emerging bacteria
    Caputo, Aurelia
    Fournier, Pierre-Edouard
    Raoult, Didier
    [J]. BIOLOGY DIRECT, 2019, 14 (1)
  • [17] Random forests for genomic data analysis
    Chen, Xi
    Ishwaran, Hemant
    [J]. GENOMICS, 2012, 99 (06) : 323 - 329
  • [18] HomoplasyFinder: a simple tool to identify homoplasies on a phylogeny
    Crispell, Joseph
    Balaz, Daniel
    Gordon, Stephen Vincent
    [J]. MICROBIAL GENOMICS, 2019, 5 (01):
  • [19] jModelTest 2: more models, new heuristics and parallel computing
    Darriba, Diego
    Taboada, Guillermo L.
    Doallo, Ramon
    Posada, David
    [J]. NATURE METHODS, 2012, 9 (08) : 772 - 772
  • [20] Natural mutations in a Staphylococcus aureus virulence regulator attenuate cytotoxicity but permit bacteremia and abscess formation
    Dasa, Sudip
    Lindemann, Claudia
    Young, Bernadette C.
    Muller, Julius
    Osterreich, Babett
    Ternette, Nicola
    Winkler, Ann-Cathrin
    Paprotka, Kerstin
    Reinhardt, Richard
    Foerstner, Konrad U.
    Allen, Elizabeth
    Flaxman, Amy
    Yamaguchi, Yuko
    Rollier, Christine S.
    van Diemen, Pauline
    Blaettner, Sebastian
    Remmele, Christian W.
    Selle, Martina
    Dittrich, Marcus
    Mueller, Tobias
    Vogel, Joerg
    Ohlsen, Knut
    Crook, Derrick W.
    Massey, Ruth
    Wilson, Daniel J.
    Rudel, Thomas
    Wyllie, David H.
    Fraunholz, Martin J.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (22) : E3101 - E3110