Predicting hosts and cross-species transmission of Streptococcus agalactiae by interpretable machine learning

Cited by: 2
Authors
Ren, Yunxiao [1]
Li, Carmen [2]
Sapugahawatte, Dulmini Nanayakkara [2]
Zhu, Chendi [2]
Spaenig, Sebastian [1]
Jamrozy, Dorota [3]
Rothen, Julian [4, 5]
Daubenberger, Claudia A. [4, 5]
Bentley, Stephen D. [3]
Ip, Margaret [2]
Heider, Dominik [1, 6, 7]
Affiliations
[1] Philipps Univ Marburg, Fac Math & Comp Sci, Dept Data Sci Biomed, Marburg, Germany
[2] Chinese Univ Hong Kong, Fac Med, Dept Microbiol, Hong Kong, Peoples R China
[3] Wellcome Sanger Inst, Parasites & Microbes Programme, Wellcome Genome Campus, Cambridge, England
[4] Swiss Trop & Publ Hlth Inst Swiss TPH Basel, Dept Med Parasitol & Infect Biol, CH-4002 Basel, Switzerland
[5] Univ Basel, CH-4002 Basel, Switzerland
[6] Univ Dusseldorf, Inst Comp Sci, D-40211 Dusseldorf, Germany
[7] Heinrich Heine Univ Dusseldorf, Ctr Digital Hlth, Moorenstr 5, D-40225 Dusseldorf, Germany
Keywords
Hosts prediction; Host adaptations; Cross-species transmission; Interpretable machine learning; GROUP-B STREPTOCOCCUS; SEQUENCE TYPE 283; FISH
DOI
10.1016/j.compbiomed.2024.108185
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Background: Streptococcus agalactiae, commonly known as Group B Streptococcus (GBS), has a broad host range and acts as both a beneficial commensal and an opportunistic pathogen across various species. In humans, it causes neonatal sepsis and meningitis, as well as severe infections in adults. In livestock, it causes mastitis in bovines and contributes to epidemic mortality in fish populations. Despite this wide host spectrum, the mechanisms that enable GBS to adapt to specific hosts remain poorly understood. A rapid and accurate method that differentiates GBS strains associated with particular animal hosts based on genome-wide information therefore holds immense potential. Such a tool would not only support identification and containment during GBS outbreaks but also deepen our understanding of the bacterium's host adaptations across humans, livestock, and other natural animal reservoirs.
Methods and results: Here, we developed three machine learning models, random forest (RF), logistic regression (LR), and support vector machine (SVM), based on genome-wide mutation data. These models predicted the host origin of GBS precisely, distinguishing between human, bovine, fish, and pig hosts. Moreover, we conducted an interpretable machine learning analysis using SHapley Additive exPlanations (SHAP) and variant annotation to uncover the most influential genomic features and associated genes for each host. Additionally, by examining misclassified samples, we gained insights into the dynamics of host transmission and the potential for zoonotic infections.
Conclusions: Our study underscores the effectiveness of RF and LR models based on mutation data for accurately predicting GBS host origins. Additionally, we identify the key features associated with each GBS host, thereby enhancing our understanding of the bacterium's host-specific adaptations.
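To make the interpretability step more concrete, the following is a minimal sketch of the Shapley value idea that SHAP builds on. It is not the authors' pipeline: the scoring function `toy_bovine_score`, the feature names `snp_A`, `snp_B`, `snp_C`, and all numeric weights are hypothetical, and a mutation that is "absent" is simply left out of the coalition, whereas SHAP itself marginalizes missing features over background data and uses faster approximations for real models.

```python
from itertools import combinations
from math import factorial


def shapley_values(value_fn, features):
    """Exact Shapley values for a set function over a small feature list.

    value_fn takes a frozenset of "present" feature names and returns a score.
    The cost grows exponentially with len(features); illustration only.
    """
    n = len(features)
    phi = {}
    for f in features:
        others = [g for g in features if g != f]
        total = 0.0
        for k in range(n):
            # Weight of a coalition of size k that does not contain f.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = frozenset(subset)
                # Marginal contribution of f when added to coalition s.
                total += weight * (value_fn(s | {f}) - value_fn(s))
        phi[f] = total
    return phi


def toy_bovine_score(present):
    """Hypothetical score that an isolate is bovine-associated."""
    score = 0.1  # baseline with no mutations present (assumed)
    if "snp_A" in present:
        score += 0.4
    if "snp_B" in present:
        score += 0.2
    if "snp_A" in present and "snp_C" in present:
        score += 0.2  # assumed interaction between two mutations
    return score


features = ["snp_A", "snp_B", "snp_C"]
phi = shapley_values(toy_bovine_score, features)
for name in features:
    print(name, round(phi[name], 3))
```

In this toy setting the attributions sum to the difference between the score with all three mutations present and the baseline score, and the interaction term is split evenly between `snp_A` and `snp_C`. Ranking features by such attributions per host class is the kind of output that can then be cross-referenced with variant annotation.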
Pages: 12