Predicting hosts and cross-species transmission of Streptococcus agalactiae by interpretable machine learning

Cited by: 2
Authors
Ren, Yunxiao [1]
Li, Carmen [2]
Sapugahawatte, Dulmini Nanayakkara [2]
Zhu, Chendi [2]
Spaenig, Sebastian [1]
Jamrozy, Dorota [3]
Rothen, Julian [4, 5]
Daubenberger, Claudia A. [4, 5]
Bentley, Stephen D. [3]
Ip, Margaret [2]
Heider, Dominik [1, 6, 7]
Affiliations
[1] Philipps Univ Marburg, Fac Math & Comp Sci, Dept Data Sci Biomed, Marburg, Germany
[2] Chinese Univ Hong Kong, Fac Med, Dept Microbiol, Hong Kong, Peoples R China
[3] Wellcome Sanger Inst, Parasites & Microbes Programme, Wellcome Genome Campus, Cambridge, England
[4] Swiss Trop & Publ Hlth Inst Swiss TPH Basel, Dept Med Parasitol & Infect Biol, CH-4002 Basel, Switzerland
[5] Univ Basel, CH-4002 Basel, Switzerland
[6] Univ Dusseldorf, Inst Comp Sci, D-40211 Dusseldorf, Germany
[7] Heinrich Heine Univ Dusseldorf, Ctr Digital Hlth, Moorenstr 5, D-40225 Dusseldorf, Germany
Keywords
Hosts prediction; Host adaptations; Cross-species transmission; Interpretable machine learning; GROUP-B STREPTOCOCCUS; SEQUENCE TYPE 283; FISH
DOI
10.1016/j.compbiomed.2024.108185
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Background: Streptococcus agalactiae, commonly known as Group B Streptococcus (GBS), exhibits a broad host range, manifesting as both a beneficial commensal and an opportunistic pathogen across various species. In humans, it poses significant risks, causing neonatal sepsis and meningitis, along with severe infections in adults. It also affects livestock, inducing mastitis in bovines and contributing to epidemic mortality in fish populations. Despite this wide host spectrum, the mechanisms that enable GBS to adapt to specific hosts remain inadequately elucidated. A rapid and accurate method that differentiates GBS strains associated with particular animal hosts based on genome-wide information therefore holds immense potential. Such a tool would not only bolster identification and containment efforts during GBS outbreaks but also deepen our understanding of the bacterium's host adaptations spanning humans, livestock, and other natural animal reservoirs.
Methods and results: Here, we developed three machine learning models, random forest (RF), logistic regression (LR), and support vector machine (SVM), based on genome-wide mutation data. These models enabled precise prediction of the host origin of GBS, accurately distinguishing between human, bovine, fish, and pig hosts. Moreover, we conducted an interpretable machine learning analysis using SHapley Additive exPlanations (SHAP) and variant annotation to uncover the most influential genomic features and associated genes for each host. Additionally, by closely examining misclassified samples, we gained insights into the dynamics of host transmission and the potential for zoonotic infections.
Conclusions: Our study underscores the effectiveness of RF and LR models based on mutation data for accurately predicting GBS host origins. Additionally, we identify the key features associated with each GBS host, thereby enhancing our understanding of the bacterium's host-specific adaptations.
Pages: 12