Optimising machine learning prediction of minimum inhibitory concentrations in Klebsiella pneumoniae

被引:1
作者
Biffignandi, Gherard Batisti [1 ,2 ,3 ]
Chindelevitch, Leonid [2 ]
Corbella, Marta [4 ]
Feil, Edward J. [5 ]
Sassera, Davide [1 ,6 ]
Lees, John A. [3 ]
机构
[1] Univ Pavia, Dept Biol & Biotechnol, Pavia, Italy
[2] Imperial Coll London, MRC Ctr Global Infect Dis Anal, London, England
[3] European Bioinformat Inst, European Mol Biol Lab, Wellcome Genome Campus, Hinxton, England
[4] Fdn IRCCS Policlin San Matteo, Microbiol & Virol Unit, Pavia, Italy
[5] Univ Bath, Milner Ctr Evolut, Dept Life Sci, Bath, England
[6] Fdn IRCCS Policlin San Matteo, Pavia, Italy
来源
MICROBIAL GENOMICS | 2024年 / 10卷 / 03期
基金
英国医学研究理事会;
关键词
AMR; antibiotic resistance; bacterial genomics; GWAS; Klebsiella pneumoniae; machine learning; MIC; REGULARIZATION; SELECTION; MODELS;
D O I
10.1099/mgen.0.001222
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Minimum Inhibitory Concentrations (MICs) are the gold standard for quantitatively measuring antibiotic resistance. However, lab- based MIC determination can be time- consuming and suffers from low reproducibility, and interpretation as sensitive or resistant relies on guidelines which change over time. Genome sequencing and machine learning promise to allow in silico MIC prediction as an alternative approach which overcomes some of these difficulties, albeit the interpretation of MIC is still needed. Nevertheless, precisely how we should handle MIC data when dealing with predictive models remains unclear, since they are measured semi- quantitatively, with varying resolution, and are typically also left- and right- censored within varying ranges. We therefore investigated genome- based prediction of MICs in the pathogen Klebsiella pneumoniae using 4367 genomes with both simulated semi- quantitative traits and real MICs. As we were focused on clinical interpretation, we used interpretable rather than black - box machine learning models, namely, Elastic Net, Random Forests, and linear mixed models. Simulated traits were generated accounting for oligogenic, polygenic, and homoplastic genetic effects with different levels of heritability. Then we assessed how model prediction accuracy was affected when MICs were framed as regression and classification. Our results showed that treating the MICs differently depending on the number of concentration levels of antibiotic available was the most promising learning strategy. Specifically, to optimise both prediction accuracy and inference of the correct causal variants, we recommend considering the MICs as continuous and framing the learning problem as a regression when the number of observed antibiotic concentration levels is large, whereas with a smaller number of concentration levels they should be treated as a categorical variable and the learning problem should be framed as a classification. Our findings also underline how predictive models can be improved when prior biological knowledge is taken into account, due to the varying genetic architecture of each antibiotic resistance trait. Finally, we emphasise that incrementing the population database is pivotal for the future clinical implementation of these models to support routine machinelearning based diagnostics.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Rapid determination of colistin resistance in Klebsiella pneumoniae by MALDI-TOF peak based machine learning algorithm with MATLAB
    Iskender, Secil
    Heydarov, Saddam
    Yalcin, Metin
    Faydaci, Cagri
    Kurt, Ozge
    Surme, Serkan
    Kucukbasmaci, Omer
    DIAGNOSTIC MICROBIOLOGY AND INFECTIOUS DISEASE, 2023, 107 (04)
  • [22] Inhibitory effect of trisodium citrate on biofilms formed by Klebsiella pneumoniae
    Chu, Lijuan
    Zhou, Xingyan
    Shen, Yan
    Yu, Yu
    JOURNAL OF GLOBAL ANTIMICROBIAL RESISTANCE, 2020, 22 : 452 - 456
  • [23] Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data
    Macesic, Nenad
    Bear, Oliver J.
    Pe'er, Itsik
    Tatonetti, Nicholas P.
    Peleg, Anton Y.
    Uhlemann, Anne-Catrin
    MSYSTEMS, 2020, 5 (03)
  • [24] Characterization of Klebsiella pneumoniae strains Exposed to Subinhibitory Concentrations of Chlorhexidine
    Duman, Gamze Gizem
    Cuhadar, Tugba
    Yamak, Ahmet Sungur
    Albakkour, Katren
    Caglar, Kayhan
    Kalkanci, Ayse
    MIKROBIYOLOJI BULTENI, 2019, 53 (01): : 114 - 117
  • [25] Mutant prevention concentrations of ciprofloxacin against urinary isolates of Escherichia coli and Klebsiella pneumoniae
    Daoud, Ziad
    Sokhn, Elie Salem
    Azar, Eid
    Masri, Khalil
    Doron, Shira
    JOURNAL OF INFECTION IN DEVELOPING COUNTRIES, 2014, 8 (02): : 154 - 159
  • [26] Analysis and prediction of atmospheric ozone concentrations using machine learning
    Rass, Stephan
    Leuenberger, Markus C.
    FRONTIERS IN BIG DATA, 2025, 7
  • [27] Machine learning model of imipenem-resistant Klebsiella pneumoniae based on MALDI-TOF-MS platform: An observational study
    Zeng, Yu
    Wang, Chao
    Ye, Qing
    Liu, Gang
    Zhang, Lixia
    Wan, Jingjing
    Zhu, Yu
    HEALTH SCIENCE REPORTS, 2023, 6 (09)
  • [28] Clinical relevance of Minimum Inhibitory Concentrations (MICs)
    Branson, E
    AQUACULTURE, 2001, 196 (3-4) : 289 - 296
  • [29] Comparing Scientific Machine Learning With Population Pharmacokinetic and Classical Machine Learning Approaches for Prediction of Drug Concentrations
    Valderrama, Diego
    Teplytska, Olga
    Koltermann, Luca Marie
    Trunz, Elena
    Schmulenson, Eduard
    Fritsch, Achim
    Jaehde, Ulrich
    Froehlich, Holger
    CPT-PHARMACOMETRICS & SYSTEMS PHARMACOLOGY, 2025, : 759 - 769
  • [30] In silico Prediction of Inhibitory Constant of Thrombin Inhibitors using Machine Learning
    Zhao, Junnan
    Zhu, Lu
    Zhou, Weineng
    Yin, Lingfeng
    Wang, Yuchen
    Fan, Yuanrong
    Chen, Yadong
    Liu, Haichun
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2018, 21 (09) : 662 - 669