Prediction of antimicrobial resistance based on whole-genome sequencing and machine learning

被引:80
|
作者
Ren, Yunxiao [1 ]
Chakraborty, Trinad [2 ,3 ]
Doijad, Swapnil [2 ,3 ]
Falgenhauer, Linda [3 ,4 ,5 ]
Falgenhauer, Jane [2 ,3 ]
Goesmann, Alexander [3 ,6 ]
Hauschild, Anne-Christin [1 ]
Schwengers, Oliver [3 ,6 ]
Heider, Dominik [1 ]
机构
[1] Philipps Univ Marburg, Fac Math & Comp Sci, Dept Data Sci Biomed, D-35032 Marburg, Germany
[2] Justus Liebig Univ Giessen, Inst Med Microbiol, D-35392 Giessen, Germany
[3] German Ctr Infect Res, Partner Site Giessen Marburg Langen, D-35392 Giessen, Germany
[4] Justus Liebig Univ Giessen, Inst Hyg & Environm Med, D-35392 Giessen, Germany
[5] Hess Univ Kompetenzzentrum Krankenhaushyg, D-35392 Giessen, Germany
[6] Justus Liebig Univ Giessen, Dept Bioinformat & Syst Biol, D-35392 Giessen, Germany
关键词
CHAOS GAME REPRESENTATION; ANTIBIOTIC-RESISTANCE; ESCHERICHIA-COLI; READ ALIGNMENT; MODEL;
D O I
10.1093/bioinformatics/btab681
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Antimicrobial resistance (AMR) is one of the biggest global problems threatening human and animal health. Rapid and accurate AMR diagnostic methods are thus very urgently needed. However, traditional antimicrobial susceptibility testing (AST) is time-consuming, low throughput and viable only for cultivable bacteria. Machine learning methods may pave the way for automated AMR prediction based on genomic data of the bacteria. However, comparing different machine learning methods for the prediction of AMR based on different encodings and whole-genome sequencing data without previously known knowledge remains to be done. Results: In this study, we evaluated logistic regression (LR), support vector machine (SVM), random forest (RF) and convolutional neural network (CNN) for the prediction of AMR for the antibiotics ciprofloxacin, cefotaxime, ceftazidime and gentamicin. We could demonstrate that these models can effectively predict AMR with label encoding, one-hot encoding and frequency matrix chaos game representation (FCGR encoding) on whole-genome sequencing data. We trained these models on a large AMR dataset and evaluated them on an independent public dataset. Generally, RFs and CNNs perform better than LR and SVM with AUCs up to 0.96. Furthermore, we were able to identify mutations that are associated with AMR for each antibiotic.
引用
收藏
页码:325 / 334
页数:10
相关论文
共 50 条
  • [31] Whole-genome sequencing for TB drug resistance determination
    Omar, S. V.
    Featherston, J.
    Said, H.
    Bhyat, Z.
    Dreyer, A.
    Rees, J.
    Ismail, N.
    INTERNATIONAL JOURNAL OF INFECTIOUS DISEASES, 2014, 21 : 56 - 57
  • [32] Integrating whole genome sequencing and machine learning for predicting antimicrobial resistance in critical pathogens: a systematic review of antimicrobial susceptibility tests
    Ardila, Carlos M.
    Yadalam, Pradeep K.
    Gonzalez-Arroyave, Daniel
    PEERJ, 2024, 12
  • [33] Integrating whole genome sequencing and machine learning for predicting antimicrobial resistance in critical pathogens: a systematic review of antimicrobial susceptibility tests
    Ardila, Carlos M.
    Yadalam, Pradeep K.
    Gonzalez-Arroyave, Daniel
    PEERJ COMPUTER SCIENCE, 2024, 12
  • [34] Machine learning approaches for advanced detection of rare genetic disorders in whole-genome sequencing
    Alzahrani, Ahmad A.
    Alharithi, Fahd S.
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 106 : 582 - 593
  • [35] Implementing Whole-Genome Sequencing for Ongoing Surveillance of Antimicrobial Resistance: Exemplifying Insights Into Klebsiella pneumoniae
    Aanensen, David M.
    Carlos, Celia C.
    Donado-Godoy, Pilar
    Okeke, Iruka N.
    Ravikumar, K. L.
    CLINICAL INFECTIOUS DISEASES, 2021, 73 : S255 - S257
  • [36] Whole-Genome Sequencing Analysis Accurately Predicts Antimicrobial Resistance Phenotypes in Campylobacter spp.
    Zhao, S.
    Tyson, G. H.
    Chen, Y.
    Li, C.
    Mukherjee, S.
    Young, S.
    Lam, C.
    Folster, J. P.
    Whichard, J. M.
    McDermott, P. F.
    APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2016, 82 (02) : 459 - 466
  • [37] Challenges and opportunities for whole-genome sequencing- based surveillance of antibiotic resistance
    Schurch, Anita C.
    van Schaik, Willem
    ANNALS OF THE NEW YORK ACADEMY OF SCIENCES, 2017, 1388 (01) : 108 - 120
  • [38] Interpreting Whole-Genome Sequencing
    Grody, Wayne W.
    Vilain, Eric
    Nelson, Stanley F.
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2014, 312 (03): : 296 - 296
  • [39] Whole-genome sequencing in pharmacogeneticson
    Urban, Thomas J.
    PHARMACOGENOMICS, 2013, 14 (04) : 345 - 348
  • [40] Whole-genome DNA sequencing
    Myers, G
    COMPUTING IN SCIENCE & ENGINEERING, 1999, 1 (03) : 33 - 43