Prediction of genome- wide imipenem resistance features in Klebsiella pneumoniae using machine learning

被引:3
作者
Li, Shanshan [1 ]
Wu, Jun [2 ]
Ma, Nan [1 ]
Liu, Wenjia [1 ,3 ]
Shao, Mengjie [1 ]
Ying, Nanjiao [1 ,4 ]
Zhu, Lei [1 ,4 ]
机构
[1] Hangzhou Dianzi Univ, Coll Automation, Hangzhou 310018, Zhejiang, Peoples R China
[2] Linan Ctr Dis Control & Prevent, Linan 311300, Peoples R China
[3] Hangzhou Dianzi Univ, Coll Elect & Informat Engn, Hangzhou 310018, Peoples R China
[4] Hangzhou Dianzi Univ, Inst Biomed Engn & Instrument, Hangzhou 310018, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Klebsiella pneumoniae; carbapenem; imipenem; mer feature; antibiotic resistance gene; machine learning; ESCHERICHIA-COLI; DISSEMINATION; GENES;
D O I
10.1099/jmm.0.001657
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Introduction. The resistance rate of Klebsiella pneumoniae (K. pneumoniae) to imipenem is increasing year by year, and the imipenem resistance mechanism of K. pneumoniae is complex. Therefore, it is urgent to develop new strategies to explore the resistance mechanism of imipenem for its effective and accurate use in clinical practice. Hypothesis/Gap sStatement. Machine learning could identify resistance features and biological process that influence micro-bial resistance from whole-genome sequencing (WGS) data. Aims. This work aimed to predict imipenem resistance genetic features in K. pneumoniae from whole-genome k-mer features, and analyse their function for understanding its resistance mechanism. Methods. This study analysed WGS data of K. pneumoniae combined with resistance phenotype for imipenem, and established K. pneumoniae to imipenem genotype-phenotype model to predict resistance features using chi-squared test and random forest. An external clinical dataset was used to verify prediction power of resistance features. The potential genes were iden-tified through alignment the resistance features with the K. pneumoniae reference genome using B iota ASTn, the functions of potential genes were further analysed to explore its resistance-related signalling pathways with GO and KEGG analysis, the resistance sequence patterns were screened using STREME software. Finally, the resistance features were combined and mod-elled through four machine-learning algorithms (logistic regression, SVM, GBDT and XGBoost) to evaluate their phenotype prediction ability. Results. A total of 16 670 imipenem resistance features were predicted from genotype-phenotype model. The 30 potential genes were identified by annotating the resistance features and corresponded to known antibiotic-related genes (mdtM, dedA, rne, etc.). GO and KEGG pathway analyses indicated the possible association of imipenem resistance with metabolism process and cell membrane. CRYCAGCDN and CGRDAAAN were found from the imipenem resistance features, which were widely pre-sented in the reported beta-lactam resistance genes (blaSHV, blaCTX-M, blaTEM, etc.), and YCYAGCMCAST with metabolic functions (organic substance metabolic process, nitrogen compound metabolic process and cellular metabolic process) was identified from the top 50 resistance features. The 25 resistance genes in the training dataset included 19 genes in the external dataset, which verified the accuracy of prediction. The area under curve values of logistics regression, SVM, GBDT and XGBoost were 0.965, 0.966, 0.969 and 0.969, respectively, indicating that the imipenem resistance features have a strong prediction power. Conclusion. Machine-learning methods could effectively predict the imipenem resistance feature in K. pneumoniae, and provide resistance sequence profiles for predicting resistance phenotype and exploring potential resistance mechanisms. It provides an important insight into the potential therapeutic strategies of K. pneumoniae resistance to imipenem, and speed up the application of machine learning in routine diagnosis.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Prediction of antimicrobial resistance based on whole-genome sequencing and machine learning
    Ren, Yunxiao
    Chakraborty, Trinad
    Doijad, Swapnil
    Falgenhauer, Linda
    Falgenhauer, Jane
    Goesmann, Alexander
    Hauschild, Anne-Christin
    Schwengers, Oliver
    Heider, Dominik
    [J]. BIOINFORMATICS, 2022, 38 (02) : 325 - 334
  • [32] Whole genome sequencing reveals complex resistome features of Klebsiella pneumoniae isolated from patients at major hospitals in Trinidad, West Indies
    Pustam, Aarti
    Jayaraman, Jayaraj
    Ramsubhag, Adesh
    [J]. JOURNAL OF GLOBAL ANTIMICROBIAL RESISTANCE, 2024, 37 : 141 - 149
  • [33] Effect of Porins and blaKPC Expression on Activity of Imipenem with Relebactam in Klebsiella pneumoniae: Can Antibiotic Combinations Overcome Resistance?
    Balabanian, Gregory
    Rose, Michael
    Manning, Nyla
    Landman, David
    Quale, John
    [J]. MICROBIAL DRUG RESISTANCE, 2018, 24 (07) : 877 - 881
  • [34] The Rapid Prediction of Carbapenem Resistance in Patients With Klebsiella pneumoniae Bacteremia Using Electronic Medical Record Data
    Sullivan, Timothy
    Ichikawa, Osamu
    Dudley, Joel
    Li, Li
    Aberg, Judith
    [J]. OPEN FORUM INFECTIOUS DISEASES, 2018, 5 (05):
  • [35] Prediction of antimicrobial resistance in Klebsiella pneumoniae using genomic and metagenomic next-generation sequencing data
    Zhou, Xun
    Yang, Ming
    Chen, Fangyuan
    Wang, Leilei
    Han, Peng
    Jiang, Zhi
    Shen, Siquan
    Rao, Guanhua
    Yang, Fan
    [J]. JOURNAL OF ANTIMICROBIAL CHEMOTHERAPY, 2024, 79 (10) : 2509 - 2517
  • [36] Crop Yield Prediction Using Machine Learning Approaches on a Wide Spectrum
    Joshua, S. Vinson
    Priyadharson, A. Selwin Mich
    Kannadasan, Raju
    Khan, Arfat Ahmad
    Lawanont, Worawat
    Khan, Faizan Ahmed
    Rehman, Ateeq Ur
    Ali, Muhammad Junaid
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (03): : 5663 - 5679
  • [37] A Wide Scale Survey on Weather Prediction Using Machine Learning Techniques
    Kumari, Shabnam
    Muthulakshmi, P.
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2023, 22 (05)
  • [38] Prediction of ciprofloxacin resistance in hospitalized patients using machine learning
    Mintz, Igor
    Chowers, Michal
    Obolski, Uri
    [J]. COMMUNICATIONS MEDICINE, 2023, 3 (01):
  • [39] Splicing sites prediction of human genome using machine learning techniques
    Waseem Ullah
    Khan Muhammad
    Ijaz Ul Haq
    Amin Ullah
    Saeed Ullah Khattak
    Muhammad Sajjad
    [J]. Multimedia Tools and Applications, 2021, 80 : 30439 - 30460
  • [40] Splicing sites prediction of human genome using machine learning techniques
    Ullah, Waseem
    Muhammad, Khan
    Ul Haq, Ijaz
    Ullah, Amin
    Ullah Khattak, Saeed
    Sajjad, Muhammad
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (20) : 30439 - 30460