Developing a SARS-CoV-2 main protease binding prediction random forest model for drug repurposing for COVID-19 treatment

被引:5
作者
Liu, Jie [1 ]
Xu, Liang [1 ]
Guo, Wenjing [1 ]
Li, Zoe [1 ]
Khan, Md Kamrul Hasan [1 ]
Ge, Weigong [1 ]
Patterson, Tucker A. [1 ]
Hong, Huixiao [1 ]
机构
[1] US FDA, Natl Ctr Toxicol Res, Jefferson, AR 72079 USA
关键词
SARS-CoV-2; main protease; drug repurposing; random forest; binding activity; predictive model; MOLECULAR DESCRIPTORS; INHIBITORS; APPLICABILITY; DISCOVERY; DOMAIN;
D O I
10.1177/15353702231209413
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
The coronavirus disease 2019 (COVID-19) global pandemic resulted in millions of people becoming infected with the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus and close to seven million deaths worldwide. It is essential to further explore and design effective COVID-19 treatment drugs that target the main protease of SARS-CoV-2, a major target for COVID-19 drugs. In this study, machine learning was applied for predicting the SARS-CoV-2 main protease binding of Food and Drug Administration (FDA)-approved drugs to assist in the identification of potential repurposing candidates for COVID-19 treatment. Ligands bound to the SARS-CoV-2 main protease in the Protein Data Bank and compounds experimentally tested in SARS-CoV-2 main protease binding assays in the literature were curated. These chemicals were divided into training (516 chemicals) and testing (360 chemicals) data sets. To identify SARS-CoV-2 main protease binders as potential candidates for repurposing to treat COVID-19, 1188 FDA-approved drugs from the Liver Toxicity Knowledge Base were obtained. A random forest algorithm was used for constructing predictive models based on molecular descriptors calculated using Mold2 software. Model performance was evaluated using 100 iterations of fivefold cross-validations which resulted in 78.8% balanced accuracy. The random forest model that was constructed from the whole training dataset was used to predict SARS-CoV-2 main protease binding on the testing set and the FDA-approved drugs. Model applicability domain and prediction confidence on drugs predicted as the main protease binders discovered 10 FDA-approved drugs as potential candidates for repurposing to treat COVID-19. Our results demonstrate that machine learning is an efficient method for drug repurposing and, thus, may accelerate drug development targeting SARS-CoV-2.
引用
收藏
页码:1927 / 1936
页数:10
相关论文
共 59 条
  • [1] Targeting SARS-CoV-2 Main Protease: A Successful Story Guided by an In Silico Drug Repurposing Approach
    Ambrosio, Francesca Alessandra
    Costa, Giosue
    Romeo, Isabella
    Esposito, Francesca
    Alkhatib, Mohammad
    Salpini, Romina
    Svicher, Valentina
    Corona, Angela
    Malune, Paolo
    Tramontano, Enzo
    Ceccherini-Silberstein, Francesca
    Alcaro, Stefano
    Artese, Anna
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (11) : 3601 - 3613
  • [2] Arun KG., 2021, J BIOMOL STRUCT DYN, V39
  • [3] In silico drug repurposing by combining machine learning classification model and molecular dynamics to identify a potential OGT inhibitor
    Azevedo, Pedro Henrique Rodrigues de Alencar
    Pecanha, Bruna Rachel de Britto
    Flores Jr, Luiz Augusto Pinheiro
    Alves, Tatiana Fialho
    Dias, Luiza Rosaria Sousa
    Muri, Estela Maris Freitas
    Lima, Camilo Henrique da Silva
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2024, 42 (03) : 1417 - 1428
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] The Liver Toxicity Knowledge Base: A Systems Approach to a Complex End Point
    Chen, M.
    Zhang, J.
    Wang, Y.
    Liu, Z.
    Kelly, R.
    Zhou, G.
    Fang, H.
    Borlak, J.
    Tong, W.
    [J]. CLINICAL PHARMACOLOGY & THERAPEUTICS, 2013, 93 (05) : 409 - 412
  • [6] Cohen J, 2021, SCIENCE, V374, P1544, DOI 10.1126/science.acz9879
  • [7] Repurposing cefuroxime for treatment of COVID-19: a scoping review ofin silicostudies
    Durojaiye, Ashimiyu B.
    Clarke, John-Ross D.
    Stamatiades, George A.
    Wang, Can
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2021, 39 (12) : 4547 - 4554
  • [8] Machine-learning repurposing of DrugBank compounds for opioid use disorder
    Feng, Hongsong
    Jiang, Jian
    Wei, Guo-Wei
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 160
  • [9] Mpropred: A machine learning (ML) driven Web-App for bioactivity prediction of SARS-CoV-2 main protease (Mpro) antagonists
    Ferdous, Nadim
    Reza, Mahjerin Nasrin
    Hossain, Mohammad Uzzal
    Mahmud, Shahin
    Napis, Suhami
    Chowdhury, Kamal
    Mohiuddin, A. K. M.
    [J]. PLOS ONE, 2023, 18 (06):
  • [10] Perspectives on SARS-CoV-2 Main Protease Inhibitors
    Gao, Kaifu
    Wang, Rui
    Chen, Jiahui
    Tepe, Jetze J.
    Huang, Faqing
    Wei, Guo-Wei
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2021, 64 (23) : 16922 - 16955