Developing a SARS-CoV-2 main protease binding prediction random forest model for drug repurposing for COVID-19 treatment

被引:5
作者
Liu, Jie [1 ]
Xu, Liang [1 ]
Guo, Wenjing [1 ]
Li, Zoe [1 ]
Khan, Md Kamrul Hasan [1 ]
Ge, Weigong [1 ]
Patterson, Tucker A. [1 ]
Hong, Huixiao [1 ]
机构
[1] US FDA, Natl Ctr Toxicol Res, Jefferson, AR 72079 USA
关键词
SARS-CoV-2; main protease; drug repurposing; random forest; binding activity; predictive model; MOLECULAR DESCRIPTORS; INHIBITORS; APPLICABILITY; DISCOVERY; DOMAIN;
D O I
10.1177/15353702231209413
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
The coronavirus disease 2019 (COVID-19) global pandemic resulted in millions of people becoming infected with the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus and close to seven million deaths worldwide. It is essential to further explore and design effective COVID-19 treatment drugs that target the main protease of SARS-CoV-2, a major target for COVID-19 drugs. In this study, machine learning was applied for predicting the SARS-CoV-2 main protease binding of Food and Drug Administration (FDA)-approved drugs to assist in the identification of potential repurposing candidates for COVID-19 treatment. Ligands bound to the SARS-CoV-2 main protease in the Protein Data Bank and compounds experimentally tested in SARS-CoV-2 main protease binding assays in the literature were curated. These chemicals were divided into training (516 chemicals) and testing (360 chemicals) data sets. To identify SARS-CoV-2 main protease binders as potential candidates for repurposing to treat COVID-19, 1188 FDA-approved drugs from the Liver Toxicity Knowledge Base were obtained. A random forest algorithm was used for constructing predictive models based on molecular descriptors calculated using Mold2 software. Model performance was evaluated using 100 iterations of fivefold cross-validations which resulted in 78.8% balanced accuracy. The random forest model that was constructed from the whole training dataset was used to predict SARS-CoV-2 main protease binding on the testing set and the FDA-approved drugs. Model applicability domain and prediction confidence on drugs predicted as the main protease binders discovered 10 FDA-approved drugs as potential candidates for repurposing to treat COVID-19. Our results demonstrate that machine learning is an efficient method for drug repurposing and, thus, may accelerate drug development targeting SARS-CoV-2.
引用
收藏
页码:1927 / 1936
页数:10
相关论文
共 59 条
  • [31] Molecular Docking for Identification of Potential Targets for Drug Repurposing
    Luo, Heng
    Mattes, William
    Mendrick, Donna L.
    Hong, Huixiao
    [J]. CURRENT TOPICS IN MEDICINAL CHEMISTRY, 2016, 16 (30) : 3636 - 3645
  • [32] Machine Learning Methods for Predicting HLA-Peptide Binding Activity
    Luo, Heng
    Ye, Hao
    Ng, Hui Wen
    Shi, Leming
    Tong, Weida
    Mendrick, Donna L.
    Hong, Huixiao
    [J]. BIOINFORMATICS AND BIOLOGY INSIGHTS, 2015, 9 : 21 - 29
  • [33] Ultralarge Virtual Screening Identifies SARS-CoV-2 Main Protease Inhibitors with Broad-Spectrum Activity against Coronaviruses
    Luttens, Andreas
    Gullberg, Hjalmar
    Abdurakhmanov, Eldar
    Duy Duc Vo
    Akaberi, Dario
    Talibov, Vladimir O.
    Nekhotiaeva, Natalia
    Vangeel, Laura
    De Jonghe, Steven
    Jochmans, Dirk
    Krambrich, Janina
    Tas, Ali
    Lundgren, Bo
    Gravenfors, Ylva
    Craig, Alexander J.
    Atilaw, Yoseph
    Sandstrom, Anja
    Moodie, Lindon W. K.
    Lundkvist, Ake
    van Hemert, Martijn J.
    Neyts, Johan
    Lennerstrand, Johan
    Kihlberg, Jan
    Sandberg, Kristian
    Danielson, U. Helena
    Carlsson, Jens
    [J]. JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2022, 144 (07) : 2905 - 2920
  • [34] Discovery of Di- and Trihaloacetamides as Covalent SARS-CoV-2 Main Protease Inhibitors with High Target Specificity
    Ma, Chunlong
    Xia, Zilei
    Sacco, Michael Dominic
    Hu, Yanmei
    Townsend, Julia Alma
    Meng, Xiangzhi
    Choza, Juliana
    Tan, Haozhou
    Jang, Janice
    Gongora, Maura, V
    Zhang, Xiujun
    Zhang, Fushun
    Xiang, Yan
    Marty, Michael Thomas
    Chen, Yu
    Wang, Jun
    [J]. JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2021, 143 (49) : 20697 - 20709
  • [35] Support Vector Machine as a Supervised Learning for the Prioritization of Novel Potential SARS-CoV-2 Main Protease Inhibitors
    Mekni, Nedra
    Coronnello, Claudia
    Langer, Thierry
    De Rosa, Maria
    Perricone, Ugo
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (14)
  • [36] Structural Basis of Potential Inhibitors Targeting SARS-CoV-2 Main Protease
    Mengist, Hylemariam Mihiretie
    Dilnessa, Tebelay
    Jin, Tengchuan
    [J]. FRONTIERS IN CHEMISTRY, 2021, 9
  • [37] Searching for potential inhibitors of SARS-COV-2 main protease using supervised learning and perturbation calculations
    Nguyen, Trung Hai
    Tam, Nguyen Minh
    Tuan, Mai Van
    Zhan, Peng
    Vu, Van V.
    Quang, Duong Tuan
    Ngo, Son Tung
    [J]. CHEMICAL PHYSICS, 2022, 564
  • [38] Repurposing the HCV NS3-4A protease drug boceprevir as COVID-19 therapeutics
    Oerlemans, Rick
    Ruiz-Moreno, Angel Jonathan
    Cong, Yingying
    Kumar, Nilima Dinesh
    Velasco-Velazquez, Marco A.
    Neochoritis, Constantinos G.
    Smith, Jolanda
    Reggiori, Fulvio
    Groves, Matthew R.
    Domling, Alexander
    [J]. RSC MEDICINAL CHEMISTRY, 2021, 12 (03): : 370 - 379
  • [39] Uncovering Flexible Active Site Conformations of SARS-CoV-2 3CL Proteases through Protease Pharmacophore Clusters and COVID-19 Drug Repurposing
    Pathak, Nikhil
    Chen, Yun-Ti
    Hsu, Yen-Chao
    Hsu, Nung-Yu
    Kuo, Chih-Jung
    Tsai, Hui Ping
    Kang, Jaw-Jou
    Huang, Chih-Heng
    Chang, Sui-Yuan
    Chang, Yu-Hsiu
    Liang, Po-Huang
    Yang, Jinn-Moon
    [J]. ACS NANO, 2021, 15 (01) : 857 - 872
  • [40] An Overview of Severe Acute Respiratory Syndrome-Coronavirus (SARS-CoV) 3CL Protease Inhibitors: Peptidomimetics and Small Molecule Chemotherapy
    Pillaiyar, Thanigaimalai
    Manickam, Manoj
    Namasivayam, Vigneshwaran
    Hayashi, Yoshio
    Jung, Sang-Hun
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2016, 59 (14) : 6595 - 6628