Benchmarking 2D/3D/MD-QSAR Models for Imatinib Derivatives: How Far Can We Predict?

被引:14
作者
Zin, Phyo Phyo Kyaw [1 ]
Borrel, Alexandre [1 ]
Fourches, Denis [1 ]
机构
[1] North Carolina State Univ, Bioinformat Res Ctr, Dept Chem, Raleigh, NC 27695 USA
关键词
TYROSINE KINASE INHIBITORS; SELECTIVE INHIBITOR; PROTEIN; VALIDATION; DISCOVERY; DOCKING; GROWTH; POTENT; GLIDE; DRUG;
D O I
10.1021/acs.jcim.0c00200
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Imatinib, a 2-phenylaminopyridine-based BCR-ABL tyrosine kinase inhibitor, is a highly effective drug for treating Chronic Myeloid Leukemia (CML). However, cases of drug resistance are constantly emerging due to various mutations in the ABL kinase domain; thus, it is crucial to identify novel bioactive analogues. Reliable QSAR models and molecular docking protocols have been shown to facilitate the discovery of new compounds from chemical libraries prior to experimental testing. However, as the vast majority of QSAR models strictly relies on 2D descriptors, the rise of 3D descriptors directly computed from molecular dynamics simulations offers new opportunities to potentially augment the reliability of QSAR models. Herein, we employed molecular docking and molecular dynamics on a large series of Imatinib derivatives and developed an ensemble of QSAR models relying on deep neural nets (DNN) and hybrid sets of 2D/3D/MD descriptors in order to predict the binding affinity and inhibition potencies of those compounds. Through rigorous validation tests, we showed that our DNN regression models achieved excellent external prediction performances for the pKi data set (n = 555, R-2 >= 0.71. and MAE <= 0.85), and the pIC(50) data set (n = 306, R-2 >= 0.54. and MAE <= 0.71) with strict validation protocols based on external test sets and 10-fold native and nested cross validations. Interestingly, the best DNN and random forest models performed similarly across all descriptor sets. In fact, for this particular series of compounds, our external test results suggest that incorporating additional 3D protein-ligand binding site fingerprint, descriptors, or even MD time-series descriptors did not significantly improve the overall R-2 but lowered the MAE of DNN QSAR models. Those augmented models could still help in identifying and understanding the key dynamic protein-ligand interactions to be optimized for further molecular design.
引用
收藏
页码:3342 / 3360
页数:19
相关论文
共 45 条
[1]  
[Anonymous], 2014, ARXIV NEURAL EVOLUTI
[2]  
[Anonymous], 2015, LIGPREP 3 4 USER MAN
[3]   Characterizing the Chemical Space of ERK2 Kinase Inhibitors Using Descriptors Computed from Molecular Dynamics Trajectories [J].
Ash, Jeremy ;
Fourches, Denis .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (06) :1286-1299
[4]   A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking [J].
Ballester, Pedro J. ;
Mitchell, John B. O. .
BIOINFORMATICS, 2010, 26 (09) :1169-1175
[5]   KNIME-CDK: Workflow-driven cheminformatics [J].
Beisken, Stephan ;
Meinl, Thorsten ;
Wiswedel, Bernd ;
de Figueiredo, Luis F. ;
Berthold, Michael ;
Steinbeck, Christoph .
BMC BIOINFORMATICS, 2013, 14
[6]   Pock Drug: A Model for Predicting Pocket Druggability That Overcomes Pocket Estimation Uncertainties [J].
Borrel, Alexandre ;
Regad, Leslie ;
Xhaard, Henri ;
Petitjean, Michel ;
Camproux, Anne-Claude .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (04) :882-895
[7]  
Bowers KJ., 2006, P 2006 ACM IEEE C SU, P43, DOI [DOI 10.1145/1188455.1188544, DOI 10.1109/SC.2006.54]
[8]   Second-Generation Tyrosine Kinase Inhibitors (Tki) as Salvage Therapy for Resistant or Intolerant Patients to Prior TKIs [J].
Breccia, Massimo ;
Alimena, Giuliana .
MEDITERRANEAN JOURNAL OF HEMATOLOGY AND INFECTIOUS DISEASES, 2014, 6 (01)
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   PyDPI: Freely Available Python']Python Package for Chemoinformatics, Bioinformatics, and Chemogenomics Studies [J].
Cao, Dong-Sheng ;
Liang, Yi-Zeng ;
Yan, Jun ;
Tan, Gui-Shan ;
Xu, Qing-Song ;
Liu, Shao .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (11) :3086-3096