MathDL: mathematical deep learning for D3R Grand Challenge 4

被引:66
作者
Duc Duy Nguyen [1 ]
Gao, Kaifu [1 ]
Wang, Menglun [1 ]
Wei, Guo-Wei [1 ,2 ,3 ]
机构
[1] Michigan State Univ, Dept Math, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Biochem & Mol Biol, E Lansing, MI 48824 USA
[3] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48824 USA
关键词
D3R-drug design data resource; Algebraic topology; Graph theory; Differential geometry; Binding affinity; Pose prediction; Docking; Deep learning; Generative adversarial network; EMPIRICAL SCORING FUNCTIONS; PROTEIN-LIGAND INTERACTIONS; BINDING-AFFINITY; GRAPH-THEORY; PERSISTENT HOMOLOGY; DYNAMICS; PREDICTION; DOCKING; ENERGY; FLEXIBILITY;
D O I
10.1007/s10822-019-00237-5
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present the performances of our mathematical deep learning (MathDL) models for D3R Grand Challenge 4 (GC4). This challenge involves pose prediction, affinity ranking, and free energy estimation for beta secretase 1 (BACE) as well as affinity ranking and free energy estimation for Cathepsin S (CatS). We have developed advanced mathematics, namely differential geometry, algebraic graph, and/or algebraic topology, to accurately and efficiently encode high dimensional physical/chemical interactions into scalable low-dimensional rotational and translational invariant representations. These representations are integrated with deep learning models, such as generative adversarial networks (GAN) and convolutional neural networks (CNN) for pose prediction and energy evaluation, respectively. Overall, our MathDL models achieved the top place in pose prediction for BACE ligands in Stage 1a. Moreover, our submissions obtained the highest Spearman correlation coefficient on the affinity ranking of 460 CatS compounds, and the smallest centered root mean square error on the free energy set of 39 CatS molecules. It is worthy to mention that our method on docking pose predictions has significantly improved from our previous ones.
引用
收藏
页码:131 / 147
页数:17
相关论文
共 97 条
[1]   ICM - A NEW METHOD FOR PROTEIN MODELING AND DESIGN - APPLICATIONS TO DOCKING AND STRUCTURE PREDICTION FROM THE DISTORTED NATIVE CONFORMATION [J].
ABAGYAN, R ;
TOTROV, M ;
KUZNETSOV, D .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1994, 15 (05) :488-506
[2]  
[Anonymous], TECHNICAL REPORT
[3]  
[Anonymous], 1983, CHEM GRAPH THEORY
[4]  
[Anonymous], STRUCT BIOINFORM
[5]  
Arjovsky M., 2017, P 34 INT C MACH LEAR, V70, P214
[6]   Anisotropy of fluctuation dynamics of proteins with an elastic network model [J].
Atilgan, AR ;
Durell, SR ;
Jernigan, RL ;
Demirel, MC ;
Keskin, O ;
Bahar, I .
BIOPHYSICAL JOURNAL, 2001, 80 (01) :505-515
[7]   Vibrational dynamics of folded proteins: Significance of slow and fast motions in relation to function and stability [J].
Bahar, I ;
Atilgan, AR ;
Demirel, MC ;
Erman, B .
PHYSICAL REVIEW LETTERS, 1998, 80 (12) :2733-2736
[8]   Direct evaluation of thermal fluctuations in proteins using a single-parameter harmonic potential [J].
Bahar, I ;
Atilgan, AR ;
Erman, B .
FOLDING & DESIGN, 1997, 2 (03) :173-181
[9]  
Balaban A. T., 1976, CHEM APPL GRAPH THEO
[10]   A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking [J].
Ballester, Pedro J. ;
Mitchell, John B. O. .
BIOINFORMATICS, 2010, 26 (09) :1169-1175