Time-Domain Analysis of Molecular Dynamics Trajectories Using Deep Neural Networks: Application to Activity Ranking of Tankyrase Inhibitors

被引:22
作者
Berishvili, Vladimir P. [1 ]
Perkin, Valentin O. [1 ]
Voronkov, Andrew E. [1 ,2 ]
Radchenko, Eugene V. [1 ]
Syed, Riyaz [3 ]
Reddy, Chittireddy Venkata Ramana [3 ]
Pillay, Viness [4 ]
Kumar, Pradeep [4 ]
Choonara, Yahya E. [4 ]
Kamal, Ahmed [5 ]
Palyulin, Vladimir A. [1 ]
机构
[1] Lomonosov Moscow State Univ, Dept Chem, Moscow 119991, Russia
[2] Digital BioPharm Ltd, Hovseterveien 42 A,H0301, N-0768 Oslo, Norway
[3] Jawaharlal Nehru Technol Univ, Dept Chem, Hyderabad 500085, India
[4] Univ Witwatersrand, Wits Adv Drug Delivery Platform Res Unit, Sch Therapeut Sci, Dept Pharm & Pharmacol,Fac Hlth Sci, 7 York Rd, ZA-2193 Parktown, South Africa
[5] Jamia Hamdard, Sch Pharmaceut Educ & Res, New Delhi 110062, India
基金
俄罗斯基础研究基金会; 新加坡国家研究基金会;
关键词
BINDING AFFINITIES; BETA-CATENIN; LIGAND; PREDICTION; OPTIMIZATION; ACCURACY; DOCKING;
D O I
10.1021/acs.jcim.9b00135
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Molecular dynamics simulations provide valuable insights into the behavior of molecular systems. Extending the recent trend of using machine learning techniques to predict physicochemical properties from molecular dynamics data, we propose to consider the trajectories as multidimensional time series represented by 2D tensors containing the ligand-protein interaction descriptor values for each time step. Similar in structure to the time series encountered in modern approaches for signal, speech, and natural language processing, these time series can be directly analyzed using long short-term memory (LSTM) recurrent neural networks or convolutional neural networks (CNNs). The predictive regression models for the ligand-protein affinity were built for a subset of the PDBbind v.2017 database and applied to inhibitors of tankyrase, an enzyme of the poly(ADP-ribose)-polymerase (PARP) family that can be used in the treatment of colorectal cancer. As an additional test set, a subset of the Community Structure Activity Resource (CSAR) data set was used. For comparison, the random forest and simple neural network models based on the crystal pose or the trajectory-averaged descriptors were used, as well as the commonly employed docking and molecular mechanics Poisson-Boltzmann surface area (MM-PBSA) scores. Convolutional neural networks based on the 2D tensors of ligand-protein interaction descriptors for short (2 ns) trajectories provide the best accuracy and predictive power, reaching the Spearman rank correlation coefficient of 0.73 and Pearson correlation coefficient of 0.70 for the tankyrase test set. Taking into account the recent increase in computational power of modern GPUs and relatively low computational complexity of the proposed approach, it can be used as an advanced virtual screening filter for compound prioritization.
引用
收藏
页码:3519 / 3532
页数:14
相关论文
共 74 条
  • [1] Machine learning for neuroirnaging with scikit-learn
    Abraham, Alexandre
    Pedregosa, Fabian
    Eickenberg, Michael
    Gervais, Philippe
    Mueller, Andreas
    Kossaifi, Jean
    Gramfort, Alexandre
    Thirion, Bertrand
    Varoquaux, Gael
    [J]. FRONTIERS IN NEUROINFORMATICS, 2014, 8
  • [2] Gromacs: High performance molecular simulations through multi-level parallelism from laptops to supercomputers
    Abraham, Mark James
    Murtola, Teemu
    Schulz, Roland
    Páll, Szilárd
    Smith, Jeremy C.
    Hess, Berk
    Lindah, Erik
    [J]. SoftwareX, 2015, 1-2 : 19 - 25
  • [3] Tankyrase Inhibition Blocks Wnt/β-Catenin Pathway and Reverts Resistance to PI3K and AKT Inhibitors in the Treatment of Colorectal Cancer
    Arques, Oriol
    Chicote, Irene
    Puig, Isabel
    Tenbaum, Stephan P.
    Argiles, Guillem
    Dienstmann, Rodrigo
    Fernandez, Natalia
    Caratu, Ginevra
    Matito, Judit
    Silberschmidt, Daniel
    Rodon, Jordi
    Landolfi, Stefania
    Prat, Aleix
    Espin, Eloy
    Charco, Ramon
    Nuciforo, Paolo
    Vivancos, Ana
    Shao, Wenlin
    Tabernero, Josep
    Palmer, Hector G.
    [J]. CLINICAL CANCER RESEARCH, 2016, 22 (03) : 644 - 656
  • [4] Characterizing the Chemical Space of ERK2 Kinase Inhibitors Using Descriptors Computed from Molecular Dynamics Trajectories
    Ash, Jeremy
    Fourches, Denis
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (06) : 1286 - 1299
  • [5] Machine Learning Classification Models to Improve the Docking-based Screening: A Case of PI3K-Tankyrase Inhibitors
    Berishvili, Vladimir P.
    Voronkov, Andrew E.
    Radchenko, Eugene V.
    Palyulin, Vladimir A.
    [J]. MOLECULAR INFORMATICS, 2018, 37 (11)
  • [6] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [7] A self-organizing-map approach to chemistry representation in combustion applications
    Blasco, JA
    Fueyo, N
    Dopazo, C
    Chen, JY
    [J]. COMBUSTION THEORY AND MODELLING, 2000, 4 (01) : 61 - 76
  • [8] Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)
  • [9] Accelerating the Calculation of Protein-Ligand Binding Free Energy and Residence Times Using Dynamically Optimized Collective Variables
    Brotzakis, Z. Faidon
    Limongelli, Vittorio
    Parrinello, Michele
    [J]. JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2019, 15 (01) : 743 - 750
  • [10] Complete reconstruction of an enzyme-inhibitor binding process by molecular dynamics simulations
    Buch, Ignasi
    Giorgino, Toni
    De Fabritiis, Gianni
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (25) : 10184 - 10189