The Impact of Crystallographic Data for the Development of Machine Learning Models to Predict Protein-Ligand Binding Affinity

被引:12
作者
Veit-Acosta, Martina [1 ]
de Azevedo Junior, Walter Filgueira [2 ,3 ]
机构
[1] Western Michigan Univ, 1903 Western,Michigan Ave, Kalamazoo, MI 49008 USA
[2] Pontifical Catholic Univ Rio Grande Sul PUCRS, Av Ipiranga,6681, BR-90619900 Porto Alegre, RS, Brazil
[3] Pontifical Catholic Univ Rio Grande Sul PUCRS, Specializat Program Bioinformat, Av Ipiranga,6681, BR-90619900 Porto Alegre, RS, Brazil
关键词
Crystal structures; machine learning; scoring function space; binding affinity; SAnDReS; Taba; MOLECULAR-DYNAMICS SIMULATIONS; NEURAL-NETWORK; CRYO-EM; SCORING FUNCTIONS; CRYSTAL-STRUCTURE; CRYOELECTRON MICROSCOPY; DOCKING SIMULATIONS; STRUCTURAL BASIS; CHEMICAL SPACE; FREE-ENERGY;
D O I
10.2174/0929867328666210210121320
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background: One of the main challenges in the early stages of drug discovery is the computational assessment of protein-ligand binding affinity. Machine learning techniques can contribute to predicting this type of interaction. We may apply these techniques following two approaches. Firstly, using the experimental structures for which affinity data is available. Secondly, using protein-ligand docking simulations. Objective: In this review, we describe recently published machine learning models based on crystal structures, for which binding affinity and thermodynamic data are available. Method: We used experimental structures available at the protein data bank and binding affinity and thermodynamic data was accessed through BindingDB, Binding MOAD, and PDBbind databases. We reviewed machine learning models to predict binding created using open source programs, such as SAnDReS and Taba. Results: Analysis of machine learning models trained against datasets, composed of crystal structure complexes indicated the high predictive performance of these models when compared with classical scoring functions. Conclusion: The rapid increase in the number of crystal structures of protein-ligand complexes created a favorable scenario for developing machine learning models to predict binding affinity. These models rely on experimental data from two sources, the structural and the affinity data. The combination of experimental data generates computational models that outperform the classical scoring functions.
引用
收藏
页码:7006 / 7022
页数:17
相关论文
共 157 条
  • [1] Structural Studies of a Rationally Selected Multi-Drug Resistant HIV-1 Protease Reveal Synergistic Effect of Distal Mutations on Flap Dynamics
    Agniswamy, Johnson
    Louis, John M.
    Roche, Julien
    Harrison, Robert W.
    Weber, Irene T.
    [J]. PLOS ONE, 2016, 11 (12):
  • [2] QSAR classification-based virtual screening followed by molecular docking studies for identification of potential inhibitors of 5-lipoxygenase
    Ahamed, T. K. Shameera
    Rajan, Vijisha K.
    Sabira, K.
    Muraleedharan, K.
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2018, 77 : 154 - 166
  • [3] Recent improvements to Binding MOAD: a resource for protein-ligand binding affinities and structures
    Ahmed, Aqeel
    Smith, Richard D.
    Clark, Jordan J.
    Dunbar, James B., Jr.
    Carlson, Heather A.
    [J]. NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) : D465 - D469
  • [4] Predicting Kinase Inhibitor Resistance: Physics-Based and Data-Driven Approaches
    Aldeghi, Matteo
    Gapsys, Vytautas
    de Groot, Bert L.
    [J]. ACS CENTRAL SCIENCE, 2019, 5 (08) : 1468 - 1474
  • [5] AlphaFold at CASP13
    AlQuraishi, Mohammed
    [J]. BIOINFORMATICS, 2019, 35 (22) : 4862 - 4865
  • [6] Prediction of peptide binding to MHC using machine learning with sequence and structure-based feature sets
    Aranha, Michelle P.
    Spooner, Catherine
    Demerdash, Omar
    Czejdo, Bogdan
    Smith, Jeremy C.
    Mitchell, Julie C.
    [J]. BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2020, 1864 (04):
  • [7] Azevedo LS, 2012, CURR BIOINFORM, V7, P352
  • [8] Does a More Precise Chemical Description of Protein-Ligand Complexes Lead to More Accurate Prediction of Binding Affinity?
    Ballester, Pedro J.
    Schreyer, Adrian
    Blundell, Tom L.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (03) : 944 - 955
  • [9] A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking
    Ballester, Pedro J.
    Mitchell, John B. O.
    [J]. BIOINFORMATICS, 2010, 26 (09) : 1169 - 1175
  • [10] Immunopeptidomic Data Integration to Artificial Neural Networks Enhances Protein-Drug Immunogenicity Prediction
    Barra, Carolina
    Ackaert, Chloe
    Reynisson, Birkir
    Schockaert, Jana
    Jessen, Leon Eyrich
    Watson, Mark
    Jang, Anne
    Comtois-Marotte, Simon
    Goulet, Jean-Philippe
    Pattijn, Sofie
    Paramithiotis, Eustache
    Nielsen, Morten
    [J]. FRONTIERS IN IMMUNOLOGY, 2020, 11