The Impact of Crystallographic Data for the Development of Machine Learning Models to Predict Protein-Ligand Binding Affinity

被引:12
作者
Veit-Acosta, Martina [1 ]
de Azevedo Junior, Walter Filgueira [2 ,3 ]
机构
[1] Western Michigan Univ, 1903 Western,Michigan Ave, Kalamazoo, MI 49008 USA
[2] Pontifical Catholic Univ Rio Grande Sul PUCRS, Av Ipiranga,6681, BR-90619900 Porto Alegre, RS, Brazil
[3] Pontifical Catholic Univ Rio Grande Sul PUCRS, Specializat Program Bioinformat, Av Ipiranga,6681, BR-90619900 Porto Alegre, RS, Brazil
关键词
Crystal structures; machine learning; scoring function space; binding affinity; SAnDReS; Taba; MOLECULAR-DYNAMICS SIMULATIONS; NEURAL-NETWORK; CRYO-EM; SCORING FUNCTIONS; CRYSTAL-STRUCTURE; CRYOELECTRON MICROSCOPY; DOCKING SIMULATIONS; STRUCTURAL BASIS; CHEMICAL SPACE; FREE-ENERGY;
D O I
10.2174/0929867328666210210121320
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background: One of the main challenges in the early stages of drug discovery is the computational assessment of protein-ligand binding affinity. Machine learning techniques can contribute to predicting this type of interaction. We may apply these techniques following two approaches. Firstly, using the experimental structures for which affinity data is available. Secondly, using protein-ligand docking simulations. Objective: In this review, we describe recently published machine learning models based on crystal structures, for which binding affinity and thermodynamic data are available. Method: We used experimental structures available at the protein data bank and binding affinity and thermodynamic data was accessed through BindingDB, Binding MOAD, and PDBbind databases. We reviewed machine learning models to predict binding created using open source programs, such as SAnDReS and Taba. Results: Analysis of machine learning models trained against datasets, composed of crystal structure complexes indicated the high predictive performance of these models when compared with classical scoring functions. Conclusion: The rapid increase in the number of crystal structures of protein-ligand complexes created a favorable scenario for developing machine learning models to predict binding affinity. These models rely on experimental data from two sources, the structural and the affinity data. The combination of experimental data generates computational models that outperform the classical scoring functions.
引用
收藏
页码:7006 / 7022
页数:17
相关论文
共 157 条
  • [11] A History of Molecular Chaperone Structures in the Protein Data Bank
    Bascos, Neil Andrew D.
    Landry, Samuel J.
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (24)
  • [12] Screening of Therapeutic Agents for COVID-19 Using Machine Learning and Ensemble Docking Studies
    Batra, Rohit
    Chan, Henry
    Kamath, Ganesh
    Ramprasad, Rampi
    Cherukara, Mathew J.
    Sankaranarayanan, Subramanian K. R. S.
    [J]. JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2020, 11 (17) : 7058 - 7065
  • [13] Binding MOAD, a high-quality protein-ligand database
    Benson, Mark L.
    Smith, Richard D.
    Khazanov, Nickolay A.
    Dimcheff, Brandon
    Beaver, John
    Dresslar, Peter
    Nerothin, Jason
    Carlson, Heather A.
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D674 - D678
  • [14] The data universe of structural biology
    Berman, Helen M.
    Vallat, Brinda
    Lawson, Catherine L.
    [J]. IUCRJ, 2020, 7 : 630 - 638
  • [15] The Protein Data Bank
    Berman, HM
    Battistuz, T
    Bhat, TN
    Bluhm, WF
    Bourne, PE
    Burkhardt, K
    Iype, L
    Jain, S
    Fagan, P
    Marvin, J
    Padilla, D
    Ravichandran, V
    Schneider, B
    Thanki, N
    Weissig, H
    Westbrook, JD
    Zardecki, C
    [J]. ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2002, 58 : 899 - 907
  • [16] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [17] Development of CDK-targeted scoring functions for prediction of binding affinity
    Bernhardt Levin, Nayara Maria
    Pintro, Val Oliveira
    Bitencourt-Ferreira, Gabriela
    de Mattos, Bruna Boldrini
    Silverio, Ariadne de Castro
    de Azevedo, Walter Filgueira, Jr.
    [J]. BIOPHYSICAL CHEMISTRY, 2018, 235 : 1 - 8
  • [18] Machine Learning-Based Scoring Functions, Development and Applications with SAnDReS
    Bitencourt-Ferreira, Gabriela
    Rizzotto, Camila
    de Azevedo Junior, Walter Filgueira
    [J]. CURRENT MEDICINAL CHEMISTRY, 2021, 28 (09) : 1746 - 1756
  • [19] Application of Machine Learning Techniques to Predict Binding Affinity for Drug Targets: A Study of Cyclin-Dependent Kinase 2
    Bitencourt-Ferreira, Gabriela
    da Silva, Amauri Duarte
    de Azevedo Jr, Walter Filgueira
    [J]. CURRENT MEDICINAL CHEMISTRY, 2021, 28 (02) : 253 - 265
  • [20] Bitencourt-Ferreira G, 2019, METHODS MOL BIOL, V2053, P275, DOI 10.1007/978-1-4939-9752-7_17