Protein-ligand binding affinity prediction exploiting sequence constituent homology

被引:1
作者
Abdel-Rehim, Abbi [1 ,7 ]
Orhobor, Oghenejokpeme [2 ]
Hang, Lou [3 ]
Ni, Hao [3 ,4 ]
King, Ross D. [1 ,4 ,5 ,6 ]
机构
[1] Univ Cambridge, Dept Chem Engn & Biotechnol, Cambridge CB3 0AS, England
[2] Natl Inst Agr Bot, Cambridge CB3 0LE, England
[3] UCL, Dept Math, London WC1H 0AY, England
[4] Alan Turing Inst, London NW1 2DB, England
[5] Chalmers Univ Technol, Dept Biol & Biol Engn, S-41296 Gothenburg, Sweden
[6] Chalmers Univ Technol, Dept Comp Sci & Engn, S-41296 Gothenburg, Sweden
[7] Univ Cambridge, Dept Chem Engn & Biotechnol, West Cambridge Site,Philippa Fawcett Dr, Cambridge CB3 0AS, England
基金
英国工程与自然科学研究理事会;
关键词
SCORING FUNCTIONS;
D O I
10.1093/bioinformatics/btad502
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Molecular docking is a commonly used approach for estimating binding conformations and their resultant binding affinities. Machine learning has been successfully deployed to enhance such affinity estimations. Many methods of varying complexity have been developed making use of some or all the spatial and categorical information available in these structures. The evaluation of such methods has mainly been carried out using datasets from PDBbind. Particularly the Comparative Assessment of Scoring Functions (CASF) 2007, 2013, and 2016 datasets with dedicated test sets. This work demonstrates that only a small number of simple descriptors is necessary to efficiently estimate binding affinity for these complexes without the need to know the exact binding conformation of a ligand.Results The developed approach of using a small number of ligand and protein descriptors in conjunction with gradient boosting trees demonstrates high performance on the CASF datasets. This includes the commonly used benchmark CASF2016 where it appears to perform better than any other approach. This methodology is also useful for datasets where the spatial relationship between the ligand and protein is unknown as demonstrated using a large ChEMBL-derived dataset.Availability and implementation Code and data uploaded to https://github.com/abbiAR/PLBAffinity.
引用
收藏
页数:4
相关论文
共 50 条
  • [31] An ensemble-based approach to estimate confidence of predicted protein-ligand binding affinity values
    Rayka, Milad
    Mirzaei, Morteza
    Latifi, Ali Mohammad
    MOLECULAR INFORMATICS, 2024, 43 (04)
  • [32] Development of a protein-ligand extended connectivity (PLEC) fingerprint and its application for binding affinity predictions
    Wojcikowski, Maciej
    Kukielka, Michal
    Stepniewska-Dziubinska, Marta M.
    Siedlecki, Pawel
    BIOINFORMATICS, 2019, 35 (08) : 1334 - 1341
  • [33] The Impact of Crystallographic Data for the Development of Machine Learning Models to Predict Protein-Ligand Binding Affinity
    Veit-Acosta, Martina
    de Azevedo Junior, Walter Filgueira
    CURRENT MEDICINAL CHEMISTRY, 2021, 28 (34) : 7006 - 7022
  • [34] Advances in Protein-Ligand Binding Affinity Prediction via Deep Learning: A Comprehensive Study of Datasets, Data Preprocessing Techniques, and Model Architectures
    Abdelkader, Gelany Aly
    Kim, Jeong-Dong
    CURRENT DRUG TARGETS, 2024, 25 (15) : 1041 - 1065
  • [35] Modern machine-learning for binding affinity estimation of protein-ligand complexes: Progress, opportunities, and challenges
    Harren, Tobias
    Gutermuth, Torben
    Grebner, Christoph
    Hessler, Gerhard
    Rarey, Matthias
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2024, 14 (03)
  • [36] Development of a graph convolutional neural network model for efficient prediction of protein-ligand binding affinities
    Son, Jeongtae
    Kim, Dongsup
    PLOS ONE, 2021, 16 (04):
  • [37] COACH-D: improved protein-ligand binding sites prediction with refined ligand-binding poses through molecular docking
    Wu, Qi
    Peng, Zhenling
    Zhang, Yang
    Yang, Jianyi
    NUCLEIC ACIDS RESEARCH, 2018, 46 (W1) : W438 - W442
  • [38] A comparative study of family-specific protein-ligand complex affinity prediction based on random forest approach
    Wang, Yu
    Guo, Yanzhi
    Kuang, Qifan
    Pu, Xuemei
    Ji, Yue
    Zhang, Zhihang
    Li, Menglong
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2015, 29 (04) : 349 - 360
  • [39] Recent advances in computational and experimental protein-ligand affinity determination techniques
    Kairys, Visvaldas
    Baranauskiene, Lina
    Kazlauskiene, Migle
    Zubriene, Asta
    Petrauskas, Vytautas
    Matulis, Daumantas
    Kazlauskas, Egidijus
    EXPERT OPINION ON DRUG DISCOVERY, 2024, 19 (06) : 649 - 670
  • [40] A comprehensive examination of the contributions to the binding entropy of protein-ligand complexes
    Singh, Nidhi
    Warshel, Arieh
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2010, 78 (07) : 1724 - 1735