DEELIG: A Deep Learning Approach to Predict Protein-Ligand Binding Affinity

被引:35
作者
Ahmed, Asad [1 ]
Mam, Bhavika [2 ,3 ]
Sowdhamini, Ramanathan [2 ]
机构
[1] Natl Inst Technol Warangal, Warangal, Andhra Prades, India
[2] Tata Inst Fundamental Res, Natl Ctr Biol Sci, GKVK Campus, Bangalore 560065, Karnataka, India
[3] Univ Trans Disciplinary Hlth Sci & Technol TDU, Bangalore, Karnataka, India
关键词
Binding affinity; protein-ligand binding; supervised learning; convolutional neural networks; deep learning; PDB; drug discovery; PDBBIND DATABASE; COLLECTION; FAMILY; C(7); MOAD; PDB;
D O I
10.1177/11779322211030364
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein-ligand binding prediction has extensive biological significance. Binding affinity helps in understanding the degree of protein-ligand interactions and is a useful measure in drug design. Protein-ligand docking using virtual screening and molecular dynamic simulations are required to predict the binding affinity of a ligand to its cognate receptor. Performing such analyses to cover the entire chemical space of small molecules requires intense computational power. Recent developments using deep learning have enabled us to make sense of massive amounts of complex data sets where the ability of the model to "learn" intrinsic patterns in a complex plane of data is the strength of the approach. Here, we have incorporated convolutional neural networks to find spatial relationships among data to help us predict affinity of binding of proteins in whole superfamilies toward a diverse set of ligands without the need of a docked pose or complex as user input. The models were trained and validated using a stringent methodology for feature extraction. Our model performs better in comparison to some existing methods used widely and is suitable for predictions on high-resolution protein crystal (<= 2.5 angstrom) and nonpeptide ligand as individual inputs. Our approach to network construction and training on protein-ligand data set prepared in-house has yielded significant insights. We have also tested DEELIG on few COVID-19 main protease-inhibitor complexes relevant to the current public health scenario. DEELIG-based predictions can be incorporated in existing databases including RSCB PDB, PDBMoad, and PDBbind in filling missing binding affinity data for protein-ligand complexes.
引用
收藏
页数:9
相关论文
共 40 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
[Anonymous], 2020, Schrodinger Release 2020-1: Protein Preparation Wizard
[3]   Binding MOAD, a high-quality protein-ligand database [J].
Benson, Mark L. ;
Smith, Richard D. ;
Khazanov, Nickolay A. ;
Dimcheff, Brandon ;
Beaver, John ;
Dresslar, Peter ;
Nerothin, Jason ;
Carlson, Heather A. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D674-D678
[4]   Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening [J].
Cang, Zixuan ;
Mu, Lin ;
Wei, Guo-Wei .
PLOS COMPUTATIONAL BIOLOGY, 2018, 14 (01)
[5]   DLIGAND2: an improved knowledge-based energy function for protein-ligand interactions using the distance-scaled, finite, ideal-gas reference state [J].
Chen, Pin ;
Ke, Yaobin ;
Lu, Yutong ;
Du, Yunfei ;
Li, Jiahui ;
Yan, Hui ;
Zhao, Huiying ;
Zhou, Yaoqi ;
Yang, Yuedong .
JOURNAL OF CHEMINFORMATICS, 2019, 11 (01)
[6]  
Dahl GE, 2013, INT CONF ACOUST SPEE, P8609, DOI 10.1109/ICASSP.2013.6639346
[7]   sc-PDB: a 3D-database of ligandable binding sites-10 years on [J].
Desaphy, Jeremy ;
Bret, Guillaume ;
Rognan, Didier ;
Kellenberger, Esther .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D399-D404
[8]   AGL-Score: Algebraic Graph Learning Score for Protein-Ligand Binding Scoring, Ranking, Docking, and Screening [J].
Duc Duy Nguyen ;
Wei, Guo-Wei .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (07) :3291-3304
[9]   The MM/PBSA and MM/GBSA methods to estimate ligand-binding affinities [J].
Genheden, Samuel ;
Ryde, Ulf .
EXPERT OPINION ON DRUG DISCOVERY, 2015, 10 (05) :449-461
[10]   SimBoost: a read-across approach for predicting drug-target binding affinities using gradient boosting machines [J].
He, Tong ;
Heidemeyer, Marten ;
Ban, Fuqiang ;
Cherkasov, Artem ;
Ester, Martin .
JOURNAL OF CHEMINFORMATICS, 2017, 9