DeepDTA: deep drug-target binding affinity prediction

被引:895
作者
Ozturk, Hakime [1 ]
Ozgur, Arzucan [1 ]
Ozkirimli, Elif [2 ]
机构
[1] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
[2] Bogazici Univ, Dept Chem Engn, TR-34342 Istanbul, Turkey
关键词
NEURAL-NETWORKS; QUALITY; KERNELS; MODEL;
D O I
10.1093/bioinformatics/bty593
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The identification of novel drug-target (DT) interactions is a substantial part of the drug discovery process. Most of the computational methods that have been proposed to predict DT interactions have focused on binary classification, where the goal is to determine whether a DT pair interacts or not. However, protein-ligand interactions assume a continuum of binding strength values, also called binding affinity and predicting this value still remains a challenge. The increase in the affinity data available in DT knowledge-bases allows the use of advanced learning techniques such as deep learning architectures in the prediction of binding affinities. In this study, we propose a deep-learning based model that uses only sequence information of both targets and drugs to predict DT interaction binding affinities. The few studies that focus on DT binding affinity prediction use either 3D structures of protein-ligand complexes or 2D features of compounds. One novel approach used in this work is the modeling of protein sequences and compound 1D representations with convolutional neural networks (CNNs). Results: The results show that the proposed deep learning based model that uses the 1D representations of targets and drugs is an effective approach for drug target binding affinity prediction. The model in which high-level representations of a drug and a target are constructed via CNNs achieved the best Concordance Index (CI) performance in one of our larger benchmark datasets, outperforming the KronRLS algorithm and SimBoost, a state-of-the-art method for DT binding affinity prediction.
引用
收藏
页码:821 / 829
页数:9
相关论文
共 63 条
[1]  
Abadi M., 2016, TENSORFLOW LARGESCAL
[2]  
[Anonymous], 2014, NIPS WORKSH HIGH EN
[3]  
Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkh131, 10.1093/nar/gkw1099]
[4]   A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking [J].
Ballester, Pedro J. ;
Mitchell, John B. O. .
BIOINFORMATICS, 2010, 26 (09) :1169-1175
[5]   Supervised prediction of drug-target interactions using bipartite local models [J].
Bleakley, Kevin ;
Yamanishi, Yoshihiro .
BIOINFORMATICS, 2009, 25 (18) :2397-2403
[6]  
Bolton EE, 2010, ANN REP COMP CHEM, V4, P217, DOI 10.1016/S1574-1400(08)00012-1
[7]   Computational Prediction of Drug-Target Interactions Using Chemical, Biological, and Network Features [J].
Cao, Dong-Sheng ;
Zhang, Liu-Xia ;
Tan, Gui-Shan ;
Xiang, Zheng ;
Zeng, Wen-Bin ;
Xu, Qing-Song ;
Chen, Alex F. .
MOLECULAR INFORMATICS, 2014, 33 (10) :669-681
[8]   Large-scale prediction of drug-target interactions using protein sequences and drug topological structures [J].
Cao, Dong-Sheng ;
Liu, Shao ;
Xu, Qing-Song ;
Lu, Hong-Mei ;
Huang, Jian-Hua ;
Hu, Qian-Nan ;
Liang, Yi-Zeng .
ANALYTICA CHIMICA ACTA, 2012, 752 :1-10
[9]   IC50-to-Ki: a web-based tool for converting IC50 to Ki values for inhibitors of enzyme activity and ligand binding [J].
Cer, R. Z. ;
Mudunuri, U. ;
Stephens, R. ;
Lebeda, F. J. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W441-W445
[10]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794