Predicting drug-target interaction network using deep learning model

被引:84
作者
You, Jiaying [1 ,2 ,3 ]
McLeod, Robert D. [2 ]
Hu, Pingzhao [1 ,2 ,3 ,4 ]
机构
[1] Univ Manitoba, Dept Biochem & Med Genet, Room 308 Basic Med Sci Bldg,745 Bannatyne Ave, Winnipeg, MB R3E 0J9, Canada
[2] Univ Manitoba, Dept Elect & Comp Engn, Winnipeg, MB, Canada
[3] Univ Manitoba, George & Fay Yee Ctr Healthcare Innovat, Winnipeg, MB, Canada
[4] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Deep learning; Drug repurposing; Feature integration; Drug-target interaction; LASSO models; GENOME-WIDE ASSOCIATION; PROTEIN;
D O I
10.1016/j.compbiolchem.2019.03.016
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Traditional methods for drug discovery are time-consuming and expensive, so efforts are being made to repurpose existing drugs. To find new ways for drug repurposing, many computational approaches have been proposed to predict drug-target interactions (DTIs). However, due to the high-dimensional nature of the data sets extracted from drugs and targets, traditional machine learning approaches, such as logistic regression analysis, cannot analyze these data sets efficiently. To overcome this issue, we propose LASSO (Least absolute shrinkage and selection operator)-based regularized linear classification models and a LASSO-DNN (Deep Neural Network) model based on LASSO feature selection to predict DTIs. These methods are demonstrated for re-purposing drugs for breast cancer treatment. Methods: We collected drug descriptors, protein sequence data from Drugbank and protein domain information from NCBI. Validated DTIs were downloaded from Drugbank. A new similarity-based approach was developed to build the negative DTIs. We proposed multiple LASSO models to integrate different combinations of feature sets to explore the prediction power and predict DTIs. Furthermore, building on the features extracted from the LASSO models with the best performance, we also introduced a LASSO-DNN model to predict DTIs. The performance of our newly proposed DNN model (LASSO-DNN) was compared with the LASSO, standard logistic (SLG) regression, support vector machine (SVM), and standard DNN models. Results: Experimental results showed that the LASSO-DNN over performed the SLG, LASSO, SVM and standard DNN models. In particular, the LASSO models with protein tripeptide composition (TC) features and domain features were superior to those that contained other protein information, which may imply that TC and domain information could be better representations of proteins. Furthermore, we showed that the top ranked DTIs predicted using the LASSO-DNN model can potentially be used for repurposing existing drugs for breast cancer based on risk gene information. Conclusions: In summary, we demonstrated that the efficient representations of drug and target features are key for building learning models for predicting DTIs. The disease-associated risk genes identified from large-scale genomic studies are the potential drug targets, which can be used for drug repurposing.
引用
收藏
页码:90 / 101
页数:12
相关论文
共 41 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], [No title captured]
[3]  
[Anonymous], INT J ENDOCRINOL
[4]  
[Anonymous], THE NATIONAL ACADEMI
[5]   Drug repositioning: Identifying and developing new uses for existing drugs [J].
Ashburn, TT ;
Thor, KB .
NATURE REVIEWS DRUG DISCOVERY, 2004, 3 (08) :673-683
[6]   Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations? [J].
Bajusz, David ;
Racz, Anita ;
Heberger, Kroly .
JOURNAL OF CHEMINFORMATICS, 2015, 7
[7]   Capture Hi-C identifies putative target genes at 33 breast cancer risk loci [J].
Baxter, Joseph S. ;
Leavy, Olivia C. ;
Dryden, Nicola H. ;
Maguire, Sarah ;
Johnson, Nichola ;
Fedele, Vita ;
Simigdala, Nikiana ;
Martin, Lesley-Ann ;
Andrews, Simon ;
Wingett, Steven W. ;
Assiotis, Ioannis ;
Fenwick, Kerry ;
Chauhan, Ritika ;
Rust, Alistair G. ;
Orr, Nick ;
Dudbridge, Frank ;
Haider, Syed ;
Fletcher, Olivia .
NATURE COMMUNICATIONS, 2018, 9
[8]   Hyperforin as a possible antidepressant component of hypericum extracts [J].
Chatterjee, SS ;
Bhattacharya, SK ;
Wonnemann, M ;
Singer, A ;
Müller, WE .
LIFE SCIENCES, 1998, 63 (06) :499-510
[9]   Prediction of Drug-Target Interactions and Drug Repositioning via Network-Based Inference [J].
Cheng, Feixiong ;
Liu, Chuang ;
Jiang, Jing ;
Lu, Weiqiang ;
Li, Weihua ;
Liu, Guixia ;
Zhou, Weixing ;
Huang, Jin ;
Tang, Yun .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (05)
[10]   Adenine nucleotide translocase 2 is a key mitochondrial protein in cancer metabolism [J].
Chevrollier, Arnaud ;
Loiseau, Dominique ;
Reynier, Pascal ;
Stepien, Georges .
BIOCHIMICA ET BIOPHYSICA ACTA-BIOENERGETICS, 2011, 1807 (06) :562-567