A learning-based method for drug-target interaction prediction based on feature representation learning and deep neural network

被引:92
作者
Peng, Jiajie [1 ,2 ]
Li, Jingyi [1 ,2 ]
Shang, Xuequn [1 ,2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Key Lab Big Data Storage Management, Minist Ind & Informat Technol, Xian 710072, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
DTIs prediction; Convolutional neural network; Feature representation learning; PROTEIN; PHARMACOLOGY;
D O I
10.1186/s12859-020-03677-1
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundDrug-target interaction prediction is of great significance for narrowing down the scope of candidate medications, and thus is a vital step in drug discovery. Because of the particularity of biochemical experiments, the development of new drugs is not only costly, but also time-consuming. Therefore, the computational prediction of drug target interactions has become an essential way in the process of drug discovery, aiming to greatly reducing the experimental cost and time.ResultsWe propose a learning-based method based on feature representation learning and deep neural network named DTI-CNN to predict the drug-target interactions. We first extract the relevant features of drugs and proteins from heterogeneous networks by using the Jaccard similarity coefficient and restart random walk model. Then, we adopt a denoising autoencoder model to reduce the dimension and identify the essential features. Third, based on the features obtained from last step, we constructed a convolutional neural network model to predict the interaction between drugs and proteins. The evaluation results show that the average AUROC score and AUPR score of DTI-CNN were 0.9416 and 0.9499, which obtains better performance than the other three existing state-of-the-art methods.ConclusionsAll the experimental results show that the performance of DTI-CNN is better than that of the three existing methods and the proposed method is appropriately designed.
引用
收藏
页数:13
相关论文
共 45 条
[1]   MEAN SQUARE ERROR OF PREDICTION AS A CRITERION FOR SELECTING VARIABLES [J].
ALLEN, DM .
TECHNOMETRICS, 1971, 13 (03) :469-&
[2]  
[Anonymous], 2008, P 25 INT C MACH LEAR
[3]   A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval [J].
Arun, K. S. ;
Govindan, V. K. .
DATA SCIENCE AND ENGINEERING, 2018, 3 (02) :166-195
[4]   Prediction of Drug-Target Interactions and Drug Repositioning via Network-Based Inference [J].
Cheng, Feixiong ;
Liu, Chuang ;
Jiang, Jing ;
Lu, Weiqiang ;
Li, Weihua ;
Liu, Guixia ;
Zhou, Weixing ;
Huang, Jin ;
Tang, Yun .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (05)
[5]  
Clevert D.-A., 2015, FAST ACCURATE DEEP N
[6]   The Comparative Toxicogenomics Database: update 2013 [J].
Davis, Allan Peter ;
Murphy, Cynthia Grondin ;
Johnson, Robin ;
Lay, Jean M. ;
Lennon-Hopkins, Kelley ;
Saraceni-Richards, Cynthia ;
Sciaky, Daniela ;
King, Benjamin L. ;
Rosenstein, Michael C. ;
Wiegers, Thomas C. ;
Mattingly, Carolyn J. .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D1104-D1114
[7]   HADDOCK: A protein-protein docking approach based on biochemical or biophysical information [J].
Dominguez, C ;
Boelens, R ;
Bonvin, AMJJ .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2003, 125 (07) :1731-1737
[8]   Drug Target Protein-Protein Interaction Networks: A Systematic Perspective [J].
Feng, Yanghe ;
Wang, Qi ;
Wang, Tengjiao .
BIOMED RESEARCH INTERNATIONAL, 2017, 2017
[9]   SINGULAR VALUE DECOMPOSITION AND LEAST SQUARES SOLUTIONS [J].
GOLUB, GH ;
REINSCH, C .
NUMERISCHE MATHEMATIK, 1970, 14 (05) :403-&
[10]   MIND-BEST: Web Server for Drugs and Target Discovery; Design, Synthesis, and Assay of MAO-B Inhibitors and Theoretical-Experimental Study of G3PDH Protein from Trichomonas gallinae [J].
Gonzalez-Diaz, Humberto ;
Prado-Prado, Francisco ;
Garcia-Mera, Xerardo ;
Alonso, Nerea ;
Abeijon, Paula ;
Caamano, Olga ;
Yanez, Matilde ;
Munteanu, Cristian R. ;
Pazos, Alejandro ;
Auxiliadora Dea-Ayuela, Maria ;
Teresa Gomez-Munoz, Maria ;
Magdalena Garijo, M. ;
Sansano, Jose ;
Ubeira, Florencio M. .
JOURNAL OF PROTEOME RESEARCH, 2011, 10 (04) :1698-1718