DNN-DTIs: Improved drug-target interactions prediction using XGBoost feature selection and deep neural network

被引:56
作者
Chen, Cheng [1 ,2 ,3 ]
Shi, Han [4 ]
Jiang, Zhiwen [1 ,2 ]
Salhi, Adil [5 ]
Chen, Ruixin [1 ,2 ]
Cui, Xuefeng [3 ]
Yu, Bin [1 ,2 ,6 ]
机构
[1] Qingdao Univ Sci & Technol, Coll Math & Phys, Qingdao 266061, Peoples R China
[2] Qingdao Univ Sci & Technol, Artificial Intelligence & Biomed Big Data Res Ctr, Qingdao 266061, Peoples R China
[3] Shandong Univ, Sch Comp Sci & Technol, Qingdao 266237, Peoples R China
[4] Chinese Acad Sci, CAS Ctr Excellence Mol Plant Sci, Key Lab Synthet Biol, Shanghai 200032, Peoples R China
[5] King Abdullah Univ Sci & Technol KAUST, Computat Biosci Res Ctr CBRC, Thuwal 23955, Saudi Arabia
[6] Key Lab Computat Sci & Applicat Hainan Prov, Haikou 571158, Hainan, Peoples R China
基金
中国国家自然科学基金;
关键词
Drug-target interactions; Multi-information fusion; XGBoost; Deep neural network; INFORMATION; INTEGRATION;
D O I
10.1016/j.compbiomed.2021.104676
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Analysis and prediction of drug-target interactions (DTIs) play an important role in understanding drug mechanisms, as well as drug repositioning and design. Machine learning (ML)-based methods for DTIs prediction can mitigate the shortcomings of time-consuming and labor-intensive experimental approaches, while providing new ideas and insights for drug design. We propose a novel pipeline for predicting drug-target interactions, called DNN-DTIs. First, the target information is characterized by a number of features, namely, pseudo-amino acid composition, pseudo position-specific scoring matrix, conjoint triad composition, transition and distribution, Moreau-Broto autocorrelation, and structural features. The drug compounds are subsequently encoded using substructure fingerprints. Next, eXtreme gradient boosting (XGBoost) is used to determine the subset of non redundant features of importance. The optimal balanced set of sample vectors is obtained by applying the synthetic minority oversampling technique (SMOTE). Finally, a DTIs predictor, DNN-DTIs, is developed based on a deep neural network (DNN) via a layer-by-layer learning scheme. Experimental results indicate that DNN-DTIs achieves better performance than other state-of-the-art predictors with ACC values of 98.78%, 98.60%, 97.98%, 98.24% and 98.00% on Enzyme, Ion Channels (IC), GPCR, Nuclear Receptors (NR) and Kuang's datasets. Therefore, the accurate prediction performance of DNN-DTIs makes it a favored choice for contributing to the study of DTIs, especially drug repositioning.
引用
收藏
页数:14
相关论文
共 81 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Long short-term memory [J].
Hochreiter, S ;
Schmidhuber, J .
NEURAL COMPUTATION, 1997, 9 (08) :1735-1780
[3]   Pindolol augmentation of antidepressant response [J].
Artigas, F ;
Adell, A ;
Celada, P .
CURRENT DRUG TARGETS, 2006, 7 (02) :139-147
[4]   HIV antiretroviral drugs, dolutegravir, maraviroc and ritonavir-boosted atazanavir use different pathways to affect inflammation, senescence and insulin sensitivity in human coronary endothelial cells [J].
Auclair, Martine ;
Guenantin, Anne-Claire ;
Fellahi, Soraya ;
Garcia, Marie ;
Capeau, Jacqueline .
PLOS ONE, 2020, 15 (01)
[5]   Machine learning approaches and databases for prediction of drug-target interaction: a survey paper [J].
Bagherian, Maryam ;
Sabeti, Elyas ;
Wang, Kai ;
Sartor, Maureen A. ;
Nikolovska-Coleska, Zaneta ;
Najarian, Kayvan .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (01) :247-269
[6]   The ChEMBL bioactivity database: an update [J].
Bento, A. Patricia ;
Gaulton, Anna ;
Hersey, Anne ;
Bellis, Louisa J. ;
Chambers, Jon ;
Davies, Mark ;
Krueger, Felix A. ;
Light, Yvonne ;
Mak, Lora ;
McGlinchey, Shaun ;
Nowotka, Michal ;
Papadatos, George ;
Santos, Rita ;
Overington, John P. .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D1083-D1090
[7]   Progestogens in menopausal hormone therapy [J].
Binkowska, Malgorzata ;
Woron, Jaroslaw .
MENOPAUSE REVIEW-PRZEGLAD MENOPAUZALNY, 2015, 14 (02) :134-143
[8]  
Blume-Peytavi Ulrike, 2007, J Dtsch Dermatol Ges, V5, P391, DOI 10.1111/j.1610-0387.2007.06295.x
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   Large-scale prediction of drug-target interactions using protein sequences and drug topological structures [J].
Cao, Dong-Sheng ;
Liu, Shao ;
Xu, Qing-Song ;
Lu, Hong-Mei ;
Huang, Jian-Hua ;
Hu, Qian-Nan ;
Liang, Yi-Zeng .
ANALYTICA CHIMICA ACTA, 2012, 752 :1-10