Identification of drug-target interactions via multiple information integration

被引:179
作者
Ding, Yijie [1 ,2 ]
Tang, Jijun [1 ,2 ,3 ]
Guo, Fei [1 ,2 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300350, Peoples R China
[2] Tianjin Univ, Inst Computat Biol, Tianjin 300350, Peoples R China
[3] Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
基金
美国国家科学基金会;
关键词
Drug-target interaction; Molecular fingerprint; Multivariate mutual information; Feature selection; Support vector machine; PREDICTION; SELECTION;
D O I
10.1016/j.ins.2017.08.045
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Identifying Drug-Target Interactions (DTIs) is an important process in drug discovery. Traditional experimental methods are expensive and time-consuming for detecting DTIs. Therefore, computational approaches provide many effective strategies to deal with this issue. In recent years, most of computational methods only use the information of drug-drug similarity or target-target similarity, which cannot perfectly capture all characteristics to identify DTIs. In this paper, we propose a novel computational model of DTIs prediction, based on machine learning methods. To improve the performance of prediction, we further use molecular substructure fingerprints, Multivariate Mutual Information (MMI) of proteins and network topology to represent drugs, targets and relationship between them. Moreover, we employ Support Vector Machine (SVM) and Feature Selection (FS) to construct model for predicting DTIs. Experiments of evaluation show that proposed approach achieves better results than other outstanding methods for feature-based DTIs prediction. The proposed approach achieves AUPRs of 0.899, 0.929, 0.821 and 0.655 on Enzyme, Ion Channel (IC), GPCR and Nuclear Receptor datasets, respectively. Compared with existing best methods, AUPRs are increased by 0.016 on Ion Channel datasets. In addition, our method obtains the second best performance on GPCR and Enzyme datasets. The source code and all datasets are available at https://figshare.com/s/53bf5a6065f3911d46f6. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:546 / 560
页数:15
相关论文
共 35 条
[1]  
Alhamdoosh M., 2016, BIORXIV, P1
[2]   Fast decorrelated neural network ensembles with random weights [J].
Alhamdoosh, Monther ;
Wang, Dianhui .
INFORMATION SCIENCES, 2014, 264 :104-117
[3]   Supervised prediction of drug-target interactions using bipartite local models [J].
Bleakley, Kevin ;
Yamanishi, Yoshihiro .
BIOINFORMATICS, 2009, 25 (18) :2397-2403
[4]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[5]   Computational Prediction of Drug-Target Interactions Using Chemical, Biological, and Network Features [J].
Cao, Dong-Sheng ;
Zhang, Liu-Xia ;
Tan, Gui-Shan ;
Xiang, Zheng ;
Zeng, Wen-Bin ;
Xu, Qing-Song ;
Chen, Alex F. .
MOLECULAR INFORMATICS, 2014, 33 (10) :669-681
[6]   Large-scale prediction of drug-target interactions using protein sequences and drug topological structures [J].
Cao, Dong-Sheng ;
Liu, Shao ;
Xu, Qing-Song ;
Lu, Hong-Mei ;
Huang, Jian-Hua ;
Hu, Qian-Nan ;
Liang, Yi-Zeng .
ANALYTICA CHIMICA ACTA, 2012, 752 :1-10
[7]   Information theory of quantum entanglement and measurement [J].
Cerf, NJ ;
Adami, C .
PHYSICA D-NONLINEAR PHENOMENA, 1998, 120 (1-2) :62-81
[8]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[9]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[10]   Predicting protein-protein interactions via multivariate mutual information of protein sequences [J].
Ding, Yijie ;
Tang, Jijun ;
Guo, Fei .
BMC BIOINFORMATICS, 2016, 17