CFSBoost: Cumulative feature subspace boosting for drug-target interaction prediction

Cited by: 20
Authors
Rayhan, Farshid [1]
Ahmed, Sajid [1]
Farid, Dewan Md [1]
Dehzangi, Abdollah [2]
Shatabda, Swakkhar [1]
Affiliations
[1] United Int Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Morgan State Univ, Dept Comp Sci, Baltimore, MD 21239 USA
Keywords
Class imbalance; Drug-target; Classification; Ensemble classifier; Feature grouping; Boosting; DIVERSITY-ORIENTED SYNTHESIS; PROTEIN SEQUENCES; EVOLUTIONARY; INTEGRATION;
DOI
10.1016/j.jtbi.2018.12.024
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
Experimental identification of drug-target interactions is labor-intensive and expensive, which has motivated researchers to focus on in silico prediction to provide information on potential interactions. In recent years, researchers have proposed several computational approaches for predicting new drug-target interactions. In this paper, we present CFSBoost, a simple and computationally cheap ensemble boosting classification model for identification and prediction of drug-target interactions using evolutionary and structural features. CFSBoost uses a simple yet novel feature group selection procedure, which keeps the model computationally cheap while still achieving state-of-the-art performance. The ensemble model uses extra trees as weak learners inside a boosting scheme while holding on to the best model per iteration. We tested our method on four benchmark datasets, which are also referred to as gold standard datasets. Our method achieved a better score in terms of area under the receiver operating characteristic curve (auROC) on 2 out of the 4 datasets. It also achieved a higher area under the precision-recall curve (auPR) on 3 out of the 4 datasets. Researchers have argued that auPR is more suitable than auROC for comparing performance on imbalanced datasets such as our benchmark datasets. Our reported results show that, despite its simplicity in design, CFSBoost performs very satisfactorily compared to other methods in the literature. We also provide 5 new possible interactions for each dataset based on CFSBoost's prediction scores. (C) 2018 Elsevier Ltd. All rights reserved.
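The abstract's remark that auPR is more informative than auROC on imbalanced data can be illustrated with a small, self-contained sketch. The toy labels and scores below are invented for illustration only and are not taken from the paper or its benchmark datasets; the helper names `au_roc` and `average_precision` are hypothetical, and average precision is used here as one common step-wise estimate of the area under the precision-recall curve.

```python
def au_roc(labels, scores):
    """Area under the ROC curve via the pairwise (Mann-Whitney) identity."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    # Fraction of positive/negative pairs ranked correctly; ties count as half.
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Mean of the precision values measured at the rank of each positive."""
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    return total / hits


# Toy imbalanced set (made up): 95 non-interacting pairs, 5 interacting pairs.
neg_scores = [i / 100 for i in range(95)]          # 0.00 ... 0.94
pos_scores = [0.905, 0.915, 0.925, 0.935, 0.945]   # interleaved with the top negatives
labels = [0] * len(neg_scores) + [1] * len(pos_scores)
scores = neg_scores + pos_scores

roc = au_roc(labels, scores)
ap = average_precision(labels, scores)
print(f"auROC = {roc:.3f}, average precision = {ap:.3f}")
```

In this toy setting only four negatives outrank any positive, so auROC stays close to 1, while average precision drops noticeably because those few false positives sit among a very small number of true positives. That gap is the kind of effect that motivates reporting auPR alongside auROC on imbalanced benchmarks.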
Pages: 1-8
Number of pages: 8