Predicting drug target interactions using meta-path-based semantic network analysis

被引:95
作者
Fu, Gang [1 ]
Ding, Ying [2 ,3 ]
Seal, Abhik [2 ]
Chen, Bin [4 ]
Sun, Yizhou [5 ]
Bolton, Evan [1 ]
机构
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, 8600 Rockville Pike, Bethesda, MD USA
[2] Indiana Univ, Sch Informat & Comp, 107 S Indiana Ave, Bloomington, IN USA
[3] Wuhan Univ, Sch Informat Management, Wuhan 430072, Hubei, Peoples R China
[4] Stanford Univ, Dept Med, 450 Serra Mall, Stanford, CA 94305 USA
[5] Northeastern Univ, Coll Comp & Informat Sci, 360 Huntington Ave, Boston, MA 02115 USA
来源
BMC BIOINFORMATICS | 2016年 / 17卷
基金
美国国家科学基金会;
关键词
Semantic network analysis; Link prediction; Meta-path topological feature; Machine learning; Random forest; SYSTEMS CHEMICAL BIOLOGY; PUBCHEM; SUBSTANCE; COMPOUND;
D O I
10.1186/s12859-016-1005-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: In the context of drug discovery, drug target interactions (DTIs) can be predicted based on observed topological features of a semantic network across the chemical and biological space. In a semantic network, the types of the nodes and links are different. In order to take into account the heterogeneity of the semantic network, meta-path-based topological patterns were investigated for link prediction. Results: Supervised machine learning models were constructed based on meta-path topological features of an enriched semantic network, which was derived from Chem2Bio2RDF, and was expanded by adding compound and protein similarity neighboring links obtained from the PubChem databases. The additional semantic links significantly improved the predictive performance of the supervised learning models. The binary classification model built upon the enriched feature space using the Random Forest algorithm significantly outperformed an existing semantic link prediction algorithm, Semantic Link Association Prediction (SLAP), to predict unknown links between compounds and protein targets in an evolving network. In addition to link prediction, Random Forest also has an intrinsic feature ranking algorithm, which can be used to select the important topological features that contribute to link prediction. Conclusions: The proposed framework has been demonstrated as a powerful alternative to SLAP in order to predict DTIs using the semantic network that integrates chemical, pharmacological, genomic, biological, functional, and biomedical information into a unified framework. It offers the flexibility to enrich the feature space by using different normalization processes on the topological features, and it can perform model construction and feature selection at the same time.
引用
收藏
页数:10
相关论文
共 31 条
[1]  
[Anonymous], 2005, ACM SIGKDD EXPLOR NE
[2]  
[Anonymous], 2012, P 5 ACM INT C WEB SE, DOI DOI 10.1145/2124295.2124373
[3]   Chemogenomics: An emerging strategy for rapid target and drug discovery [J].
Bredel, M ;
Jacoby, E .
NATURE REVIEWS GENETICS, 2004, 5 (04) :262-275
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]  
Callahan Alison, 2013, Semantic Web: Semantics and Big Data. Proceedings of 10th International Conference (ESWC 2013): LNCS 7882, P200
[6]   Assessing Drug Target Association Using Semantic Linked Data [J].
Chen, Bin ;
Ding, Ying ;
Wild, David J. .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (07)
[7]   Chem2Bio2RDF: a semantic framework for linking and data mining chemogenomic and systems chemical biology data [J].
Chen, Bin ;
Dong, Xiao ;
Jiao, Dazhi ;
Wang, Huijun ;
Zhu, Qian ;
Ding, Ying ;
Wild, David J. .
BMC BIOINFORMATICS, 2010, 11
[8]   Prediction of Polypharmacological Profiles of Drugs by the Integration of Chemical, Side Effect, and Therapeutic Space [J].
Cheng, Feixiong ;
Li, Weihua ;
Wu, Zengrui ;
Wang, Xichuan ;
Zhang, Chen ;
Li, Jie ;
Liu, Guixia ;
Tang, Yun .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (04) :753-762
[9]   Prediction of Chemical-Protein Interactions Network with Weighted Network-Based Inference Method [J].
Cheng, Feixiong ;
Zhou, Yadi ;
Li, Weihua ;
Liu, Guixia ;
Tang, Yun .
PLOS ONE, 2012, 7 (07)
[10]   Prediction of Drug-Target Interactions and Drug Repositioning via Network-Based Inference [J].
Cheng, Feixiong ;
Liu, Chuang ;
Jiang, Jing ;
Lu, Weiqiang ;
Li, Weihua ;
Liu, Guixia ;
Zhou, Weixing ;
Huang, Jin ;
Tang, Yun .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (05)