Scoring multiple features to predict drug disease associations using information fusion and aggregation

被引:25
作者
Moghadam, H. [1 ]
Rahgozar, M. [1 ]
Gharaghani, S. [2 ]
机构
[1] Univ Tehran, Sch Elect & Comp Engn, Coll Engn, DBRG,CIPCE, Tehran, Iran
[2] Univ Tehran, Inst Biochem & Biophys, LBD, Tehran, Iran
关键词
Data fusion; multiple features; drug-disease interaction; kernel fusion; support vector machine; HUMAN GENES; KNOWLEDGEBASE; SIMILARITY; INFERENCE; MODELS;
D O I
10.1080/1062936X.2016.1209241
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Prediction of drug-disease associations is one of the current fields in drug repositioning that has turned into a challenging topic in pharmaceutical science. Several available computational methods use network-based and machine learning approaches to reposition old drugs for new indications. However, they often ignore features of drugs and diseases as well as the priority and importance of each feature, relation, or interactions between features and the degree of uncertainty. When predicting unknown drug-disease interactions there are diverse data sources and multiple features available that can provide more accurate and reliable results. This information can be collectively mined using data fusion methods and aggregation operators. Therefore, we can use the feature fusion method to make high-level features. We have proposed a computational method named scored mean kernel fusion (SMKF), which uses a new method to score the average aggregation operator called scored mean. To predict novel drug indications, this method systematically combines multiple features related to drugs or diseases at two levels: the drug-drug level and the drug-disease level. The purpose of this study was to investigate the effect of drug and disease features as well as data fusion to predict drug-disease interactions. The method was validated against a well-established drug-disease gold-standard dataset. When compared with the available methods, our proposed method outperformed them and competed well in performance with area under cover (AUC) of 0.91, F-measure of 84.9% and Matthews correlation coefficient of 70.31%.
引用
收藏
页码:609 / 628
页数:20
相关论文
共 48 条
[1]   Old drugs - new uses [J].
Aronson, J. K. .
BRITISH JOURNAL OF CLINICAL PHARMACOLOGY, 2007, 64 (05) :563-565
[2]   Drug repositioning: Identifying and developing new uses for existing drugs [J].
Ashburn, TT ;
Thor, KB .
NATURE REVIEWS DRUG DISCOVERY, 2004, 3 (08) :673-683
[3]  
Basilico J., 2004, P 27 ANN INT ACM SIG
[4]   Kernel methods for predicting protein-protein interactions [J].
Ben-Hur, A ;
Noble, WS .
BIOINFORMATICS, 2005, 21 :I38-I46
[5]   Supervised prediction of drug-target interactions using bipartite local models [J].
Bleakley, Kevin ;
Yamanishi, Yoshihiro .
BIOINFORMATICS, 2009, 25 (18) :2397-2403
[6]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[7]   A Review of Data Fusion Techniques [J].
Castanedo, Federico .
SCIENTIFIC WORLD JOURNAL, 2013,
[8]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[9]   Prediction of human genes and diseases targeted by xenobiotics using predictive toxicogenomic-derived models (PTDMs) [J].
Cheng, Feixiong ;
Li, Weihua ;
Zhou, Yadi ;
Li, Jie ;
Shen, Jie ;
Lee, Philip W. ;
Tang, Yun .
MOLECULAR BIOSYSTEMS, 2013, 9 (06) :1316-1325
[10]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411