Graph Convolutional Autoencoder and Fully-Connected Autoencoder with Attention Mechanism Based Method for Predicting Drug-Disease Associations

被引:43
作者
Xuan, Ping [1 ]
Gao, Ling [1 ]
Sheng, Nan [1 ]
Zhang, Tiangang [2 ]
Nakaguchi, Toshiya [3 ]
机构
[1] Heilongjiang Univ, Sch Comp Sci & Technol, Harbin 150080, Peoples R China
[2] Heilongjiang Univ, Sch Math Sci, Harbin 150080, Peoples R China
[3] Chiba Univ, Ctr Frontier Med Engn, Chiba 2638522, Japan
基金
中国博士后科学基金;
关键词
Drugs; Diseases; Proteins; Heterogeneous networks; Chemicals; Topology; Network topology; Prediction of drug-related diseases; graph convolutional autoencoder; fully-connected autoencoder; attribute-level attention; multiple kinds of drug similarities; MATRIX FACTORIZATION; SIMILARITY MEASURES; TARGET;
D O I
10.1109/JBHI.2020.3039502
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Predicting novel uses for approved drugs helps in reducing the costs of drug development and facilitates the development process. Most of previous methods focused on the multi-source data related to drugs and diseases to predict the candidate associations between drugs and diseases. There are multiple kinds of similarities between drugs, and these similarities reflect how similar two drugs are from the different views, whereas most of the previous methods failed to deeply integrate these similarities. In addition, the topology structures of the multiple drug-disease heterogeneous networks constructed by using the different kinds of drug similarities are not fully exploited. We therefore propose GFPred, a method based on a graph convolutional autoencoder and a fully-connected autoencoder with an attention mechanism, to predict drug-related diseases. GFPred integrates drug-disease associations, disease similarities, three kinds of drug similarities and attributes of the drug nodes. Three drug-disease heterogeneous networks are constructed based on the different kinds of drug similarities. We construct a graph convolutional autoencoder module, and integrate the attributes of the drug and disease nodes in each network to learn the topology representations of each drug node and disease node. As the different kinds of drug attributes contribute differently to the prediction of drug-disease associations, we construct an attribute-level attention mechanism. A fully-connected autoencoder module is established to learn the attribute representations of the drug and disease nodes. Finally, the original features of the drug-disease node pairs are also important auxiliary information for their association prediction. A combined strategy based on a convolutional neural network is proposed to fully integrate the topology representations, the attribute representations, and the original features of the drug-disease pairs. The ablation studies showed the contributions of data related to three types of drug attributes. Comparison with other methods confirmed that GFPred achieved better performance than several state-of-the-art prediction methods. In particular, case studies confirmed that GFPred is able to retrieve more actual drug-disease associations in the top k part of the prediction results. It is helpful for biologists to discover real associations by wet-lab experiments.
引用
收藏
页码:1793 / 1804
页数:12
相关论文
共 51 条
[1]   Drug repositioning screening identifies etravirine as a potential therapeutic for friedreich's ataxia [J].
Alfedi, Giulia ;
Luffarelli, Riccardo ;
Condo, Ivano ;
Pedini, Giorgia ;
Mannucci, Liliana ;
Massaro, Damiano S. ;
Benini, Monica ;
Toschi, Nicola ;
Alaimo, Giorgia ;
Panarello, Luca ;
Pacini, Laura ;
Fortuni, Silvia ;
Serio, Dario ;
Malisan, Florence ;
Testi, Roberto ;
Rufini, Alessandra .
MOVEMENT DISORDERS, 2019, 34 (03) :323-334
[2]   Matrix Factorization-based Technique for Drug Repurposing Predictions [J].
Ceddia, G. ;
Pinoli, P. ;
Ceri, S. ;
Masseroli, M. .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (11) :3162-3172
[3]   MicroRNA-small molecule association identification: from experimental results to computational models [J].
Chen, Xing ;
Guan, Na-Na ;
Sun, Ya-Zhou ;
Li, Jian-Qiang ;
Qu, Jia .
BRIEFINGS IN BIOINFORMATICS, 2020, 21 (01) :47-61
[4]   NLLSS: Predicting Synergistic Drug Combinations Based on Semi-supervised Learning [J].
Chen, Xing ;
Ren, Biao ;
Chen, Ming ;
Wang, Quanxin ;
Zhang, Lixin ;
Yan, Guiying .
PLOS COMPUTATIONAL BIOLOGY, 2016, 12 (07)
[5]   Drug-target interaction prediction: databases, web servers and computational models [J].
Chen, Xing ;
Yan, Chenggang Clarence ;
Zhang, Xiaotian ;
Zhang, Xu ;
Dai, Feng ;
Yin, Jian ;
Zhang, Yongdong .
BRIEFINGS IN BIOINFORMATICS, 2016, 17 (04) :696-712
[6]   The Comparative Toxicogenomics Database: update 2019 [J].
Davis, Allan Peter ;
Grondin, Cynthia J. ;
Johnson, Robin J. ;
Sciaky, Daniela ;
McMorran, Roy ;
Wiegers, Jolene ;
Wiegers, Thomas C. ;
Mattingly, Carolyn J. .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D948-D954
[7]   Similarity-based machine learning methods for predicting drug-target interactions: a brief review [J].
Ding, Hao ;
Takigawa, Ichigaku ;
Mamitsuka, Hiroshi ;
Zhu, Shanfeng .
BRIEFINGS IN BIOINFORMATICS, 2014, 15 (05) :734-747
[8]   metapath2vec: Scalable Representation Learning for Heterogeneous Networks [J].
Dong, Yuxiao ;
Chawla, Nitesh V. ;
Swami, Ananthram .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :135-144
[9]  
Glubb D. M., FRONT PHARMACOL, V8, P1
[10]   PREDICT: a method for inferring novel drug indications with application to personalized medicine [J].
Gottlieb, Assaf ;
Stein, Gideon Y. ;
Ruppin, Eytan ;
Sharan, Roded .
MOLECULAR SYSTEMS BIOLOGY, 2011, 7