Self-Supervised Learning for Label Sparsity in Computational Drug Repositioning

Cited by: 5

Authors:

Yang, Xinxing ^{[1,2]}

Yang, Genke ^{[1,2]}

Chu, Jian ^{[1,2]}

Affiliations:

[1] Shanghai Jiao Tong Univ, Ningbo Artificial Intelligence Inst, Shanghai 200240, Peoples R China

[2] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China

Source:

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS | 2023, Vol. 20, No. 05

Keywords:

Computational drug repositioning; drug discovery; label sparsity; self-supervised learning; INFORMATION;

DOI:

10.1109/TCBB.2023.3254163

Chinese Library Classification (CLC) number:

Q5 [Biochemistry];

Discipline classification codes:

071010; 081704

Abstract:

Computational drug repositioning aims to discover new uses for marketed drugs, which can accelerate drug development and plays an important role in the existing drug discovery system. However, validated drug-disease associations are scarce relative to the number of drugs and diseases in the real world. With too few labeled samples, a classification model cannot learn effective latent factors of drugs, resulting in poor generalization. In this work, we propose a multi-task self-supervised learning framework for computational drug repositioning. The framework tackles label sparsity by learning a better drug representation. Specifically, we take drug-disease association prediction as the main task, while the auxiliary task uses data augmentation strategies and contrastive learning to mine the internal relationships of the original drug features, so as to automatically learn a better drug representation without supervised labels. Joint training ensures that the auxiliary task improves the prediction accuracy of the main task. More precisely, the auxiliary task improves the drug representation and serves as additional regularization to improve generalization. Furthermore, we design a multi-input decoding network to improve the reconstruction ability of the autoencoder model. We evaluate our model on three real-world datasets. The experimental results demonstrate the effectiveness of the multi-task self-supervised learning framework, whose predictive ability is superior to that of the state-of-the-art model.
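The joint objective described in the abstract (a supervised main task plus a contrastive auxiliary task on augmented drug features) can be illustrated with a minimal sketch. This is not the authors' implementation: the random feature masking, the InfoNCE-style loss, the temperature, the `aux_weight` value, and all function names are assumptions chosen for illustration, and the augmented views are compared directly rather than passed through a learned encoder.

```python
import math
import random


def mask_features(x, drop_prob, rng):
    """Hypothetical augmentation: randomly zero out entries of a drug feature vector."""
    return [0.0 if rng.random() < drop_prob else v for v in x]


def cosine(a, b):
    """Cosine similarity with a small epsilon to avoid division by zero."""
    dot = sum(p * q for p, q in zip(a, b))
    na = math.sqrt(sum(p * p for p in a))
    nb = math.sqrt(sum(q * q for q in b))
    return dot / (na * nb + 1e-8)


def info_nce(view_a, view_b, temperature=0.5):
    """Contrastive loss: view_a[i] should match view_b[i] rather than any other view_b[j]."""
    total = 0.0
    for i, a in enumerate(view_a):
        logits = [cosine(a, b) / temperature for b in view_b]
        m = max(logits)  # subtract the max for a numerically stable log-sum-exp
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - logits[i]
    return total / len(view_a)


def bce(pred, label):
    """Binary cross-entropy for one predicted drug-disease association."""
    eps = 1e-7
    pred = min(max(pred, eps), 1 - eps)
    return -(label * math.log(pred) + (1 - label) * math.log(1 - pred))


def joint_loss(main_preds, labels, view_a, view_b, aux_weight=0.1):
    """Main-task loss plus a weighted auxiliary contrastive loss (assumed weighting)."""
    main = sum(bce(p, y) for p, y in zip(main_preds, labels)) / len(labels)
    aux = info_nce(view_a, view_b)
    return main + aux_weight * aux, main, aux


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy drug feature vectors; in a real model these would be encoded first.
    drug_features = [[1.0, 0.0, 1.0, 1.0], [0.0, 1.0, 1.0, 0.0], [1.0, 1.0, 0.0, 0.0]]
    view_a = [mask_features(x, 0.25, rng) for x in drug_features]
    view_b = [mask_features(x, 0.25, rng) for x in drug_features]
    total, main, aux = joint_loss([0.9, 0.2, 0.6], [1, 0, 1], view_a, view_b)
    print(total, main, aux)
```

In this sketch the auxiliary term needs no association labels, so it can be computed for every drug, including those with few or no validated associations; adding it to the main loss acts as a regularizer, in the spirit of the joint training the abstract describes.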


Pages: 3245-3256

Page count: 12
