Self-Supervised Learning for Label Sparsity in Computational Drug Repositioning

被引:5
作者
Yang, Xinxing [1 ,2 ]
Yang, Genke [1 ,2 ]
Chu, Jian [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Ningbo Artificial Intelligence Inst, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
关键词
Computational drug repositioning; drug discovery; label sparsity; self-supervised learning; INFORMATION;
D O I
10.1109/TCBB.2023.3254163
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The computational drug repositioning aims to discover new uses for marketed drugs, which can accelerate the drug development process and play an important role in the existing drug discovery system. However, the number of validated drug-disease associations is scarce compared to the number of drugs and diseases in the real world. Too few labeled samples will make the classification model unable to learn effective latent factors of drugs, resulting in poor generalization performance. In this work, we propose a multi-task self-supervised learning framework for computational drug repositioning. The framework tackles label sparsity by learning a better drug representation. Specifically, we take the drug-disease association prediction problem as the main task, and the auxiliary task is to use data augmentation strategies and contrast learning to mine the internal relationships of the original drug features, so as to automatically learn a better drug representation without supervised labels. And through joint training, it is ensured that the auxiliary task can improve the prediction accuracy of the main task. More precisely, the auxiliary task improves drug representation and serving as additional regularization to improve generalization. Furthermore, we design a multi-input decoding network to improve the reconstruction ability of the autoencoder model. We evaluate our model using three real-world datasets. The experimental results demonstrate the effectiveness of the multi-task self-supervised learning framework, and its predictive ability is superior to the state-of-the-art model.
引用
收藏
页码:3245 / 3256
页数:12
相关论文
共 38 条
  • [1] Drug repositioning: Identifying and developing new uses for existing drugs
    Ashburn, TT
    Thor, KB
    [J]. NATURE REVIEWS DRUG DISCOVERY, 2004, 3 (08) : 673 - 683
  • [2] A standard database for drug repositioning
    Brown, Adam S.
    Patel, Chirag J.
    [J]. SCIENTIFIC DATA, 2017, 4
  • [3] iDrug: Integration of drug repositioning and drug-target prediction via cross-network embedding
    Chen, Huiyuan
    Cheng, Feixiong
    Li, Jing
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (07)
  • [4] Key factors in the rising cost of new drug discovery and development
    Dickson, M
    Gagnon, JP
    [J]. NATURE REVIEWS DRUG DISCOVERY, 2004, 3 (05) : 417 - 429
  • [5] PREDICT: a method for inferring novel drug indications with application to personalized medicine
    Gottlieb, Assaf
    Stein, Gideon Y.
    Ruppin, Eytan
    Sharan, Roded
    [J]. MOLECULAR SYSTEMS BIOLOGY, 2011, 7
  • [6] Hamosh A, 2005, NUCLEIC ACIDS RES, V33, pD514
  • [7] Hybrid attentional memory network for computational drug repositioning
    He, Jieyue
    Yang, Xinxing
    Gong, Zhuo
    Zamit, Ibrahim
    [J]. BMC BIOINFORMATICS, 2020, 21 (01)
  • [8] Neural Collaborative Filtering
    He, Xiangnan
    Liao, Lizi
    Zhang, Hanwang
    Nie, Liqiang
    Hu, Xia
    Chua, Tat-Seng
    [J]. PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, : 173 - 182
  • [9] DrugBank 3.0: a comprehensive resource for 'Omics' research on drugs
    Knox, Craig
    Law, Vivian
    Jewison, Timothy
    Liu, Philip
    Ly, Son
    Frolkis, Alex
    Pon, Allison
    Banco, Kelly
    Mak, Christine
    Neveu, Vanessa
    Djoumbou, Yannick
    Eisner, Roman
    Guo, An Chi
    Wishart, David S.
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : D1035 - D1041
  • [10] Koren Y, 2008, P 14 ACM SIGKDD INT, P426