MVCL-DTI: Predicting Drug-Target Interactions Using a Multiview Contrastive Learning Model on a Heterogeneous Graph

被引:0
作者
Zhang, Bei [1 ,2 ]
Quan, Lijun [1 ,3 ]
Zhang, Zhijun [1 ]
Cao, Lexin [1 ]
Chen, Qiufeng [1 ]
Peng, Liangchen [1 ]
Wang, Junkai [1 ]
Jiang, Yelu [1 ]
Nie, Liangpeng [1 ]
Li, Geng [1 ]
Wu, Tingfang [1 ,3 ]
Lyu, Qiang [1 ,3 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Jiangsu, Peoples R China
[2] China Mobile Suzhou Software Technol Co Ltd, Suzhou 215163, Peoples R China
[3] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210000, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
INFORMATION; IDENTIFICATION;
D O I
10.1021/acs.jcim.4c02073
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Accurate prediction of drug-target interactions (DTIs) is pivotal for accelerating the processes of drug discovery and drug repurposing. MVCL-DTI, a novel model leveraging heterogeneous graphs for predicting DTIs, tackles the challenge of synthesizing information from varied biological subnetworks. It integrates neighbor view, meta-path view, and diffusion view to capture semantic features and employs an attention-based contrastive learning approach, along with a multiview attention-weighted fusion module, to effectively integrate and adaptively weight the information from the different views. Tested under various conditions on benchmark data sets, including varying positive-to-negative sample ratios, conducting hard negative sampling experiments, and masking known DTIs with different ratios, as well as redundant DTIs with various similarity metrics, MVCL-DTI exhibits strong robust generalization. The model is then employed to predict novel DTIs, with a particular focus on COVID-19-related drugs, highlighting its practical applicability. Ultimately, through features visualization and computational properties analysis, we've pinpointed critical elements, including Gene Ontology and substituent nodes, along with a proper initialization strategy, underscoring their vital role in DTI prediction tasks.
引用
收藏
页码:1009 / 1026
页数:18
相关论文
共 52 条
  • [1] ProteinBERT: a universal deep-learning model of protein sequence and function
    Brandes, Nadav
    Ofer, Dan
    Peleg, Yam
    Rappoport, Nadav
    Linial, Michal
    [J]. BIOINFORMATICS, 2022, 38 (08) : 2102 - 2110
  • [2] Brockschmidt, 2020, ARXIV
  • [3] PubChem as a Source of Polypharmacology
    Chen, Bin
    Wild, David
    Guha, Rajarshi
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2009, 49 (09) : 2044 - 2055
  • [4] MicroRNA-small molecule association identification: from experimental results to computational models
    Chen, Xing
    Guan, Na-Na
    Sun, Ya-Zhou
    Li, Jian-Qiang
    Qu, Jia
    [J]. BRIEFINGS IN BIOINFORMATICS, 2020, 21 (01) : 47 - 61
  • [5] The Comparative Toxicogenomics Database: update 2013
    Davis, Allan Peter
    Murphy, Cynthia Grondin
    Johnson, Robin
    Lay, Jean M.
    Lennon-Hopkins, Kelley
    Saraceni-Richards, Cynthia
    Sciaky, Daniela
    King, Benjamin L.
    Rosenstein, Michael C.
    Wiegers, Thomas C.
    Mattingly, Carolyn J.
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) : D1104 - D1114
  • [6] Identification of drug-target interactions via multiple information integration
    Ding, Yijie
    Tang, Jijun
    Guo, Fei
    [J]. INFORMATION SCIENCES, 2017, 418 : 546 - 560
  • [7] Donald B. R., 2011, ALGORITHMS STRUCTURA
  • [8] Reoptimization of MDL keys for use in drug discovery
    Durant, JL
    Leland, BA
    Henry, DR
    Nourse, JG
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (06): : 1273 - 1280
  • [9] SwissDock, a protein-small molecule docking web service based on EADock DSS
    Grosdidier, Aurelien
    Zoete, Vincent
    Michielin, Olivier
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : W270 - W277
  • [10] node2vec: Scalable Feature Learning for Networks
    Grover, Aditya
    Leskovec, Jure
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 855 - 864