Combining graph neural networks and transformers for few-shot nuclear receptor binding activity prediction

被引:0
作者
Torres, Luis H. M. [1 ]
Arrais, Joel P. [1 ]
Ribeiro, Bernardete [1 ]
机构
[1] Univ Coimbra, Ctr Informat & Syst, Dept Informat Engn, P-3030790 Coimbra, Portugal
来源
JOURNAL OF CHEMINFORMATICS | 2024年 / 16卷 / 01期
关键词
Graph Neural Network; Transformer; Few-shot Learning; Meta-Learning; Nuclear Receptor Binding Activity Prediction; Drug Discovery; NORMALITY;
D O I
10.1186/s13321-024-00902-4
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Nuclear receptors (NRs) play a crucial role as biological targets in drug discovery. However, determining which compounds can act as endocrine disruptors and modulate the function of NRs with a reduced amount of candidate drugs is a challenging task. Moreover, the computational methods for NR-binding activity prediction mostly focus on a single receptor at a time, which may limit their effectiveness. Hence, the transfer of learned knowledge among multiple NRs can improve the performance of molecular predictors and lead to the development of more effective drugs. In this research, we integrate graph neural networks (GNNs) and Transformers to introduce a few-shot GNN-Transformer, Meta-GTNRP to predict the binding activity of compounds using the combined information of different NRs and identify potential NR-modulators with limited data. The Meta-GTNRP model captures the local information in graph-structured data and preserves the global-semantic structure of molecular graph embeddings for NR-binding activity prediction. Furthermore, a few-shot meta-learning approach is proposed to optimize model parameters for different NR-binding tasks and leverage the complementarity among multiple NR-specific tasks to predict binding activity of compounds for each NR with just a few labeled molecules. Experiments with a compound database containing annotations on the binding activity for 11 NRs shows that Meta-GTNRP outperforms other graph-based approaches. The data and code are available at: https://github.com/ltorres97/Meta-GTNRP.Scientific contributionThe proposed few-shot GNN-Transformer model, Meta-GTNRP captures the local structure of molecular graphs and preserves the global-semantic information of graph embeddings to predict the NR-binding activity of compounds with limited available data; A few-shot meta-learning framework adapts model parameters across NR-specific tasks for different NRs in a joint learning procedure to predict the binding activity of compounds for each NR with just a few labeled molecules in highly imbalanced data scenarios; Meta-GTNRP is a data-efficient approach that combines the strengths of GNNs and Transformers to predict the NR-binding properties of compounds through an optimized meta-learning procedure and deliver robust results valuable to identify potential NR-based drug candidates.
引用
收藏
页数:19
相关论文
共 64 条
  • [1] Low Data Drug Discovery with One-Shot Learning
    Altae-Tran, Han
    Ramsundar, Bharath
    Pappu, Aneesh S.
    Pande, Vijay
    [J]. ACS CENTRAL SCIENCE, 2017, 3 (04) : 283 - 293
  • [2] Beyer L., 2022, arXiv
  • [3] Machine learning for molecular and materials science
    Butler, Keith T.
    Davies, Daniel W.
    Cartwright, Hugh
    Isayev, Olexandr
    Walsh, Aron
    [J]. NATURE, 2018, 559 (7715) : 547 - 555
  • [4] End-to-end sequence-structure-function meta-learning predicts genome-wide chemical-protein interactions for dark proteins
    Cai, Tian
    Xie, Li
    Zhang, Shuo
    Chen, Muge
    He, Di
    Badkul, Amitesh
    Liu, Yang
    Namballa, Hari Krishna
    Dorogan, Michael
    Harding, Wayne W.
    Mura, Cameron
    Bourne, Philip E.
    Xie, Lei
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2023, 19 (01)
  • [5] Chiazza F., 2016, Molecular Nutrition and Diabetes: A Volume in the Molecular Nutrition Series, P105, DOI [10.1016/B978-0-12-801585-8.00009-9, DOI 10.1016/B978-0-12-801585-8.00009-9]
  • [6] Bioalerts: a python']python library for the derivation of structural alerts from bioactivity and toxicity data sets
    Cortes-Ciriano, Isidro
    [J]. JOURNAL OF CHEMINFORMATICS, 2016, 8
  • [7] OMNIBUS TEST OF NORMALITY FOR MODERATE AND LARGE SIZE SAMPLES
    DAGOSTINO, RB
    [J]. BIOMETRIKA, 1971, 58 (02) : 341 - +
  • [8] The retinoid X receptors and their ligands
    Dawson, Marcia I.
    Xia, Zebin
    [J]. BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR AND CELL BIOLOGY OF LIPIDS, 2012, 1821 (01): : 21 - 56
  • [9] Defferrard M, 2017, Arxiv, DOI [arXiv:1606.09375, 10.48550/ARXIV.1606.09375]
  • [10] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929