Advancing Bioactivity Prediction Through Molecular Docking and Self-Attention

被引:1
作者
Yin, Yueming [1 ]
Lam, Hilbert Yuen In [2 ]
Mu, Yuguang [2 ]
Li, Hoi-Yeung [2 ]
Kong, Adams Wai-Kin [1 ]
机构
[1] Nanyang Technol Univ, Coll Comp & Data Sci, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Sch Biol Sci, Singapore 637551, Singapore
关键词
Proteins; Biological system modeling; Protein engineering; Predictive models; Bioinformatics; Data models; Crystals; Drug discovery; bioactivity prediction; molecular docking; multi-head self-attention; semi-supervised learning; BIOLOGICAL-ACTIVITY; IDENTIFICATION; ACCURATE; BINDING;
D O I
10.1109/JBHI.2024.3448455
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bioactivity refers to the ability of a substance to induce biological effects within living systems, often describing the influence of molecules, drugs, or chemicals on organisms. In drug discovery, predicting bioactivity streamlines early-stage candidate screening by swiftly identifying potential active molecules. The popular deep learning methods in bioactivity prediction primarily model the ligand structure-bioactivity relationship under the premise of Quantitative Structure-Activity Relationship (QSAR). However, bioactivity is determined by multiple factors, including not only the ligand structure but also drug-target interactions, signaling pathways, reaction environments, pharmacokinetic properties, and species differences. Our study first integrates drug-target interactions into bioactivity prediction using protein-ligand complex data from molecular docking. We devise a Drug-Target Interaction Graph Neural Network (DTIGN), infusing interatomic forces into intermolecular graphs. DTIGN employs multi-head self-attention to identify native-like binding pockets and poses within molecular docking results. To validate the fidelity of the self-attention mechanism, we gather ground truth data from crystal structure databases. Subsequently, we employ these limited native structures to refine bioactivity prediction via semi-supervised learning. For this study, we establish a unique benchmark dataset for evaluating bioactivity prediction models in the context of protein-ligand complexes, showcasing the superior performance of our method (with an average improvement of 27.03%) through comparison with 9 leading deep learning-based bioactivity prediction methods.
引用
收藏
页码:7599 / 7610
页数:12
相关论文
共 64 条
  • [1] Applications of Quantitative Structure-Activity Relationships (QSAR) based Virtual Screening in Drug Design: A Review
    Achary, Patnala Ganga Raju
    [J]. MINI-REVIEWS IN MEDICINAL CHEMISTRY, 2020, 20 (14) : 1375 - 1388
  • [2] Fast, accurate, and reliable molecular docking with QuickVina 2
    Alhossary, Amr
    Handoko, Stephanus Daniel
    Mu, Yuguang
    Kwoh, Chee-Keong
    [J]. BIOINFORMATICS, 2015, 31 (13) : 2214 - 2216
  • [3] [Anonymous], 2015, Nucleic Acids Research, V43, pD204, DOI DOI 10.1093/NAR/GKU989
  • [4] Bahi M, 2018, 2018 3RD INTERNATIONAL CONFERENCE ON PATTERN ANALYSIS AND INTELLIGENT SYSTEMS (PAIS), P268
  • [5] A review of ligand-based virtual screening web tools and screening algorithms in large molecular databases in the age of big data
    Banegas-Luna, Antonio-Jesus
    Ceron-Carrasco, Jose P.
    Perez-Sanchez, Horacio
    [J]. FUTURE MEDICINAL CHEMISTRY, 2018, 10 (22) : 2641 - 2658
  • [6] DeepBSP-a Machine Learning Method for Accurate Prediction of Protein-Ligand Docking Structures
    Bao, Jingxiao
    He, Xiao
    Zhang, John Z. H.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (05) : 2231 - 2240
  • [7] Barcelos M.P., 2022, Research Topics in Bioactivity, Environment and Energy: Experimental and Theoretical Tools, P481, DOI [DOI 10.1007/978-3-031-07622-0_19, DOI 10.1007/978-3-031-07622-019]
  • [8] The properties of known drugs .1. Molecular frameworks
    Bemis, GW
    Murcko, MA
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 1996, 39 (15) : 2887 - 2893
  • [9] Papyrus: a large-scale curated dataset aimed at bioactivity predictions
    Bequignon, O. J. M.
    Bongers, B. J.
    Jespers, W.
    IJzerman, A. P.
    van der Water, B.
    van Westen, G. J. P.
    [J]. JOURNAL OF CHEMINFORMATICS, 2023, 15 (01)
  • [10] Adding Stochastic Negative Examples into Machine Learning Improves Molecular Bioactivity Prediction
    Caceres, Elena L.
    Mew, Nicholas C.
    Keiser, Michael J.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2020, 60 (12) : 5957 - 5970