HAC-Net: A Hybrid Attention-Based Convolutional Neural Network for Highly Accurate Protein-Ligand Binding Affinity Prediction

被引:25
作者
Kyro, Gregory W. [1 ]
Brent, Rafael I. [1 ]
Batista, Victor S. [1 ]
机构
[1] Yale Univ, Dept Chem, New Haven, CT 06511 USA
基金
美国国家卫生研究院;
关键词
MACHINE LEARNING APPROACH; CHARGE EQUILIBRATION; SCORING FUNCTION; SIMILARITY; SYSTEM;
D O I
10.1021/acs.jcim.3c00251
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Applying deep learning concepts from image detection and graph theory has greatly advanced protein-ligand binding affinity prediction, a challenge with enormous ramifica-tions for both drug discovery and protein engineering. We build upon these advances by designing a novel deep learning architecture consisting of a 3-dimensional convolutional neural network utilizing channel-wise attention and two graph convolu-tional networks utilizing attention-based aggregation of node features. HAC-Net (Hybrid Attention-Based Convolutional Neural Network) obtains state-of-the-art results on the PDBbind v.2016 core set, the most widely recognized benchmark in the field. We extensively assess the generalizability of our model using multiple train-test splits, each of which maximizes differences between either protein structures, protein sequences, or ligand extended -connectivity fingerprints of complexes in the training and test sets. Furthermore, we perform 10-fold cross-validation with a similarity cutoff between SMILES strings of ligands in the training and test sets and also evaluate the performance of HAC-Net on lower -quality data. We envision that this model can be extended to a broad range of supervised learning problems related to structure -based biomolecular property prediction. All of our software is available as an open-source repository at https://github.com/gregory-kyro/HAC-Net/, and the HACNet Python package is available through PyPI.
引用
收藏
页码:1947 / 1960
页数:14
相关论文
共 88 条
  • [1] [Anonymous], PYTORCH
  • [2] [Anonymous], 2020, INT J MOL SCI, V21, P9548
  • [3] [Anonymous], 2016, Bridging category-level and instance-level semantic image segmentation
  • [4] Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations?
    Bajusz, David
    Racz, Anita
    Heberger, Kroly
    [J]. JOURNAL OF CHEMINFORMATICS, 2015, 7
  • [5] A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking
    Ballester, Pedro J.
    Mitchell, John B. O.
    [J]. BIOINFORMATICS, 2010, 26 (09) : 1169 - 1175
  • [6] Learning from the ligand: using ligand-based features to improve binding affinity prediction
    Boyles, Fergus
    Deane, Charlotte M.
    Morris, Garrett M.
    [J]. BIOINFORMATICS, 2020, 36 (03) : 758 - 764
  • [7] Unsupervised data base clustering based on Daylight's fingerprint and Tanimoto similarity: A fast and automated way to cluster small and large data sets
    Butina, D
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (04): : 747 - 750
  • [8] Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening
    Cang, Zixuan
    Mu, Lin
    Wei, Guo-Wei
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2018, 14 (01)
  • [9] TopologyNet: Topology based deep convolutional and multi-task neural networks for biomolecular property predictions
    Cang, Zixuan
    Wei, Guowei
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2017, 13 (07)
  • [10] Cho K., 2014, PROC SSST 8 8 WORKSH