Leveraging 2D molecular graph pretraining for improved 3D conformer generation with graph neural networks*

被引:6
作者
Alhamoud, Kumail [1 ]
Ghunaim, Yasir [1 ]
Alshehri, Abdulelah S. [2 ,3 ]
Li, Guohao [1 ]
Ghanem, Bernard [1 ]
You, Fengqi [2 ]
机构
[1] King Abdullah Univ Sci & Technol KAUST, Thuwal 23955, Saudi Arabia
[2] Cornell Univ, Robert Frederick Smith Sch Chem & Biomol Engn, Ithaca, NY 14853 USA
[3] King Saud Univ, Coll Engn, Dept Chem Engn, Riyadh 11421, Saudi Arabia
关键词
Graph neural networks; Conformation generation; Pretraining molecular graph embeddings; Drug design; 3D molecular modeling; STRAIN;
D O I
10.1016/j.compchemeng.2024.108622
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Predicting stable 3D molecular conformations from 2D molecular graphs is a challenging and resource-intensive task, yet it is critical for various applications, particularly drug design. Density functional theory (DFT) calculations set the standard for molecular conformation generation, yet they are computationally intensive. Deep learning offers more computationally efficient approaches, but struggles to match DFT accuracy, particularly on complex drug-like structures. Additionally, the steep computational demands of assembling 3D molecular datasets constrain the broader adoption of deep learning. This work aims to utilize the abundant 2D molecular graph datasets for pretraining a machine learning model, a step that involves initially training the model on a different task with a wealth of data before fine-tuning it for the target task of 3D conformation generation. We build on GeoMol, an end-to-end graph neural network (GNN) method for predicting atomic 3D structures and torsion angles. We examine the limitations of the GeoMol method and introduce new baselines to enhance molecular graph embeddings. Our computational results show that 2D molecular graph pretraining enhances the quality of generated 3D conformers, yielding a 7.7 % average improvement over state-of-the-art sequential methods. These advancements not only facilitate superior 3D conformation generation but also emphasize the potential of leveraging pretrained graph embeddings to boost performance in 3D chemical tasks with GNNs.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] 3-D Ocean Temperature Prediction via Graph Neural Network With Optimized Attention Mechanisms
    Ou, Mingyu
    Xu, Shijie
    Luo, Bin
    Zhou, Hengan
    Zhang, Mingye
    Xu, Pan
    Zhu, Hongna
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [32] Subclinical hyperthyroidism impacts left ventricular deformation: 2D and 3D echocardiographic study
    Tadic, Marijana
    Ilic, Sanja
    Cuspidi, Cesare
    Marjanovic, Tamara
    Celic, Vera
    SCANDINAVIAN CARDIOVASCULAR JOURNAL, 2015, 49 (02) : 74 - 81
  • [33] Left atrial remodelling assessed by 2D and 3D echocardiography identifies paroxysmal atrial fibrillation
    Schaaf, Mathieu
    Andre, Philippe
    Altman, Mikhail
    Maucort-Boulch, Delphine
    Placide, Joel
    Chevalier, Philippe
    Bergerot, Cyrille
    Thibault, Helene
    EUROPEAN HEART JOURNAL-CARDIOVASCULAR IMAGING, 2017, 18 (01) : 46 - 53
  • [34] The effect of out-of-plane motion on 2D and 3D digital image correlation measurements
    Sutton, M. A.
    Yan, J. H.
    Tiwari, V.
    Schreier, H. W.
    Orteu, J. J.
    OPTICS AND LASERS IN ENGINEERING, 2008, 46 (10) : 746 - 757
  • [35] Intensifying the response of distributed optical fibre sensors using 2D and 3D image restoration
    Soto, Marcelo A.
    Ramirez, Jaime A.
    Thevenaz, Luc
    NATURE COMMUNICATIONS, 2016, 7
  • [36] Study of vacancy defect in 2D/3D semiconductor heterostructure based on monolayer WSe2 and GaN
    Ye, Li
    Liang, Yongchao
    MATERIALS TODAY COMMUNICATIONS, 2024, 40
  • [37] Clinical feasibility and validation of 3D principal strain analysis from cine MRI: comparison to 2D strain by MRI and 3D speckle tracking echocardiography
    Alessandro Satriano
    Bobak Heydari
    Mariam Narous
    Derek V. Exner
    Yoko Mikami
    Monica M. Attwood
    John V. Tyberg
    Carmen P. Lydell
    Andrew G. Howarth
    Nowell M. Fine
    James A. White
    The International Journal of Cardiovascular Imaging, 2017, 33 : 1979 - 1992
  • [38] Clinical feasibility and validation of 3D principal strain analysis from cine MRI: comparison to 2D strain by MRI and 3D speckle tracking echocardiography
    Satriano, Alessandro
    Heydari, Bobak
    Narous, Mariam
    Exner, Derek V.
    Mikami, Yoko
    Attwood, Monica M.
    Tyberg, John V.
    Lydell, Carmen P.
    Howarth, Andrew G.
    Fine, Nowell M.
    White, James A.
    INTERNATIONAL JOURNAL OF CARDIOVASCULAR IMAGING, 2017, 33 (12) : 1979 - 1992
  • [39] 3D molecular generation models expand chemical space exploration in drug design
    Xiang, Yu-Ting
    Huang, Guang-Yi
    Shi, Xing-Xing
    Hao, Ge-Fei
    Yang, Guang-Fu
    DRUG DISCOVERY TODAY, 2025, 30 (01)
  • [40] Transferable Graph Neural Network-Based Delay-Fault Localization for Monolithic 3-D ICs
    Hung, Shao-Chun
    Banerjee, Sanmitra
    Chaudhuri, Arjun
    Kim, Jinwoo
    Lim, Sung Kyu
    Chakrabarty, Krishnendu
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (11) : 4296 - 4309