GT-NMR: a novel graph transformer-based approach for accurate prediction of NMR chemical shifts

被引:1
作者
Chen, Haochen [1 ]
Liang, Tao [1 ]
Tan, Kai [1 ]
Wu, Anan [1 ]
Lu, Xin [1 ]
机构
[1] Xiamen Univ, Coll Chem & Chem Engn, Fujian Prov Key Lab Theoret & Computat Chem, Dept Chem, Xiamen 361005, Peoples R China
来源
JOURNAL OF CHEMINFORMATICS | 2024年 / 16卷 / 01期
关键词
NMR chemical shifts; Machine learning; Graph transformer; Transformer; Graph neural network; Molecular complexity;
D O I
10.1186/s13321-024-00927-9
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this work, inspired by the graph transformer, we presented an improved protocol, termed GT-NMR, which integrates 2D molecular graph representation with Transformer architecture, for accurate yet efficient prediction of NMR chemical shifts. The effectiveness of the GT-NMR was thoroughly examined with the standard nmrshiftdb2 dataset, 37 natural products and structural elucidation of 11 pairs of natural products. Systematical analysis affirms that GT-NMR outperforms traditional graph-based methods in all aspects, achieving state-of-the-art performance, with the mean absolute error of 0.158 and 1.189 ppm in predicting 1H and 13C NMR chemical shifts, respectively, for the standard nmrshiftdb2 dataset. Further scrutiny of its practical applications indicates that GT-NMR's efficacy is closely tied to molecular complexity, as quantified by the size-normalized spatial score (nSPS). For relatively simple molecules (nSPS < = 27.71), GT-NMR performs comparably to the best density functional while its effectiveness significantly diminishes with complex molecules characterized by higher nSPS values (nSPS > = 38.42). This trend is consistent across other graph-based NMR chemical shift prediction methods as well. Therefore, while employing GT-NMR or other graph-based methods for the rapid and routine prediction of NMR chemical shifts, it is advisable to utilize nSPS to assess their suitability. The source codes and trained model of GT-NMR are publicly available at GitHub. Scientific contribution GT-NMR, which combines the 2D molecular graph representation with the Transformer architecture, was implemented for the first time to predict atom-level NMR chemical shifts, achieving state-of-the-art performance. More importantly, the reliability of the GT-NMR and graph-based methods was assessed for the first time in terms of molecular complexity, as quantified by the size-normalized spacial score (nSPS). Systematical scrutiny demonstrated that GT-NMR offer a valuable way for routine application in structural screening and elucidation of relatively simple molecules.
引用
收藏
页数:10
相关论文
共 48 条
  • [1] Cyclopentane-1,3-dione: A Novel Isostere for the Carboxylic Acid Functional Group. Application to tie Design of Potent Thromboxane (A2) Receptor Antagonists
    Ballatore, Carlo
    Soper, James H.
    Piscitelli, Francesco
    James, Michael
    Huang, Longchuan
    Atasoylu, Onur
    Huryn, Donna M.
    Trojanowski, John Q.
    Lee, Virginia M. -Y.
    Brunden, Kurt R.
    Smith, Amos B., III
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2011, 54 (19) : 6969 - 6983
  • [2] BREMSER W, 1978, ANAL CHIM ACTA-COMP, V2, P355
  • [3] Using NMR to identify and characterize natural products
    Breton, Rosemary C.
    Reynolds, William F.
    [J]. NATURAL PRODUCT REPORTS, 2013, 30 (04) : 501 - 524
  • [4] Recent trends in the structural revision of natural products
    Chhetri, Bhuwan Khatri
    Lavoie, Serge
    Sweeney-Jones, Anne Marie
    Kubanek, Julia
    [J]. NATURAL PRODUCT REPORTS, 2018, 35 (06) : 514 - 531
  • [5] Dwivedi VP, 2022, Arxiv, DOI arXiv:2003.00982
  • [6] Enhancing Efficiency of Natural Product Structure Revision: Leveraging CASE and DFT over Total Synthesis
    Elyashberg, Mikhail
    Tyagarajan, Sriram
    Mandal, Mihir
    Buevich, Alexei V.
    [J]. MOLECULES, 2023, 28 (09):
  • [7] Gilmer J, 2017, PR MACH LEARN RES, V70
  • [8] Scalable graph neural network for NMR chemical shift prediction
    Han, Jongmin
    Kang, Hyungu
    Kang, Seokho
    Kwon, Youngchun
    Lee, Dongseon
    Choi, Youn-Suk
    [J]. PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2022, 24 (43) : 26870 - 26878
  • [9] Transformer-based molecular optimization beyond matched molecular pairs
    He, Jiazhen
    Nittinger, Eva
    Tyrchan, Christian
    Czechtizky, Werngard
    Patronov, Atanas
    Bjerrum, Esben Jannik
    Engkvist, Ola
    [J]. JOURNAL OF CHEMINFORMATICS, 2022, 14 (01)
  • [10] Molecular optimization by capturing chemist's intuition using deep neural networks
    He, Jiazhen
    You, Huifang
    Sandstrom, Emil
    Nittinger, Eva
    Bjerrum, Esben Jannik
    Tyrchan, Christian
    Czechtizky, Werngard
    Engkvist, Ola
    [J]. JOURNAL OF CHEMINFORMATICS, 2021, 13 (01)