Ilm-NMR-P31: an open-access 31P nuclear magnetic resonance database and data-driven prediction of 31P NMR shifts

被引:2
作者
Hack, Jasmin [1 ]
Jordan, Moritz [1 ]
Schmitt, Alina [1 ]
Raru, Melissa [1 ]
Zorn, Hannes Soenke [1 ]
Seyfarth, Alex [1 ]
Eulenberger, Isabel [1 ]
Geitner, Robert [1 ]
机构
[1] Tech Univ Ilmenau, Inst Chem & Bioengn, Grp Phys Chem Catalysis, Weimarer Str 32, D-98693 Ilmenau, Germany
关键词
NMR; Phosphorus; Database; Prediction; Increment system; Fingerprint; HOSE; Graph neural network; HOSE;
D O I
10.1186/s13321-023-00792-y
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This publication introduces a novel open-access P-31 Nuclear Magnetic Resonance (NMR) shift database. With 14,250 entries encompassing 13,730 distinct molecules from 3,648 references, this database offers a comprehensive repository of organic and inorganic compounds. Emphasizing single-phosphorus atom compounds, the database facilitates data mining and machine learning endeavors, particularly in signal prediction and Computer-Assisted Structure Elucidation (CASE) systems. Additionally, the article compares different models for P-31 NMR shift prediction, showcasing the database's potential utility. Hierarchically Ordered Spherical Environment (HOSE) code-based models and Graph Neural Networks (GNNs) perform exceptionally well with a mean squared error of 11.9 and 11.4 ppm respectively, achieving accuracy comparable to quantum chemical calculations.
引用
收藏
页数:12
相关论文
共 59 条
  • [1] ACD/Labs, 2022, NMR Prediction | 1H, 13 C, 15 N, 19F, 31P NMR predictor
  • [2] acdlabs, ACD/Labs ACD ChemSketch
  • [3] [Anonymous], 2012, J. Cheminformatics
  • [4] Aristaran M., 2020, Tabula
  • [5] BREMSER W, 1978, ANAL CHIM ACTA-COMP, V2, P355
  • [6] ChemmineR: a compound mining framework for R
    Cao, Yiqun
    Charisi, Anna
    Cheng, Li-Chang
    Jiang, Tao
    Girke, Thomas
    [J]. BIOINFORMATICS, 2008, 24 (15) : 1733 - 1734
  • [7] Charlop-Powers Zachary, 2024, CRAN, DOI 10.32614/CRAN.package.rcdklibs
  • [8] Chollet F., 2015, about us
  • [9] Cobas C., 2019, Ensemble NMR prediction-mestrelab resources
  • [10] Couch Simon, 2024, CRAN