Deep Learning-Based Method for Compound Identification in NMR Spectra of Mixtures

被引:21
|
作者
Wei, Weiwei [1 ]
Liao, Yuxuan [2 ]
Wang, Yufei [2 ]
Wang, Shaoqi [2 ]
Du, Wen [1 ]
Lu, Hongmei [2 ]
Kong, Bo [1 ]
Yang, Huawu [3 ]
Zhang, Zhimin [2 ]
机构
[1] China Tobacco Hunan Ind Co Ltd, Technol Ctr, Changsha 410014, Peoples R China
[2] Cent South Univ, Coll Chem & Chem Engn, Changsha 410083, Peoples R China
[3] China Tobacco Hunan Ind Co Ltd, Flavors & Fragrances Res Inst, Technol Ctr, Changsha 410014, Peoples R China
来源
MOLECULES | 2022年 / 27卷 / 12期
关键词
deep learning; identification; NMR; mixture analysis; NUCLEAR-MAGNETIC-RESONANCE; METABOLITE IDENTIFICATION; COMPLEX-MIXTURES; METABOLOMICS; SPECTROSCOPY; RESOLUTION; DECONVOLUTION; PREDICTION; ALIGNMENT; ROBUST;
D O I
10.3390/molecules27123653
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Nuclear magnetic resonance (NMR) spectroscopy is highly unbiased and reproducible, which provides us a powerful tool to analyze mixtures consisting of small molecules. However, the compound identification in NMR spectra of mixtures is highly challenging because of chemical shift variations of the same compound in different mixtures and peak overlapping among molecules. Here, we present a pseudo-Siamese convolutional neural network method (pSCNN) to identify compounds in mixtures for NMR spectroscopy. A data augmentation method was implemented for the superposition of several NMR spectra sampled from a spectral database with random noises. The augmented dataset was split and used to train, validate and test the pSCNN model. Two experimental NMR datasets (flavor mixtures and additional flavor mixture) were acquired to benchmark its performance in real applications. The results show that the proposed method can achieve good performances in the augmented test set (ACC = 99.80%, TPR = 99.70% and FPR = 0.10%), the flavor mixtures dataset (ACC = 97.62%, TPR = 96.44% and FPR = 2.29%) and the additional flavor mixture dataset (ACC = 91.67%, TPR = 100.00% and FPR = 10.53%). We have demonstrated that the translational invariance of convolutional neural networks can solve the chemical shift variation problem in NMR spectra. In summary, pSCNN is an off-the-shelf method to identify compounds in mixtures for NMR spectroscopy because of its accuracy in compound identification and robustness to chemical shift variation.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Deep learning-based component identification for the Raman spectra of mixtures
    Fan, Xiaqiong
    Ming, Wen
    Zeng, Huitao
    Zhang, Zhimin
    Lu, Hongmei
    ANALYST, 2019, 144 (05) : 1789 - 1798
  • [2] Deconvolution of 1D NMR spectra: A deep learning-based approach
    Schmid, N.
    Bruderer, S.
    Paruzzo, F.
    Fischetti, G.
    Toscano, G.
    Graf, D.
    Fey, M.
    Henrici, A.
    Ziebart, V.
    Heitmann, B.
    Grabner, H.
    Wegner, J. D.
    Sigel, R. K. O.
    Wilhelm, D.
    JOURNAL OF MAGNETIC RESONANCE, 2023, 347
  • [3] Multilabel Deep Learning-Based Lightweight Radar Compound Jamming Recognition Method
    Lv, Qinzhe
    Fan, Hanxin
    Liu, Junliang
    Zhao, Yinghai
    Xing, Mengdao
    Quan, Yinghui
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [4] Effective Deep Learning-Based Infrared Spectral Gas Identification Method
    Wang, Zhikang
    Zhao, Guodong
    ADVANCED THEORY AND SIMULATIONS, 2024, 7 (03)
  • [5] Deep Learning-Based Biomimetic Identification Method for Mask Wearing Standardization
    Yan, Bin
    Li, Xiameng
    Yan, Wenhui
    BIOMIMETICS, 2024, 9 (09)
  • [6] A Deep Learning-Based Method for Identification of Bacteriophage-Host Interaction
    Li, Menglu
    Wang, Yanan
    Li, Fuyi
    Zhao, Yun
    Liu, Mengya
    Zhang, Sijia
    Bin, Yannan
    Smith, A. Ian
    Webb, Geoffrey I.
    Li, Jian
    Song, Jiangning
    Xia, Junfeng
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (05) : 1801 - 1810
  • [7] NMR as a tool for compound identification in mixtures
    Borges, Ricardo Moreira
    Ferreira, Gabriela de Assis
    Campos, Mariana Martins
    Teixeira, Andrew Magno
    Costa, Fernanda das Neves
    das Chagas, Fernanda Oliveira
    Colonna, Maxwell
    PHYTOCHEMICAL ANALYSIS, 2023, 34 (04) : 385 - 392
  • [8] Deep Learning-Based Specific Emitter Identification
    Srinivasulu, N.B.
    Chalamalasetti, Yaswanth
    Ramkumar, Barathram
    Lecture Notes in Networks and Systems, 2023, 554 : 283 - 290
  • [9] Deep learning-based bacterial genus identification
    Khan, Shafiur Rahman
    Khan, Ishrat
    Bag, Md. Abdus Sattar
    Uddin, Machbah
    Hassan, Md. Rakib
    Hassan, Jayedul
    JOURNAL OF ADVANCED VETERINARY AND ANIMAL RESEARCH, 2022, 9 (04) : 573 - 582
  • [10] Deep learning-based segmentation for disease identification
    Mzoughi, Olfa
    Yahiaoui, Itheri
    ECOLOGICAL INFORMATICS, 2023, 75