ATR-FTIR spectroscopy and machine/deep learning models for detecting adulteration in coconut water with sugars, sugar alcohols, and artificial sweeteners

被引:2
作者
Teklemariam, Thomas A. [1 ]
Chou, Faith [2 ]
Kumaravel, Pavisha [3 ]
Van Buskrik, Jeremy [1 ]
机构
[1] Canadian Food Inspect Agcy, Greater Toronto Area Lab, 2301 Midland Ave, Toronto, ON M1P 4R7, Canada
[2] Canadian Food Inspect Agcy, 1400 Merivale Rd, Ottawa, ON K1A 0Y9, Canada
[3] Univ Guelph, Mol & Cellular Biol, Guelph, ON N1G 2W1, Canada
关键词
Coconut water; Adulteration; Sugars; Sugar substitutes; Machine-learning; Deep-learning; CARBON;
D O I
10.1016/j.saa.2024.124771
中图分类号
O433 [光谱学];
学科分类号
0703 ; 070302 ;
摘要
Packaged coconut water offers various options, from pure to those with added sugars and other additives. While the purity of coconut water is esteemed for its health benefits, its popularity also exposes it to potential adulteration and misrepresentation. To address this concern, our study combines Fourier transform infrared spectroscopy (FTIR) and machine learning techniques to detect potential adulterants in coconut water through classification models. The dataset comprises infrared spectra from coconut water samples spiked with 15 different types of potential sugar substitutes, including: sugars, artificial sweeteners, and sugar alcohols. The interaction of infrared light with molecular bonds generates unique molecular fingerprints, forming the basis of our analysis. Departing from previous research predominantly reliant on linear-based chemometrics for adulterant detection, our study explored linear, non-linear, and combined feature extraction models. By developing an interactive application utilizing principal component analysis (PCA) and t-distributed stochastic neighbor embedding (t-SNE), non-targeted sugar adulterant detection was streamlined through enhanced visualization and pattern recognition. Targeted analysis using ensemble learning random forest (RF) and deep learning 1dimensional convolutional neural network (1D CNN) achieved higher classification accuracies (95% and 96%, respectively) compared to sparse partial least squares discriminant analysis (sPLS-DA) at 77% and support vector machine (SVM) at 88% on the same dataset. The CNN's demonstrated classification accuracy is complemented by exceptional efficiency through its ability to train and test on raw data.
引用
收藏
页数:12
相关论文
共 45 条
  • [1] Convolutional neural networks for vibrational spectroscopic data analysis
    Acquarelli, Jacopo
    van Laarhoven, Twan
    Gerretzen, Jan
    Tran, Thanh N.
    Buydens, Lutgarde M. C.
    Marchiori, Elena
    [J]. ANALYTICA CHIMICA ACTA, 2017, 954 : 22 - 31
  • [2] Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
    Alzubaidi, Laith
    Zhang, Jinglan
    Humaidi, Amjad J.
    Al-Dujaili, Ayad
    Duan, Ye
    Al-Shamma, Omran
    Santamaria, J.
    Fadhel, Mohammed A.
    Al-Amidie, Muthana
    Farhan, Laith
    [J]. JOURNAL OF BIG DATA, 2021, 8 (01)
  • [3] [Anonymous], 2021, RStudio:_Integrated_Development_Environment_for_R
  • [4] Physico-chemical characteristics and stability aspects of coconut water and kernel at different stages of maturity
    Appaiah, Prakruthi
    Sunil, L.
    Kumar, P. K. Prasanth
    Krishna, A. G. Gopala
    [J]. JOURNAL OF FOOD SCIENCE AND TECHNOLOGY-MYSORE, 2015, 52 (08): : 5196 - 5203
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Brigato L, 2020, Arxiv, DOI [arXiv:2003.12843, DOI 10.48550/ARXIV.2003.12843]
  • [7] Critical Review of Analytical and Bioanalytical Verification of the Authenticity of Coffee
    Burns, Duncan Thorburn
    Walker, Michael J.
    [J]. JOURNAL OF AOAC INTERNATIONAL, 2020, 103 (02) : 283 - 294
  • [8] Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems
    Cao, Kim-Anh Le
    Boitard, Simon
    Besse, Philippe
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [9] A comprehensive survey on support vector machine classification: Applications, challenges and trends
    Cervantes, Jair
    Garcia-Lamont, Farid
    Rodriguez-Mazahua, Lisbeth
    Lopez, Asdrubal
    [J]. NEUROCOMPUTING, 2020, 408 : 189 - 215
  • [10] Chen XY, 2019, ANAL METHODS-UK, V11, P5118, DOI [10.1039/c9ay01531k, 10.1039/C9AY01531K]