Integrating near-infrared hyperspectral imaging with machine learning and feature selection: Detecting adulteration of extra-virgin olive oil with lower-grade olive oils and hazelnut oil

被引:2
作者
Malavi, Derick [1 ,2 ]
Raes, Katleen [1 ]
van Haute, Sam [1 ,2 ]
机构
[1] Univ Ghent, Fac Biosci Engn, Dept Food Technol Safety & Hlth, Coupure Links 653, B-9000 Ghent, Belgium
[2] Univ Ghent, Ctr Food Biotechnol & Microbiol, Global Campus,119 Songdomunhwa Ro, Incheon 21985, South Korea
来源
CURRENT RESEARCH IN FOOD SCIENCE | 2024年 / 9卷
关键词
Machine learning; Variable selection; Extra-virgin olive oil (EVOO); Adulteration; Authentication; Classification models; LEAST-SQUARES REGRESSION; VEGETABLE-OILS; FOOD AUTHENTICITY; MASS-SPECTROMETRY; SPECTROSCOPY; IDENTIFICATION; QUALITY; FTIR; NIR; CLASSIFICATION;
D O I
10.1016/j.crfs.2024.100913
中图分类号
TS2 [食品工业];
学科分类号
0832 ;
摘要
Detecting adulteration in extra virgin olive oil (EVOO) is particularly challenging with oils of similar chemical composition. This study applies near-infrared hyperspectral imaging (NIR-HSI) and machine learning (ML) to detect EVOO adulteration with hazelnut, refined olive, and olive pomace oils at various concentrations (1%, 5%, 10%, 20%, 40%, and 100% m/m). Savitzky-Golay filtering, first and second derivatives, multiplicative scatter correction (MSC), standard normal variate (SNV), and their combinations were used to preprocess the spectral data, with Principal Component Analysis (PCA) reducing dimensionality. Classification was performed using Partial Least Squares-Discriminant Analysis (PLS-DA) and ML algorithms, including k-Nearest Neighbors (k-NN), Na & iuml;ve Bayes, Random Forest (RF), Support Vector Machine (SVM), and Artificial Neural Networks (ANN). PLSDA, k-NN, RF, SVM, NB, and ANN models achieved accuracy rates of 97.0-99.0%, 96.2-100%, 96.5-100%, 98.6-99.5%, 93.9-99.7%, and 99.2-100%, respectively, in discriminating between pure EVOO, adulterants, and adulterated oils. PLS-DA, RF, SVM, and ANN significantly outperformed Na & iuml;ve Bayes (p < 0.05) in binary classification, with Matthews correlation coefficient (MCC) values exceeding 0.90. All the binary classifiers except Na & iuml;ve Bayes, when coupled with SNV/MSC, Savitzky-Golay smoothing and derivatives, consistently achieved perfect scores (1.0) for accuracy, sensitivity, specificity, F1 score, precision, and MCC in distinguishing pure EVOO from adulterated oils. No significant differences (p > 0.05) in model performance were found between those using full spectra and those based on key variable selection. However, PLS-DA and ANN significantly outperformed k-NN, RF, and SVM (p < 0.05), with MCC values ranging from 0.95 to 1.00, indicating superior classification performance. These findings demonstrate that combining NIR-HSI with machine learning, along with key variable selection, potentially offers an effective, non-destructive solution for detecting adulteration in EVOO and combating fraud in the olive oil industry.
引用
收藏
页数:23
相关论文
共 98 条
[1]   Application of random forests to select premium quality vegetable oils by their fatty acid composition [J].
Ai, Fang-fang ;
Bin, Jun ;
Zhang, Zhi-min ;
Huang, Jian-hua ;
Wang, Jian-bing ;
Liang, Yi-zeng ;
Yu, Ling ;
Yang, Zhen-yu .
FOOD CHEMISTRY, 2014, 143 :472-478
[2]  
[Anonymous], 2023, Food safety news
[3]   Authentication of vegetable oils by chromatographic techniques [J].
Aparicio, R ;
Aparicio-Ruíz, R .
JOURNAL OF CHROMATOGRAPHY A, 2000, 881 (1-2) :93-104
[4]   Hyperspectral identification of oil adulteration using machine learning techniques [J].
Aqeel, Muhammad ;
Sohaib, Ahmad ;
Iqbal, Muhammad ;
Rehman, Hafeez Ur ;
Rustam, Furqan .
CURRENT RESEARCH IN FOOD SCIENCE, 2024, 8
[5]   Olive oil adulterated with hazelnut oils: simulation to identify possible risks to allergic consumers [J].
Arlorio, M. ;
Coisson, J. D. ;
Bordiga, M. ;
Travaglia, F. ;
Garino, C. ;
Zuidmeer, L. ;
Van Ree, R. ;
Giuffrida, M. G. ;
Conti, A. ;
Martelli, A. .
FOOD ADDITIVES AND CONTAMINANTS PART A-CHEMISTRY ANALYSIS CONTROL EXPOSURE & RISK ASSESSMENT, 2010, 27 (01) :11-18
[6]   Neural networks applied to characterize blends containing refined and extra virgin olive oils [J].
Aroca-Santos, Regina ;
Cancilla, John C. ;
Pariente, Enrique S. ;
Torrecilla, Jose S. .
TALANTA, 2016, 161 :304-308
[7]   Review of the use of phytosterols as a detection tool for adulteration of olive oil with hazelnut oil [J].
Azadmard-Damirchi, Sodeif .
FOOD ADDITIVES AND CONTAMINANTS PART A-CHEMISTRY ANALYSIS CONTROL EXPOSURE & RISK ASSESSMENT, 2010, 27 (01) :1-10
[8]   The Use of Decision Trees and Naive Bayes Algorithms and Trace Element Patterns for Controlling the Authenticity of Free-Range-Pastured Hens' Eggs [J].
Barbosa, Rommel Melgaco ;
Nacano, Leticia Ramos ;
Freitas, Rodolfo ;
Batista, Bruno Lemos ;
Barbosa, Fernando, Jr. .
JOURNAL OF FOOD SCIENCE, 2014, 79 (09) :C1672-C1677
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   MALDI-TOF mass spectrometry detection of extra-virgin olive oil adulteration with hazelnut oil by analysis of phospholipids using an ionic liquid as matrix and extraction solvent [J].
Calvano, Cosima D. ;
De Ceglie, Cristina ;
D'Accolti, Lucia ;
Zambonin, Carlo G. .
FOOD CHEMISTRY, 2012, 134 (02) :1192-1198