Data augmentation in food science: Synthesising spectroscopic data of vegetable oils for performance enhancement

被引:21
作者
Georgouli, Konstantia [1 ]
Osorio, Maria Teresa [1 ]
Del Rincon, Jesus Martinez [2 ]
Koidis, Anastasios [1 ]
机构
[1] Queens Univ Belfast, Inst Global Food Secur, Belfast, Antrim, North Ireland
[2] Queens Univ Belfast, Inst Elect Commun & Informat Technol, Belfast, Antrim, North Ireland
关键词
artificial samples; classification; data augmentation; spectroscopy; vegetable oils; PARTIAL-LEAST-SQUARES; ARTIFICIAL NEURAL-NETWORKS; PATTERN-RECOGNITION; CALIBRATION MAINTENANCE; ENSEMBLE METHODS; NIR; STANDARDIZATION; PREDICTION; IDENTIFICATION; ROBUSTNESS;
D O I
10.1002/cem.3004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Generating more accurate, efficient, and robust classification models in chemometrics, able to address real-world problems in food analysis, is intrinsically related with the amount of available calibration samples. In this paper, we propose a data augmentation solution to increase the performance of a classification model by generating realistic data augmented samples. The feasibility of this solution has been evaluated on 3 main different experiments where Fourier transform mid infrared (FT-IR) spectroscopic data of vegetable oils were used for the identification of vegetable oil species in oil admixtures. Results demonstrate that data augmented samples improved the classification rate by around 19% in a single instrument validation and provided a significant 38% improvement in classification when testing in more than 10 different spectroscopic instruments to the calibration one.
引用
收藏
页数:15
相关论文
共 55 条
[1]  
[Anonymous], 1993, PRACTICAL NIR SPECTR
[2]   Partial least squares for discrimination [J].
Barker, M ;
Rayens, W .
JOURNAL OF CHEMOMETRICS, 2003, 17 (03) :166-173
[3]   STANDARD NORMAL VARIATE TRANSFORMATION AND DE-TRENDING OF NEAR-INFRARED DIFFUSE REFLECTANCE SPECTRA [J].
BARNES, RJ ;
DHANOA, MS ;
LISTER, SJ .
APPLIED SPECTROSCOPY, 1989, 43 (05) :772-777
[4]   Supervised pattern recognition in food analysis [J].
Berrueta, Luis A. ;
Alonso-Salces, Rosa M. ;
Heberger, Karoly .
JOURNAL OF CHROMATOGRAPHY A, 2007, 1158 (1-2) :196-214
[5]   Preparing calibration sets for use in pharmaceutical analysis by NIR spectroscopy [J].
Blanco, M. ;
Bautista, M. ;
Alcala, M. .
JOURNAL OF PHARMACEUTICAL SCIENCES, 2008, 97 (03) :1236-1245
[6]   Improvement of the piecewise direct standardisation procedure for the transfer of NIR spectra for multivariate calibration [J].
Bouveresse, E ;
Massart, DL .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1996, 32 (02) :201-213
[7]  
Breiman L., 2001, Machine Learning, V45, P5
[8]   Systematic prediction error correction: A novel strategy for maintaining the predictive abilities of multivariate calibration models [J].
Chen, Zeng-Ping ;
Li, Li-Mei ;
Yu, Ru-Qin ;
Littlejohn, David ;
Nordon, Alison ;
Morris, Julian ;
Dann, Alison S. ;
Jeffkins, Paul A. ;
Richardson, Mark D. ;
Stimpson, Sarah L. .
ANALYST, 2011, 136 (01) :98-106
[9]   Process analytical technologies and real time process control a review of some spectroscopic issues and challenges [J].
Chen, Zengping ;
Lovett, David ;
Morris, Julian .
JOURNAL OF PROCESS CONTROL, 2011, 21 (10) :1467-1482
[10]   Ensemble learning for independent component analysis [J].
Cheng, J ;
Liu, QS ;
Lu, HQ ;
Chen, YW .
PATTERN RECOGNITION, 2006, 39 (01) :81-88