Chemometric pre-processing can negatively affect the performance of near-infrared spectroscopy models for fruit quality prediction

被引:69
作者
Mishra, Puneet [1 ]
Rutledge, Douglas N. [2 ,3 ]
Roger, Jean-Michel [4 ,5 ]
Wali, Khan [6 ]
Khan, Haris Ahmad [6 ]
机构
[1] Wageningen Food & Biobased Res, Bornse Weilanden 9,POB 17, NL-6700 AA Wageningen, Netherlands
[2] Univ Paris Saclay, UMR SayFood, AgroParisTech, INRAE, F-75005 Paris, France
[3] Charles Sturt Univ, Natl Wine & Grape Ind Ctr, Wagga Wagga, NSW, Australia
[4] Univ Montpellier, Inst Agro, INRAE, ITAP, Montpellier, France
[5] ChemHouse Res Grp, Montpellier, France
[6] Wageningen Univ & Res, Farm Technol Grp, Wageningen, Netherlands
关键词
Artificial intelligence; Neural network; Fruit quality;
D O I
10.1016/j.talanta.2021.122303
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Chemometrics pre-processing of spectral data is widely performed to enhance the predictive performance of near-infrared (NIR) models related to fresh fruit quality. Pre-processing approaches in the domain of NIR data analysis are used to remove the scattering effects, thus, enhancing the absorption components related to the chemical properties. However, in the case of fresh fruit, both the scattering and absorption properties are of key interest as they jointly explain the physicochemical state of a fruit. Therefore, pre-processing data that reduces the scattering information in the spectra may lead to poorly performing models. The objectives of this study are to test two hypotheses to explore the effect of pre-processing on NIR spectra of fresh fruit. The first hypothesis is that the pre-processing of NIR spectra with scatter correction techniques can reduce the predictive performance of models as the scatter correction can reduce the useful scattering information correlated to the property of interest. The second hypothesis is that the Deep Learning (DL) can model the raw absorbance data (mix of scattering and absorption) much more efficiently than the Partial Least Squares (PLS) regression analysis. To test the hypotheses, a real NIR data set related to dry matter (DM) prediction in mango fruit was used. The dataset consisted of a total of 11,420 NIR spectra and reference DM measurements for model training and independent testing. The chemometric pre-processing methods explored were standard normal variate (SNV), variable sorting for normalization (VSN), Savitzky-Golay based 2nd derivative and their combinations. Further two modelling approaches i.e., PLS regression and DL were used to evaluate the effect of pre-processing. The results showed that the best root mean squared error of prediction (RMSEP) for both the PLS and DL models were obtained with the raw absorbance data. The spectral pre-processing in general decreased the performance of both the PLS and DL models. Further, the DL model attained the lowest RMSEP of 0.76%, which was 13% lower compared to the PLS regression on the raw absorbance data. Pre-processing approaches should be carefully used while analysing the NIR data related to fresh fruit.
引用
收藏
页数:7
相关论文
共 23 条
  • [1] Achieving robustness across season, location and cultivar for a NIRS model for intact mango fruit dry matter content
    Anderson, N. T.
    Walsh, K. B.
    Subedi, P. P.
    Hayes, C. H.
    [J]. POSTHARVEST BIOLOGY AND TECHNOLOGY, 2020, 168
  • [2] STANDARD NORMAL VARIATE TRANSFORMATION AND DE-TRENDING OF NEAR-INFRARED DIFFUSE REFLECTANCE SPECTRA
    BARNES, RJ
    DHANOA, MS
    LISTER, SJ
    [J]. APPLIED SPECTROSCOPY, 1989, 43 (05) : 772 - 777
  • [3] Towards calibration-invariant spectroscopy using deep learning
    Chatzidakis, M.
    Botton, G. A.
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [4] Clemmensen L, 2019, ARXIV PREPRINT ARXIV
  • [5] Cramer Richard D. Iii, 1993, Perspectives in Drug Discovery and Design, V1, P269, DOI 10.1007/BF02174528
  • [6] Modern practical convolutional neural networks for multivariate regression: Applications to NIR calibration
    Cui, Chenhao
    Fearn, Tom
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 182 : 9 - 20
  • [7] Breaking with trends in pre-processing?
    Engel, Jasper
    Gerretzen, Jan
    Szymanska, Ewa
    Jansen, Jeroen J.
    Downey, Gerard
    Blanchet, Lionel
    Buydens, Lutgarde M. C.
    [J]. TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2013, 50 : 96 - 106
  • [8] Measurement of optical properties of fruits and vegetables: A review
    Lu, Renfu
    Van Beers, Robbe
    Saeys, Wouter
    Li, Changying
    Cen, Haiyan
    [J]. POSTHARVEST BIOLOGY AND TECHNOLOGY, 2020, 159
  • [9] Mishra P., 2020, CHEMOMET INTELL LAB
  • [10] Improved prediction of tablet properties with near-infrared spectroscopy by a fusion of scatter correction techniques
    Mishra, Puneet
    Nordon, Alison
    Roger, Jean-Michel
    [J]. JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2021, 192