Using spatial information for evaluating the quality of prediction maps from hyperspectral images: A geostatistical approach

被引:7
作者
Herrero-Langreo, Ana [1 ]
Gorretta, Nathalie [2 ]
Tisseyre, Bruno [2 ]
Gowen, Aoife [1 ]
Xu, Jun-Li [1 ]
Chaix, Gilles [3 ,4 ]
Roger, Jean-Michel [2 ]
机构
[1] Univ Coll Dublin, UCD Sch Biosyst & Food Engn, Dublin 4, Ireland
[2] Univ Montpellier, Montpellier SupAgro, Irstea, ITAP, Montpellier, France
[3] UMR AGAP, CIRAD, Montpellier, France
[4] Univ Sao Paulo, Dept Ciencias Florestais, ESALQ, Sao Paulo, Brazil
基金
欧洲研究理事会; 爱尔兰科学基金会; 巴西圣保罗研究基金会;
关键词
Hyperspectral; Geostatistics; Image; Spatial; Overfitting; Regression; REGRESSION; VARIABILITY; DESIGN;
D O I
10.1016/j.aca.2019.05.067
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Applying a calibration model onto hyperspectral (HS) images is of great interest because it produces images of chemical or physical properties. HS imaging is widely used in this way in food processing industries for monitoring product quality and process control. In this context, one of the main difficulties in the application of regression models to HS images is to evaluate the error of the obtained predictions, since in a proximal imaging set up, the size of the pixels is usually much smaller than the area required to obtain a wet chemical reference. Moreover, the selection of regression model parameters, such as the number of latent variables (LV) in a partial least squares (PLS) model, can modify the appearance of the prediction maps. The objective of this work is to propose an approach based on geostatistical indices to use spatial information of prediction maps for supporting the evaluation of regression models applied to HS images. This work stablishes a theoretical connection between linear regression model performance estimates and the spatial decomposition of variance in prediction maps, when the ground truth to estimate is spatially structured. This approach was tested in a simulated dataset and two real case studies. Geostatistical indices of the prediction maps were compared to model performance metrics for PLS models with increasing number of LV. The theoretical framework was proven by the results on the simulated dataset. In particular, the evolution of the nugget effect, C0; corresponded with the evolution of the random error of the model. Conversely, the error term of the model related with the slope of the model corresponded with the evolution of the structured variance observed in the prediction maps. On the real case studies, geostatistical indexes, extracted from the prediction maps, allowed to quantitatively evaluate the spatial structure of the estimations and complement the Root Mean Standard Error of Cross Validation (RMSECV) for the choice of optimal number of LV to consider in the model. The main advantage of this approach is that it does not require ground truth values. It could be used as a source of information for supporting the choice of optimum calibration parameters, such as the number of latent variables, or the choice of pre-treatments, complementing the traditional visual inspection of prediction maps with quantitative and objective metrics. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:116 / 128
页数:13
相关论文
共 26 条
  • [1] Amigo JM, 2013, DATA HANDL SCI TECHN, V28, P343, DOI 10.1016/B978-0-444-59528-7.00009-0
  • [2] Influence of potassium and sodium nutrition on leaf area components in Eucalyptus grandis trees
    Battie-Laclau, Patricia
    Laclau, Jean-Paul
    Piccolo, Marisa de Cassia
    Arenque, Bruna Cersozimo
    Beri, Constance
    Mietton, Lauriane
    Almeida Muniz, Marta R.
    Jordan-Meille, Lionel
    Buckeridge, Marcos Silveira
    Nouvellon, Yann
    Ranger, Jacques
    Bouillet, Jean-Pierre
    [J]. PLANT AND SOIL, 2013, 371 (1-2) : 19 - 35
  • [3] GEOSTATISTICALLY MODELING STEM SIZE AND INCREMENT IN AN OLD-GROWTH FOREST
    BIONDI, F
    MYERS, DE
    AVERY, CC
    [J]. CANADIAN JOURNAL OF FOREST RESEARCH, 1994, 24 (07) : 1354 - 1368
  • [4] Random sampling or geostatistical modelling? Choosing between design-based and model-based sampling strategies for soil (with discussion)
    Brus, DJ
    deGruijter, JJ
    [J]. GEODERMA, 1997, 80 (1-2) : 1 - 44
  • [5] FIELD-SCALE VARIABILITY OF SOIL PROPERTIES IN CENTRAL IOWA SOILS
    CAMBARDELLA, CA
    MOORMAN, TB
    NOVAK, JM
    PARKIN, TB
    KARLEN, DL
    TURCO, RF
    KONOPKA, AE
    [J]. SOIL SCIENCE SOCIETY OF AMERICA JOURNAL, 1994, 58 (05) : 1501 - 1511
  • [6] Representative subset selection
    Daszykowski, M
    Walczak, B
    Massart, DL
    [J]. ANALYTICA CHIMICA ACTA, 2002, 468 (01) : 91 - 103
  • [7] Standardisation and calibration transfer for near infrared instruments: a review
    Fearn, T
    [J]. JOURNAL OF NEAR INFRARED SPECTROSCOPY, 2001, 9 (04) : 229 - 244
  • [8] Geostatistics in soil science: state-of-the-art and perspectives
    Goovaerts, P
    [J]. GEODERMA, 1999, 89 (1-2) : 1 - 45
  • [9] Gowen A., 2019, J SPECTR IMAGING, V8, pa4, DOI [10.1255/jsi.2019.a4, DOI 10.1255/JSI.2019.A4]
  • [10] Recent applications of Chemical Imaging to pharmaceutical process monitoring and quality control
    Gowen, A. A.
    O'Donnell, C. P.
    Cullen, P. J.
    Bell, S. E. J.
    [J]. EUROPEAN JOURNAL OF PHARMACEUTICS AND BIOPHARMACEUTICS, 2008, 69 (01) : 10 - 22