Model averaging in calibration of near-infrared instruments with correlated high-dimensional data

被引:1
作者
Salaki, Deiby Tineke [1 ]
Kurnia, Anang [2 ]
Sartono, Bagus [2 ]
Mangku, I. Wayan [3 ]
Gusnanto, Arief [4 ]
机构
[1] Sam Ratulangi Univ, Dept Math, Manado, Indonesia
[2] Bogor Agr Univ, Dept Stat, Bogor, Indonesia
[3] Bogor Agr Univ, Dept Math, Bogor, Indonesia
[4] Univ Leeds, Dept Stat, Leeds LS2 9JT, W Yorkshire, England
关键词
Model averaging; high-dimensional data; multicollinearity; calibration; near-infrared spectroscopy; VARIABLE SELECTION; RIDGE-REGRESSION; LASSO;
D O I
10.1080/02664763.2022.2122947
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Model averaging (MA) is a modelling strategy where the uncertainty in the configuration of selected variables is taken into account by weight-combining each estimate of the so-called 'candidate model'. Some studies have shown that MA enables better prediction, even in high-dimensional cases. However, little is known about the model prediction performance at different types of multicollinearity in high-dimensional data. Motivated by calibration of near-infrared (NIR) instruments,we focus on MA prediction performance in such data. The weighting schemes that we consider are based on the Akaike's information criterion (AIC), Mallows' C-p, and cross-validation. For estimating the model parameters, we consider the standard least squares and the ridge regression methods. The results indicate that MA outperforms model selection methods such as LASSO and SCAD in high-correlation data. The use of Mallows' C-p and cross-validation for the weights tends to yield similar results in all structures of correlation, although the former is generally preferred. We also find that the ridge model averaging outperforms the least-squares model averaging. This research suggests ridge model averaging to build a relatively better prediction of the NIR calibration model.
引用
收藏
页码:279 / 297
页数:19
相关论文
共 50 条
  • [21] A WEIGHT-RELAXED MODEL AVERAGING APPROACH FOR HIGH-DIMENSIONAL GENERALIZED LINEAR MODELS
    Ando, Tomohiro
    Li, Ker-Chau
    ANNALS OF STATISTICS, 2017, 45 (06) : 2654 - 2679
  • [22] Adaptive and reversed penalty for analysis of high-dimensional correlated data
    Yang, Yuehan
    Yang, Hu
    APPLIED MATHEMATICAL MODELLING, 2021, 92 : 63 - 77
  • [23] A near-infrared calibration method suitable for quantification of broadband data in humans
    Zhang, Qiong
    Srinivasan, Sathyanarayanan
    Wu, Ying
    Natah, Siraj
    Dunn, Jeff F.
    JOURNAL OF NEUROSCIENCE METHODS, 2010, 188 (02) : 181 - 186
  • [24] Estimation of semiparametric regression model with right-censored high-dimensional data
    Aydin, Dursun
    Ahmed, S. Ejaz
    Yilmaz, Ersin
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2019, 89 (06) : 985 - 1004
  • [25] Standardisation of near-infrared spectrometric instruments: A review
    Bouveresse, E
    Massart, DL
    VIBRATIONAL SPECTROSCOPY, 1996, 11 (01) : 3 - 15
  • [26] Survey of the development of near-infrared spectroscopy instruments
    Qi Xiao
    Han Jian-Guo
    Li Man-Li
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2007, 27 (10) : 2022 - 2026
  • [27] Model averaging estimation for high-dimensional covariance matrices with a network structure
    Zhu, Rong
    Zhang, Xinyu
    Ma, Yanyuan
    Zou, Guohua
    ECONOMETRICS JOURNAL, 2021, 24 (01) : 177 - 197
  • [28] Asymptotic efficiency of the calibration estimator in a high-dimensional data setting
    Chauvet, Guillaume
    Goga, Camelia
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2022, 217 : 177 - 187
  • [29] Modelling Interactions in High-dimensional Data with Backtracking
    Shah, Rajen D.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17 : 1 - 31
  • [30] Optimization of Wheat Protein Near-Infrared Calibration Model Based on SPXY
    Mao, Xiaodong
    Sun, Laijun
    Hao, Gang
    Xu, Lulu
    Hui, Guangyan
    FRONTIERS OF CHEMICAL ENGINEERING, METALLURGICAL ENGINEERING AND MATERIALS II, 2013, 803 : 122 - +