Statistical model choice including variable selection based on variable importance: A relevant way for biomarkers selection to predict meat tenderness

被引:0
|
作者
M. P. Ellies-Oury
M. Chavent
A. Conanec
M. Bonnet
B. Picard
J. Saracco
机构
[1] Université Clermont Auvergne,
[2] INRA,undefined
[3] VetAgro Sup,undefined
[4] UMR Herbivores,undefined
[5] INRIA Bordeaux Sud-Ouest,undefined
[6] CQFD Team,undefined
[7] Université de Bordeaux,undefined
[8] IMB,undefined
[9] UMR 5251,undefined
[10] ENSC - Bordeaux INP,undefined
[11] IMB,undefined
[12] UMR 5251,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we describe a new computational methodology to select the best regression model to predict a numerical variable of interest Y and to select simultaneously the most interesting numerical explanatory variables strongly linked to Y. Three regression models (parametric, semi-parametric and non-parametric) are considered and estimated by multiple linear regression, sliced inverse regression and random forests. Both the variables selection and the model choice are computational. A measure of importance based on random perturbations is calculated for each covariate. The variables above a threshold are selected. Then a learning/test samples approach is used to estimate the Mean Square Error and to determine which model (including variable selection) is the most accurate. The R package modvarsel (MODel and VARiable SELection) implements this computational approach and applies to any regression datasets. After checking the good behavior of the methodology on simulated data, the R package is used to select the proteins predictive of meat tenderness among a pool of 21 candidate proteins assayed in semitendinosus muscle from 71 young bulls. The biomarkers were selected by linear regression (the best regression model) to predict meat tenderness. These biomarkers, we confirm the predominant role of heat shock proteins and metabolic ones.
引用
收藏
相关论文
共 50 条
  • [41] Correlation analysis based relevant variable selection for wind turbine condition monitoring and fault diagnosis
    Han, Huanying
    Yang, Dongsheng
    SUSTAINABLE ENERGY TECHNOLOGIES AND ASSESSMENTS, 2023, 60
  • [42] Process monitoring based on distributed principal component analysis with angle-relevant variable selection
    Xu, Chen
    Liu, Fei
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2019, 15 (06)
  • [43] spikeSlabGAM: Bayesian Variable Selection, Model Choice and Regularization for Generalized Additive Mixed Models in R
    Scheipl, Fabien
    JOURNAL OF STATISTICAL SOFTWARE, 2011, 43 (14):
  • [44] A farewell to the sum of Akaike weights: The benefits of alternative metrics for variable importance estimations in model selection
    Galipaud, Matthias
    Gillingham, Mark A. F.
    Dechaume-Moncharmont, Francois-Xavier
    METHODS IN ECOLOGY AND EVOLUTION, 2017, 8 (12): : 1668 - 1678
  • [45] An efficient variable selection method based on variable permutation and model population analysis for multivariate calibration of NIR spectra
    Bin, Jun
    Ai, Fangfang
    Fan, Wei
    Zhou, Jiheng
    Li, Xin
    Tang, Wenxian
    Liang, Yizeng
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2016, 158 : 1 - 13
  • [46] An efficient variable selection-based Kriging model method for the reliability analysis of slopes with spatially variable soils
    Ding, Jiayi
    Zhou, Jianfang
    Cai, Wei
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2023, 235
  • [47] Variable selection in model-based clustering and discriminant analysis with a regularization approach
    Gilles Celeux
    Cathy Maugis-Rabusseau
    Mohammed Sedki
    Advances in Data Analysis and Classification, 2019, 13 : 259 - 278
  • [48] Variable selection in model-based clustering and discriminant analysis with a regularization approach
    Celeux, Gilles
    Maugis-Rabusseau, Cathy
    Sedki, Mohammed
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (01) : 259 - 278
  • [49] Composite Service Selection Model Based on Two-Dimensional Variable Weight
    Bai Yaxin
    Zhang Hong
    Feng Chao
    Fu Yangzhen
    PROCEEDINGS OF 2016 IEEE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2016), 2016, : 394 - 397
  • [50] A note on path-based variable selection in the penalized proportional hazards model
    Zou, Hui
    BIOMETRIKA, 2008, 95 (01) : 241 - 247