Bayesian high-dimensional covariate selection in non-linear mixed-effects models using the SAEM algorithm

被引:0
作者
Marion Naveau
Guillaume Kon Kam King
Renaud Rincent
Laure Sansonnet
Maud Delattre
机构
[1] UMR MIA Paris-Saclay,Université Paris
[2] MaIAGE,Saclay, AgroParisTech, INRAE
[3] GQE - Le Moulon,Université Paris
来源
Statistics and Computing | 2024年 / 34卷
关键词
High-dimension; Non-linear mixed-effects models; SAEM algorithm; Spike-and-slab prior; Variable selection;
D O I
暂无
中图分类号
学科分类号
摘要
High-dimensional variable selection, with many more covariates than observations, is widely documented in standard regression models, but there are still few tools to address it in non-linear mixed-effects models where data are collected repeatedly on several individuals. In this work, variable selection is approached from a Bayesian perspective and a selection procedure is proposed, combining the use of a spike-and-slab prior and the Stochastic Approximation version of the Expectation Maximisation (SAEM) algorithm. Similarly to Lasso regression, the set of relevant covariates is selected by exploring a grid of values for the penalisation parameter. The SAEM approach is much faster than a classical Markov chain Monte Carlo algorithm and our method shows very good selection performances on simulated data. Its flexibility is demonstrated by implementing it for a variety of nonlinear mixed effects models. The usefulness of the proposed method is illustrated on a problem of genetic markers identification, relevant for genomic-assisted selection in plant breeding.
引用
收藏
相关论文
共 112 条
  • [21] Bodik R(2017)Bayesian variable selection regression of multivariate responses for group data Bayesian Anal. 12 241-846
  • [22] Delattre M(2018)Comparing spike and slab priors for Bayesian variable selection Aust. J. Stat. 40 1023-214
  • [23] Lavielle M(1988)Bayesian variable selection in linear regression J. Am. Stat. Assoc. 83 559-440
  • [24] Poursat MA(2022)Fast selection of nonlinear mixed effect models using penalized likelihood Comput. Stat. Data Anal. 167 3399-464
  • [25] Delyon B(2007)PLINK: a tool set for whole-genome association and population-based linkage analyses Am. J. Hum. Genet. 81 828-639
  • [26] Lavielle M(2019)Using crop growth model stress covariates and AMMI decomposition to better predict genotype-by-environment interactions Theor. Appl. Genet. 132 197-501
  • [27] Moulines E(2014)EMVS: The EM approach to Bayesian variable selection J. Am. Stat. Assoc. 109 419-288
  • [28] Dempster AP(2011)Estimation for high-dimensional linear mixed-effects models using l1-penalization Scand. J. Stat. 38 461-2880
  • [29] Laird NM(1985)A modular system of algorithms for unconstrained minimization ACM Trans. Math. Softw. (TOMS) 11 583-704
  • [30] Rubin DB(1978)Estimating the dimension of a model Ann. Stat. 6 2024-undefined