Gradient boosting for linear mixed models

被引:13
作者
Griesbach, Colin [1 ]
Saefken, Benjamin [2 ]
Waldmann, Elisabeth [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Dept Med Informat Biometry & Epidemiol, Erlangen, Germany
[2] Georg August Univ Gottingen, Chair Stat, Gottingen, Germany
关键词
gradient boosting; mixed models; regularised regression; statistical learning; VARIABLE SELECTION; REGRESSION; REGULARIZATION; PREDICTION; ALGORITHMS;
D O I
10.1515/ijb-2020-0136
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gradient boosting from the field of statistical learning is widely known as a powerful framework for estimation and selection of predictor effects in various regression models by adapting concepts from classification theory. Current boosting approaches also offer methods accounting for random effects and thus enable prediction of mixed models for longitudinal and clustered data. However, these approaches include several flaws resulting in unbalanced effect selection with falsely induced shrinkage and a low convergence rate on the one hand and biased estimates of the random effects on the other hand. We therefore propose a new boosting algorithm which explicitly accounts for the random structure by excluding it from the selection procedure, properly correcting the random effects estimates and in addition providing likelihood-based estimation of the random effects variance structure. The new algorithm offers an organic and unbiased fitting approach, which is shown via simulations and data examples.
引用
收藏
页码:317 / 329
页数:13
相关论文
共 41 条
  • [31] Estimation for High-Dimensional Linear Mixed-Effects Models Using l1-Penalization
    Schelldorfer, Juerg
    Buehlmann, Peter
    Van De Geer, Sara
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2011, 38 (02) : 197 - 214
  • [32] Geoadditive regression modeling of stream biological condition
    Schmid, Matthias
    Hothorn, Torsten
    Maloney, Kelly O.
    Weller, Donald E.
    Potapov, Sergej
    [J]. ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2011, 18 (04) : 709 - 733
  • [33] Flexible boosting of accelerated failure time models
    Schmid, Matthias
    Hothorn, Torsten
    [J]. BMC BIOINFORMATICS, 2008, 9 (1)
  • [35] A boosting approach to flexible semiparametric mixed models
    Tutz, G.
    Reithinger, F.
    [J]. STATISTICS IN MEDICINE, 2007, 26 (14) : 2872 - 2900
  • [36] Generalized additive modeling with implicit variable selection by likelihood-based boosting
    Tutz, Gerhard
    Binder, Harald
    [J]. BIOMETRICS, 2006, 62 (04) : 961 - 971
  • [37] Tutz G, 2010, STATISTICAL MODELLING AND REGRESSION STRUCTURES, P197, DOI 10.1007/978-3-7908-2413-1_11
  • [38] Conditional Akaike information for mixed-effects models
    Vaida, F
    Blanchard, S
    [J]. BIOMETRIKA, 2005, 92 (02) : 351 - 370
  • [40] Boosting joint models for longitudinal and time-to-event data
    Waldmann, Elisabeth
    Taylor-Robinson, David
    Klein, Nadja
    Kneib, Thomas
    Pressler, Tania
    Schmid, Matthias
    Mayr, Andreas
    [J]. BIOMETRICAL JOURNAL, 2017, 59 (06) : 1104 - 1121