Generalized additive modeling with implicit variable selection by likelihood-based boosting

Cited by: 138
Authors
Tutz, Gerhard
Binder, Harald
Affiliations
[1] Univ Munich, Inst Stat, D-80799 Munich, Germany
[2] Univ Regensburg, Klin Psychiat & Psychotherapie, D-93053 Regensburg, Germany
Keywords
generalized additive models; likelihood-based boosting; penalized stumps; selection of smoothing parameters; variable selection
DOI
10.1111/j.1541-0420.2006.00578.x
Chinese Library Classification number
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
The use of generalized additive models in statistical data analysis suffers from the restriction to few explanatory variables and the problems of selection of smoothing parameters. Generalized additive model boosting circumvents these problems by means of stagewise fitting of weak learners. A fitting procedure is derived which works for all simple exponential family distributions, including binomial, Poisson, and normal response variables. The procedure combines the selection of variables and the determination of the appropriate amount of smoothing. Penalized regression splines and the newly introduced penalized stumps are considered as weak learners. Estimates of standard deviations and stopping criteria, which are notorious problems in iterative procedures, are based on an approximate hat matrix. The method is shown to be a strong competitor to common procedures for the fitting of generalized additive models. In particular, in high-dimensional settings with many nuisance predictor variables it performs very well.
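The stagewise idea in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes a normal response only (squared-error loss), replaces the penalized regression splines and penalized stumps with a ridge-penalized truncated-linear basis, and stops after a fixed number of steps instead of using a criterion based on the approximate hat matrix. The function name `componentwise_boost` and the parameters `n_steps`, `penalty`, and `n_knots` are hypothetical.

```python
import numpy as np


def componentwise_boost(X, y, n_steps=50, penalty=500.0, n_knots=8):
    """Componentwise boosting for a normal response (illustrative sketch).

    Each step refits a heavily penalized one-covariate learner to the current
    residuals for every covariate and keeps only the best-fitting update, so
    covariates that are never chosen are implicitly excluded from the model.
    """
    n, p = X.shape

    # Assumed weak learner: a small truncated-linear basis per covariate.
    bases = []
    for j in range(p):
        xj = X[:, j]
        knots = np.quantile(xj, np.linspace(0.1, 0.9, n_knots))
        B = np.column_stack([xj] + [np.maximum(xj - k, 0.0) for k in knots])
        bases.append(B - B.mean(axis=0))  # centre: the intercept is kept separate

    # Ridge solution operator (B'B + penalty * I)^{-1} B' for each covariate.
    solvers = [
        np.linalg.solve(B.T @ B + penalty * np.eye(B.shape[1]), B.T)
        for B in bases
    ]

    intercept = float(y.mean())
    fit = np.full(n, intercept)
    coefs = [np.zeros(B.shape[1]) for B in bases]
    selected = []

    for _ in range(n_steps):
        resid = y - fit
        best_j, best_rss, best_delta = None, np.inf, None
        for j in range(p):
            delta = solvers[j] @ resid  # penalized fit to the residuals
            rss = float(np.sum((resid - bases[j] @ delta) ** 2))
            if rss < best_rss:
                best_j, best_rss, best_delta = j, rss, delta
        # Update only the component that reduces the residual sum of squares most.
        coefs[best_j] += best_delta
        fit = fit + bases[best_j] @ best_delta
        selected.append(best_j)

    return intercept, coefs, selected, fit
```

In this sketch a large `penalty` keeps each learner weak, so smoothness builds up over the steps in which a covariate is selected, and the list `selected` shows which covariates entered the fit. Binomial and Poisson responses would need likelihood-based (weighted) updates, which are left out here.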
Pages: 961-971
Number of pages: 11