A tree-based varying coefficient model

被引:0
作者
Zakrisson, Henning [1 ]
Lindholm, Mathias [1 ]
机构
[1] Stockholm Univ, Dept Math, Stockholm, Sweden
关键词
Generalised linear models; Multivariate gradient boosting; Feature selection; Interaction effects; Early stopping; REGRESSION;
D O I
10.1007/s00180-025-01603-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The paper introduces a tree-based varying coefficient model (VCM) where the varying coefficients are modelled using the cyclic gradient boosting machine (CGBM) from Delong et al. (On cyclic gradient boosting machines, 2023). Modelling the coefficient functions using a CGBM allows for dimension-wise early stopping and feature importance scores. The dimension-wise early stopping not only reduces the risk of dimension-specific overfitting, but also reveals differences in model complexity across dimensions. The use of feature importance scores allows for simple feature selection and easy model interpretation. The model is evaluated on the same simulated and real data examples as those used in Richman and W & uuml;thrich (Scand Actuar J 2023:71-95, 2023), and the results show that it produces results in terms of out of sample loss that are comparable to those of their neural network-based VCM called LocalGLMnet.
引用
收藏
页数:30
相关论文
共 31 条
[1]  
Al-Shedivat M, 2020, J MACH LEARN RES, V21
[2]  
Alvarez-Melis D, 2018, ADV NEUR IN, V31
[3]  
[Anonymous], 2011, The elements of statistical learning: data mining, inference, and prediction, DOI 10.1007/978-0-387-84858-7
[4]   Tree-structured modelling of varying coefficients [J].
Berger, Moritz ;
Tutz, Gerhard ;
Schmid, Matthias .
STATISTICS AND COMPUTING, 2019, 29 (02) :217-229
[5]  
Breiman L., 1984, Classification and Regression Trees
[6]   Coefficient-Wise Tree-Based Varying Coefficient Regression with vcrpart [J].
Buergin, Reto ;
Ritschard, Gilbert .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 80 (06) :1-33
[7]   Tree-based varying coefficient regression for longitudinal ordinal responses [J].
Buergin, Reto ;
Ritschard, Gilbert .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 86 :65-80
[8]  
Buhlmann PL, 2002, Research report/Seminar fur Statistik, Eidgenossische Technische Hochschule (ETH), V109
[9]   A Tree-Based Semi-Varying Coefficient Model for the COM-Poisson Distribution [J].
Chatla, Suneel Babu ;
Shmueli, Galit .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2020, 29 (04) :827-846
[10]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794