Unbiased regression trees for longitudinal and clustered data

Cited by: 54
Authors
Fu, Wei [1]
Simonoff, Jeffrey S. [1]
Affiliations
[1] NYU, Leonard N Stern Sch Business, New York, NY 10003 USA
Keywords
Clustered data; Longitudinal data; Mixed models; Regression tree; MODELS
DOI
10.1016/j.csda.2015.02.004
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
A new version of the RE-EM regression tree method for longitudinal and clustered data is presented. The RE-EM tree is a methodology that combines the structure of mixed effects models for longitudinal and clustered data with the flexibility of tree-based estimation methods. The RE-EM tree is less sensitive to parametric assumptions and provides improved predictive power compared to linear models with random effects and regression trees without random effects. The previously suggested methodology used the CART tree algorithm for tree building, so that version of the RE-EM regression tree inherits the tendency of CART to split on variables with more possible split points at the expense of those with fewer split points. A revised version of the RE-EM regression tree corrects for this bias by using the conditional inference tree as the underlying tree algorithm instead of CART. Simulation studies show that the new version is indeed unbiased, and that it improves on the original RE-EM regression tree in prediction accuracy and in its ability to recover the correct tree structure. (C) 2015 Elsevier B.V. All rights reserved.
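The split-selection bias mentioned in the abstract can be illustrated with a small null simulation. The sketch below is not the authors' code and does not implement the RE-EM tree. It only assumes a CART-style exhaustive search that picks the split with the largest reduction in the sum of squared errors. The function names, sample size, and number of replicates are hypothetical choices made for illustration. The response is pure noise, so neither predictor is informative, yet the continuous predictor (many candidate split points) tends to be selected far more often than the binary one (a single split point).

```python
import random


def best_sse_reduction(x, y):
    """Largest SSE reduction over all binary splits of x (exhaustive search)."""
    pairs = sorted(zip(x, y))
    n = len(pairs)
    total_sum = sum(v for _, v in pairs)
    best = 0.0
    left_sum = 0.0
    for i in range(1, n):
        left_sum += pairs[i - 1][1]
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no valid split point between tied x values
        right_sum = total_sum - left_sum
        # Reduction in SSE equals the between-group sum of squares.
        gain = left_sum ** 2 / i + right_sum ** 2 / (n - i) - total_sum ** 2 / n
        best = max(best, gain)
    return best


def simulate(n_reps=200, n=50, seed=0):
    """Share of null replicates in which the continuous predictor wins."""
    rng = random.Random(seed)
    wins_continuous = 0
    for _ in range(n_reps):
        y = [rng.gauss(0, 1) for _ in range(n)]           # noise response
        x_binary = [rng.randint(0, 1) for _ in range(n)]  # one split point
        x_cont = [rng.random() for _ in range(n)]         # up to n - 1 split points
        if best_sse_reduction(x_cont, y) > best_sse_reduction(x_binary, y):
            wins_continuous += 1
    return wins_continuous / n_reps


if __name__ == "__main__":
    share = simulate()
    print(f"continuous predictor chosen in {share:.0%} of null replicates")
```

An unbiased selection rule would choose each uninformative predictor about half the time. According to the abstract, the conditional inference tree achieves this, and the sketch makes no attempt to reproduce that procedure.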
Pages: 53-74
Number of pages: 22