Fence methods for mixed model selection

被引:65
作者
Jiang, Jiming [1 ]
Rao, J. Sunil [3 ]
Gu, Zhonghua [2 ]
Nguyen, Thuan
机构
[1] Univ Calif Davis, Dept Stat, Davis, CA 95616 USA
[2] Alza Corp, Mountain View, CA 94039 USA
[3] Case Western Reserve Univ, Dept Epidemiol & Biostat, Cleveland, OH 44106 USA
基金
美国国家科学基金会;
关键词
adaptive fence; consistency; F-B fence; finite sample performance; GLMM; linear mixed model; model selection;
D O I
10.1214/07-AOS517
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Many model search strategies involve trading off model fit with model complexity in a penalized goodness of fit measure. Asymptotic properties for these types of procedures in settings like linear regression and ARMA time series have been studied, but these do not naturally extend to nonstandard situations such as mixed effects models, where simple definition of the sample size is not meaningful. This paper introduces a new class of strategies, known as fence methods, for mixed model selection, which includes linear and generalized linear mixed models. The idea involves a procedure to isolate a subgroup of what are known as correct models (of which the optimal model is a member). This is accomplished by constructing a statistical fence, or barrier, to carefully eliminate incorrect models. Once the fence is constructed, the optimal model is selected from among those within the fence according to a criterion which can be made flexible. In addition, we propose two variations of the fence. The first is a stepwise procedure to handle situations of many predictors; the second is an adaptive approach for choosing a tuning constant. We give sufficient conditions for consistency of fence and its variations, a desirable property for a good model selection procedure. The methods are illustrated through simulation studies and real data analysis.
引用
收藏
页码:1669 / 1692
页数:24
相关论文
共 29 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
Akaike H., 1973, 2 INT S INFORM THEOR, P267, DOI [DOI 10.1007/978-1-4612-1694-0_15, 10.1007/978-1-4612-1694-0_15]
[3]   Multipoint quantitative-trait linkage analysis in general pedigrees [J].
Almasy, L ;
Blangero, J .
AMERICAN JOURNAL OF HUMAN GENETICS, 1998, 62 (05) :1198-1211
[4]  
[Anonymous], 2001, MODEL SELECTION
[5]   AN ERROR-COMPONENTS MODEL FOR PREDICTION OF COUNTY CROP AREAS USING SURVEY AND SATELLITE DATA [J].
BATTESE, GE ;
HARTER, RM ;
FULLER, WA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1988, 83 (401) :28-36
[6]  
BOZDOGAN H, 1994, P 1 US JAP C FRONT S, V3, pR9
[7]  
DATTA GS, 2001, MODEL SELECTION, P208
[8]  
deLeeuw J., 1992, Breakthroughs in Statistics: Foundations and Basic Theory, P599, DOI DOI 10.1007/978-1-4612-0919-5_37
[9]  
Fabrizi E., 2004, NEW APPROXIMATION BA
[10]   ESTIMATES OF INCOME FOR SMALL PLACES - APPLICATION OF JAMES-STEIN PROCEDURES TO CENSUS-DATA [J].
FAY, RE ;
HERRIOT, RA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (366) :269-277