Optimal model averaging estimation for correlation structure in generalized estimating equations

被引:10
作者
Fang, Fang [1 ]
Li, Jialiang [2 ]
Wang, Jingli [3 ]
机构
[1] East China Normal Univ, Sch Stat, Shanghai, Peoples R China
[2] Natl Univ Singapore, Dept Stat & Appl Probabil, Duke NUS Grad Med Sch, Singapore Eye Res Inst, 6 Sci Dr 2, Singapore 117546, Singapore
[3] Natl Univ Singapore, Dept Stat & Appl Probabil, Singapore, Singapore
关键词
Asymptotical optimality; Efficient estimation; Generalized estimating equation; Longitudinal data; Misspecification; Model averaging; Working correlation; LONGITUDINAL DATA; SEMIPARAMETRIC ESTIMATION; COEFFICIENT MODELS; SELECTION; REGRESSION; INFERENCE;
D O I
10.1080/03610918.2017.1419260
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Longitudinal data analysis requires a proper estimation of the within-cluster correlation structure in order to achieve efficient estimates of the regression parameters. When applying likelihood-based methods one may select an optimal correlation structure by the AIC or BIC. However, such information criteria are not applicable for estimating equation based approaches. In this paper we develop a model averaging approach to estimate the correlation matrix by a weighted sum of a group of patterned correlation matrices under the GEE framework. The optimal weight is determined by minimizing the difference between the weighted sum and a consistent yet inefficient estimator of the correlation structure. The computation of our proposed approach only involves a standard quadratic programming on top of the standard GEE procedure and can be easily implemented in practice. We provide theoretical justifications and extensive numerical simulations to support the application of the proposed estimator. A couple of well-known longitudinal data sets are revisited where we implement and illustrate our methodology.
引用
收藏
页码:1574 / 1593
页数:20
相关论文
共 32 条
[1]  
[Anonymous], 1995, Nonlinear models for repeated measurement data
[2]   Model selection: An integral part of inference [J].
Buckland, ST ;
Burnham, KP ;
Augustin, NH .
BIOMETRICS, 1997, 53 (02) :603-618
[3]   EFFICIENT ESTIMATION IN SEMIVARYING COEFFICIENT MODELS FOR LONGITUDINAL/CLUSTERED DATA [J].
Cheng, Ming-Yen ;
Honda, Toshio ;
Li, Jialiang .
ANNALS OF STATISTICS, 2016, 44 (05) :1988-2017
[4]   NONPARAMETRIC INDEPENDENCE SCREENING AND STRUCTURE IDENTIFICATION FOR ULTRA-HIGH DIMENSIONAL LONGITUDINAL DATA [J].
Cheng, Ming-Yen ;
Honda, Toshio ;
Li, Jialiang ;
Peng, Heng .
ANNALS OF STATISTICS, 2014, 42 (05) :1819-1849
[5]   An asymptotic theory for model selection inference in general semiparametric problems [J].
Claeskens, Gerda ;
Carroll, Raymond J. .
BIOMETRIKA, 2007, 94 (02) :249-265
[6]  
CROWDER M, 1995, BIOMETRIKA, V82, P407
[7]   Analysis of longitudinal data with semiparametric estimation of covariance function [J].
Fan, Jianqing ;
Huang, Tao ;
Li, Runze .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (478) :632-641
[8]   Semiparametric Estimation of Covariance Matrixes for Longitudinal Data [J].
Fan, Jianqing ;
Wu, Yichao .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (484) :1520-1533
[9]   Detecting the violation of variance homogeneity in mixed models [J].
Fang, Xicheng ;
Li, Jialiang ;
Wong, Weng Kee ;
Fu, Bo .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (06) :2506-2520
[10]  
Fitzmaurice G., 2011, Applied Longitudinal Data Analysis, V2nd