Multiply Robust Estimation in Regression Analysis With Missing Data

Cited by: 102
Author
Han, Peisong [1]
Affiliation
[1] Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON N2L 3G1, Canada
Keywords
Extreme weights; Empirical likelihood; Double robustness; Augmented inverse probability weighting (AIPW); Missing at random (MAR); Estimating functions; CD4 CELL COUNTS; EMPIRICAL-LIKELIHOOD; SEMIPARAMETRIC ESTIMATION; IMPROVING EFFICIENCY; LONGITUDINAL DATA; OUTCOME DATA; INFERENCE; MODELS; IMPUTATION; SCORE;
DOI
10.1080/01621459.2014.880058
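Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code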
020208 ; 070103 ; 0714 ;
Abstract
Doubly robust estimators are widely used in missing-data analysis. They provide double protection on estimation consistency against model misspecifications. However, they allow only a single model for the missingness mechanism and a single model for the data distribution, and the assumption that one of these two models is correctly specified is restrictive in practice. For regression analysis with possibly missing outcome, we propose an estimation method that allows multiple models for both the missingness mechanism and the data distribution. The resulting estimator is consistent if any one of those multiple models is correctly specified, and thus provides multiple protection on consistency. This estimator is also robust against extreme values of the fitted missingness probability, which, for most doubly robust estimators, can lead to erroneously large inverse probability weights that may jeopardize the numerical performance. The numerical implementation of the proposed method through a modified Newton-Raphson algorithm is discussed. The asymptotic distribution of the resulting estimator is derived, based on which we study the estimation efficiency and provide ways to improve the efficiency. As an application, we analyze the data collected from the AIDS Clinical Trials Group Protocol 175.
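The double-robustness property that the abstract takes as its starting point can be illustrated with a small simulation. The sketch below is not the multiply robust estimator proposed in the paper. It is the standard AIPW estimator of a population mean with an outcome missing at random, fitted with one working model for the missingness mechanism and one for the outcome. The data-generating model, the function names (`fit_logistic`, `fit_ols`, `aipw_mean`, `run_demo`), and the choice of working models are all assumptions made for illustration.

```python
import math
import random


def expit(z):
    """Logistic function."""
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(x, r, n_iter=25):
    """Plain Newton-Raphson for P(R=1 | x) = expit(a + b*x); returns (a, b)."""
    a = b = 0.0
    for _ in range(n_iter):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, ri in zip(x, r):
            p = expit(a + b * xi)
            w = p * (1.0 - p)
            g0 += ri - p            # score, intercept component
            g1 += (ri - p) * xi     # score, slope component
            h00 += w                # entries of the information matrix
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        a += (h11 * g0 - h01 * g1) / det
        b += (h00 * g1 - h01 * g0) / det
    return a, b


def fit_ols(x, y):
    """Simple linear regression y ~ a + b*x; returns (a, b)."""
    n = len(x)
    xbar, ybar = sum(x) / n, sum(y) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    sxy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
    b = sxy / sxx
    return ybar - b * xbar, b


def aipw_mean(y, r, pi_hat, m_hat):
    """AIPW estimate of E[Y]; y[i] is ignored whenever r[i] == 0."""
    total = 0.0
    for yi, ri, pi, mi in zip(y, r, pi_hat, m_hat):
        ipw_term = yi / pi if ri == 1 else 0.0
        total += ipw_term - (ri - pi) / pi * mi   # augmentation term
    return total / len(r)


def run_demo(n=20000, seed=0):
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = [1.0 + 2.0 * xi + rng.gauss(0.0, 1.0) for xi in x]   # true E[Y] = 1
    # Missing at random: the observation probability depends on x only.
    r = [1 if rng.random() < expit(0.5 + xi) else 0 for xi in x]
    x_obs = [xi for xi, ri in zip(x, r) if ri == 1]
    y_obs = [yi for yi, ri in zip(y, r) if ri == 1]

    # Correctly specified working models.
    a, b = fit_logistic(x, r)
    pi_good = [expit(a + b * xi) for xi in x]
    c, d = fit_ols(x_obs, y_obs)
    m_good = [c + d * xi for xi in x]

    # Deliberately misspecified working models that ignore x.
    pi_bad = [sum(r) / n] * n
    m_bad = [sum(y_obs) / len(y_obs)] * n

    return {
        "complete_case": sum(y_obs) / len(y_obs),
        "aipw_outcome_model_correct": aipw_mean(y, r, pi_bad, m_good),
        "aipw_missingness_model_correct": aipw_mean(y, r, pi_good, m_bad),
    }


if __name__ == "__main__":
    for name, value in run_demo().items():
        print(f"{name}: {value:.3f}")
```

In this setup the complete-case mean should be biased upward, because larger values of x are both more likely to be observed and associated with larger outcomes, while each AIPW estimate should stay close to the true mean of 1 as long as one of its two working models is correct. If both working models were misspecified, this protection would be lost, which is the restriction the multiply robust approach aims to relax by allowing several candidate models of each kind.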
Pages: 1159-1173
Number of pages: 15
Related papers
52 in total
[1]  
[Anonymous], 2008, DEV MANAGEMENT HUMAN
[2]   Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972
[3]  
Box GEP, 1987, Empirical model-building and response surfaces
[4]   Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data [J].
Cao, Weihua ;
Tsiatis, Anastasios A. ;
Davidian, Marie .
BIOMETRIKA, 2009, 96 (03) :723-734
[5]   Using empirical likelihood methods to obtain range restricted weights in regression estimators for surveys [J].
Chen, J ;
Sitter, RR ;
Wu, C .
BIOMETRIKA, 2002, 89 (01) :230-237
[6]   Semiparametric efficient estimation for the auxiliary outcome problem with the conditional mean model [J].
Chen, JB ;
Breslow, NE .
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2004, 32 (04) :359-372
[7]   Adjusted empirical likelihood and its properties [J].
Chen, Jiahua ;
Variyath, Asokan Mulayath ;
Abraham, Bovas .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2008, 17 (02) :426-443
[8]   Improving semiparametric estimation by using surrogate data [J].
Chen, Song Xi ;
Leung, Denis H. Y. ;
Qin, Jing .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :803-823
[9]   Semiparametric estimation of treatment effect in a pretest-posttest study with missing data [J].
Davidian, M ;
Tsiatis, AA ;
Leon, S .
STATISTICAL SCIENCE, 2005, 20 (03) :261-282
[10]   A trial comparing nucleoside monotherapy with combination therapy in HIV-infected adults with CD4 cell counts from 200 to 500 per cubic millimeter [J].
Hammer, SM ;
Katzenstein, DA ;
Hughes, MD ;
Gundacker, H ;
Schooley, RT ;
Haubrich, RH ;
Henry, WK ;
Lederman, MM ;
Phair, JP ;
Niu, M ;
Hirsch, MS ;
Merigan, TC ;
Blaschke, TF ;
Simpson, D ;
McLaren, C ;
Rooney, J ;
Salgo, M .
NEW ENGLAND JOURNAL OF MEDICINE, 1996, 335 (15) :1081-1090