A multiple robust propensity score method for longitudinal analysis with intermittent missing data

被引:14
作者
Chen, Chixiang [1 ]
Shen, Biyi [1 ]
Liu, Aiyi [2 ]
Wu, Rongling [1 ]
Wang, Ming [1 ]
机构
[1] Penn State Coll Med, Dept Publ Hlth Sci, Div Biostat & Bioinformat, Hershey, PA 17033 USA
[2] NICHHD, Biostat & Bioinformat Branch, NIH, Bethesda, MD 20892 USA
关键词
empirical likelihood; missing at random; propensity scores; semiparametric models; variable selection; GENERALIZED ESTIMATING EQUATIONS; WEIGHTED ESTIMATING EQUATIONS; MODEL SELECTION; EMPIRICAL-LIKELIHOOD; REGRESSION-MODELS; BINARY DATA; IMPUTATION; ESTIMATORS; EFFICIENCY; INFERENCE;
D O I
10.1111/biom.13330
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Longitudinal data are very popular in practice, but they are often missing in either outcomes or time-dependent risk factors, making them highly unbalanced and complex. Missing data may contain various missing patterns or mechanisms, and how to properly handle it for unbiased and valid inference still presents a significant challenge. Here, we propose a novel semiparametric framework for analyzing longitudinal data with both missing responses and covariates that are missing at random and intermittent, a general and widely encountered situation in observational studies. Within this framework, we consider multiple robust estimation procedures based on innovative calibrated propensity scores, which offers additional relaxation of the misspecification of missing data mechanisms and shows more satisfactory numerical performance. Also, the corresponding robust information criterion on consistent variable selection for our proposed model is developed based on empirical likelihood-based methods. These advocated methods are evaluated in both theory and extensive simulation studies in a variety of situations, showing competing properties and advantages compared to the existing approaches. We illustrate the utility of our approach by analyzing the data from the HIV Epidemiology Research Study.
引用
收藏
页码:519 / 532
页数:14
相关论文
共 29 条
[1]   Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972
[2]   Doubly Robust Estimates for Binary Longitudinal Data Analysis with Missing Response and Missing Covariates [J].
Chen, Baojiang ;
Zhou, Xiao-Hua .
BIOMETRICS, 2011, 67 (03) :830-842
[3]   Weighted Generalized Estimating Functions for Longitudinal Response and Covariate Data That Are Missing at Random [J].
Chen, Baojiang ;
Yi, Grace Y. ;
Cook, Richard J. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (489) :336-353
[4]  
Chen C., 2020, ARXIV200613281
[5]   Empirical-likelihood-based criteria for model selection on marginal analysis of longitudinal data with dropout missingness [J].
Chen, Chixiang ;
Shen, Biyi ;
Zhang, Lijun ;
Xue, Yuan ;
Wang, Ming .
BIOMETRICS, 2019, 75 (03) :950-965
[6]   Multiply robust imputation procedures for the treatment of item nonresponse in surveys [J].
Chen, Sixia ;
Haziza, David .
BIOMETRIKA, 2017, 104 (02) :439-453
[7]   CALIBRATION ESTIMATORS IN SURVEY SAMPLING [J].
DEVILLE, JC ;
SARNDAL, CE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1992, 87 (418) :376-382
[8]   Model selection in the weighted generalized estimating equations for longitudinal data with dropout [J].
Gosho, Masahiko .
BIOMETRICAL JOURNAL, 2016, 58 (03) :570-587
[9]   Heart and Estrogen/progestin Replacement Study (HERS): Design, methods, and baseline characteristics [J].
Grady, D ;
Applegate, W ;
Bush, T ;
Furberg, C ;
Riggs, B ;
Hulley, SB .
CONTROLLED CLINICAL TRIALS, 1998, 19 (04) :314-335
[10]   Intrinsic efficiency and multiple robustness in longitudinal studies with drop-out [J].
Han, Peisong .
BIOMETRIKA, 2016, 103 (03) :683-700