Bias from the use of generalized estimating equations to analyze incomplete longitudinal binary data

被引:5
|
作者
Copas, Andrew J. [1 ,2 ]
Seaman, Shaun R. [3 ]
机构
[1] MRC, Clin Trials Unit, London NW1 2DA, England
[2] UCL, London NW1 2DA, England
[3] MRC, Biostat Unit, Inst Publ Hlth, Cambridge CB2 0SR, England
关键词
binary data; generalized estimating equations; missing data; missing at random; repeated measures; MISSING DATA; REPEATED OUTCOMES; DROP-OUTS; SEMIPARAMETRIC REGRESSION; LINEAR-MODELS; ASSOCIATION; NONRESPONSE; RESPONSES; TRIAL; RATES;
D O I
10.1080/02664760902939604
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Patient dropout is a common problem in studies that collect repeated binary measurements. Generalized estimating equations (GEE) are often used to analyze such data. The dropout mechanism may be plausibly missing at random (MAR), i.e. unrelated to future measurements given covariates and past measurements. In this case, various authors have recommended weighted GEE with weights based on an assumed dropout model, or an imputation approach, or a doubly robust approach based on weighting and imputation. These approaches provide asymptotically unbiased inference, provided the dropout or imputation model (as appropriate) is correctly specified. Other authors have suggested that, provided the working correlation structure is correctly specified, GEE using an improved estimator of the correlation parameters ('modified GEE') show minimal bias. These modified GEE have not been thoroughly examined. In this paper, we study the asymptotic bias under MAR dropout of these modified GEE, the standard GEE, and also GEE using the true correlation. We demonstrate that all three methods are biased in general. The modified GEE may be preferred to the standard GEE and are subject to only minimal bias in many MAR scenarios but in others are substantially biased. Hence, we recommend the modified GEE be used with caution.
引用
收藏
页码:911 / 922
页数:12
相关论文
共 50 条
  • [1] Multiply Robust Weighted Generalized Estimating Equations for Incomplete Longitudinal Binary Data Using Empirical Likelihood
    Komazaki, Hiroshi
    Doi, Masaaki
    Yonemoto, Naohiro
    Sato, Tosiya
    STATISTICS IN BIOPHARMACEUTICAL RESEARCH, 2024, 16 (01): : 116 - 129
  • [2] Joint generalized estimating equations for longitudinal binary data
    Huang, Youjun
    Pan, Jianxin
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 155
  • [3] Extended generalized estimating equations for binary familial data with incomplete families
    Fitzgerald, PEB
    BIOMETRICS, 2002, 58 (04) : 718 - 726
  • [4] Penalized joint generalized estimating equations for longitudinal binary data
    Huang, Youjun
    Pan, Jianxin
    BIOMETRICAL JOURNAL, 2022, 64 (01) : 57 - 73
  • [5] Model selection in the weighted generalized estimating equations for longitudinal data with dropout
    Gosho, Masahiko
    BIOMETRICAL JOURNAL, 2016, 58 (03) : 570 - 587
  • [6] Semiparametric Efficient Estimation for Incomplete Longitudinal Binary Data, With Application to Smoking Trends
    Perin, Jamie
    Preisser, John S.
    Rathouz, Paul J.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2009, 104 (488) : 1373 - 1384
  • [7] Modified weights based generalized quasilikelihood inferences in incomplete longitudinal binary models
    Sutradhar, Brajendra C.
    Mallick, Taslim S.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (02): : 217 - 231
  • [8] Selection of Working Correlation Structure in Weighted Generalized Estimating Equation Method for Incomplete Longitudinal Data
    Gosho, Masahiko
    Hamada, Chikuma
    Yoshimura, Isao
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2014, 43 (01) : 62 - 81
  • [9] Using generalized estimating equations and extensions in randomized trials with missing longitudinal patient reported outcome data
    Bell, Melanie L.
    Horton, Nicholas J.
    Dhillon, Haryana M.
    Bray, Victoria J.
    Vardy, Janette
    PSYCHO-ONCOLOGY, 2018, 27 (09) : 2125 - 2131
  • [10] Doubly robust generalized estimating equations for longitudinal data
    Seaman, Shaun
    Copas, Andrew
    STATISTICS IN MEDICINE, 2009, 28 (06) : 937 - 955