Practical issues in using generalized estimating equations for inference on transitions in longitudinal data: What is being estimated?

Citations: 17
Authors
Bible, Joe [1]
Albert, Paul S. [2]
Simons-Morton, Bruce G. [3]
Liu, Danping [2]
Affiliations
[1] Clemson Univ, Dept Math Sci, Clemson, SC USA
[2] NCI, Div Canc Epidemiol & Genet, Biostat Branch, Bethesda, MD 20892 USA
[3] Eunice Kennedy Shriver Natl Inst Child Hlth & Hum, Hlth Behav Branch, 9609 Med Ctr Dr, Rockville, MD 20850 USA
Funding
National Institutes of Health (USA)
Keywords
binary Markov model; misspecification; random effects; transition model; working correlation; MODELS; GEE;
DOI
10.1002/sim.8014
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Generalized estimating equations (GEEs) are commonly used to estimate transition models. When the Markov assumption does not hold but first-order transition probabilities are still of interest, the transition inference is sensitive to the choice of working correlation. In this paper, we consider a random process transition model as the true underlying data generating mechanism, which characterizes subject heterogeneity and complex dependence structure of the outcome process in a very flexible way. We formally define two types of transition probabilities at the population level: "naive transition probabilities" that average across all the transitions and "population-average transition probabilities" that average the subject-specific transition probabilities. Through asymptotic bias calculations and finite-sample simulations, we demonstrate that the unstructured working correlation provides unbiased estimators of the population-average transition probabilities while the independence working correlation provides unbiased estimators of the naive transition probabilities. For population-average transition estimation, we demonstrate that the sandwich estimator fails for unstructured GEE and recommend the use of either jackknife or bootstrap variance estimates. The proposed method is motivated by and applied to the NEXT Generation Health Study, where the interest is in estimating the population-average transition probabilities of alcohol use in adolescents.
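The distinction between the two estimands in the abstract can be illustrated with a small empirical sketch. This is not the authors' estimator or their formal definitions, which this record does not reproduce; it is a toy analogue under stated assumptions. The data and the function names (`naive_transition`, `subject_averaged_transition`) are hypothetical. The "naive" quantity pools every observed 0-to-1 transition across subjects, while the "subject-averaged" quantity first computes each subject's own transition proportion and then averages those proportions with equal weight per subject.

```python
from fractions import Fraction

def transitions_from(seq, state):
    """Return the next-states that follow `state` in one subject's 0/1 sequence."""
    return [nxt for prev, nxt in zip(seq, seq[1:]) if prev == state]

def naive_transition(sequences, state=0, target=1):
    """Pool all transitions out of `state` across subjects, then take the proportion."""
    pooled = [nxt for seq in sequences for nxt in transitions_from(seq, state)]
    return Fraction(sum(1 for n in pooled if n == target), len(pooled))

def subject_averaged_transition(sequences, state=0, target=1):
    """Compute each subject's own proportion, then average with equal subject weights."""
    props = []
    for seq in sequences:
        nxts = transitions_from(seq, state)
        if nxts:  # a subject never observed in `state` contributes no proportion
            props.append(Fraction(sum(1 for n in nxts if n == target), len(nxts)))
    return sum(props) / len(props)

# Hypothetical toy data: two subjects with very different transition behavior.
subject_a = [0, 0, 0, 0, 0, 1]  # five transitions out of 0, one of them to 1
subject_b = [0, 1, 0, 1, 1]     # two transitions out of 0, both to 1
data = [subject_a, subject_b]

print(naive_transition(data))             # 3/7
print(subject_averaged_transition(data))  # 3/5
```

Subject A spends more time in state 0 and so dominates the pooled count, pulling the naive proportion (3/7) below the equally weighted average (3/5). This is the kind of gap that subject heterogeneity can open between the two quantities.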
Pages: 903-916
Number of pages: 14