The effect of number of clusters and magnitude of within-cluster homogeneity in outcomes on the performance of four variance estimators for a marginal multivariable Cox regression model fit to clustered data in the context of observational research

被引:0
作者
Austin, Peter C. [1 ,2 ,3 ]
机构
[1] ICES, V106, 2075 Bayview Ave, Toronto, ON M4N 3M5, Canada
[2] Univ Toronto, Inst Hlth Policy Management & Evaluat, Toronto, ON, Canada
[3] Sunnybrook Res Inst, Toronto, ON, Canada
基金
加拿大健康研究院;
关键词
clustered data; Cox regression; health services research; Monte Carlo simulations; survival analysis; variance estimation; LONGITUDINAL DATA-ANALYSIS; RANDOMIZED-TRIALS; INFERENCE;
D O I
10.1002/sim.10126
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Researchers often estimate the association between the hazard of a time-to-event outcome and the characteristics of individuals and the clusters in which individuals are nested. Lin and Wei's robust variance estimator is often used with a Cox regression model fit to clustered data. Recently, alternative variance estimators have been proposed: the Fay-Graubard estimator, the Kauermann-Carroll estimator, and the Mancl-DeRouen estimator. Using Monte Carlo simulations, we found that, when fitting a marginal Cox regression model with both individual-level and cluster-level covariates: (i) in the presence of weak to moderate within-cluster homogeneity of outcomes, the Lin-Wei variance estimator can result in estimates of the SE with moderate bias when the number of clusters is fewer than 20-30, while in the presence of strong within-cluster homogeneity, it can result in biased estimation even when the number of clusters is as large as 100; (ii) when the number of clusters was less than approximately 20, the Fay-Graubard variance estimator tended to result in estimates of SE with the lowest bias; (iii) when the number of clusters exceeded approximately 20, the Mancl-DeRouen estimator tended to result in estimated standard errors with the lowest bias; (iv) the Mancl-DeRouen estimator used with a t-distribution tended to result in 95% confidence that had the best performance of the estimators; (v) when the magnitude of within-cluster homogeneity in outcomes was strong or very strong, all methods resulted in confidence intervals with lower than advertised coverage rates even when the number of clusters was very large.
引用
收藏
页码:3264 / 3279
页数:16
相关论文
共 22 条
[1]   The iterative bisection procedure: a useful tool for determining parameter values in data-generating processes in Monte Carlo simulations [J].
Austin, Peter C. .
BMC MEDICAL RESEARCH METHODOLOGY, 2023, 23 (01)
[2]   A Tutorial on Multilevel Survival Analysis: Methods, Models and Applications [J].
Austin, Peter C. .
INTERNATIONAL STATISTICAL REVIEW, 2017, 85 (02) :185-203
[3]   Generating survival times to simulate Cox proportional hazards models [J].
Bender, R ;
Augustin, T ;
Blettner, M .
STATISTICS IN MEDICINE, 2005, 24 (11) :1713-1723
[4]  
Duchateau L, 2008, STAT BIOL HEALTH, P1
[5]   Small-sample adjustments for Wald-type tests using sandwich estimators [J].
Fay, MP ;
Graubard, BI .
BIOMETRICS, 2001, 57 (04) :1198-+
[6]   Maintaining the validity of inference in small-sample stepped wedge cluster randomized trials with binary outcomes when using generalized estimating equations [J].
Ford, Whitney P. ;
Westgate, Philip M. .
STATISTICS IN MEDICINE, 2020, 39 (21) :2779-2792
[7]   A comparison of bias-corrected empirical covariance estimators with generalized estimating equations in small-sample longitudinal study settings [J].
Ford, Whitney P. ;
Westgate, Philip M. .
STATISTICS IN MEDICINE, 2018, 37 (28) :4318-4329
[8]   Improved standard error estimator for maintaining the validity of inference in cluster randomized trials with a small number of clusters [J].
Ford, Whitney P. ;
Westgate, Philip M. .
BIOMETRICAL JOURNAL, 2017, 59 (03) :478-495
[9]  
Health Services Research Unit UoA, 2023, DATABASE INTRACORREL
[10]  
Hougaard P, 2014, CH CRC HANDB MOD STA, P457