A review and empirical comparison of causal inference methods for clustered observational data with application to the evaluation of the effectiveness of medical devices

被引:20
作者
Cafri, Guy [1 ]
Wang, Wei [1 ]
Chan, Priscilla H. [1 ]
Austin, Peter C. [2 ,3 ]
机构
[1] Kaiser Permanente, Surg Outcomes & Anal, 8954 Rio San Diego Dr,Suite 406, San Diego, CA 92108 USA
[2] Inst Clin & Evaluat Sci, Toronto, ON, Canada
[3] Univ Toronto, Inst Hlth Policy Management & Evaluat, Toronto, ON, Canada
关键词
Observational studies; propensity scores; clustering; orthopedics; medical devices; MARGINAL STRUCTURAL MODELS; PROPENSITY SCORE METHODS; INVERSE PROBABILITY; SURVIVAL-DATA; SENSITIVITY; HETEROGENEITY; OUTCOMES; BALANCE; TIME; SELECTION;
D O I
10.1177/0962280218799540
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Observational studies are commonplace in medicine. A frequent concern is confounding bias due to differences in patient characteristics across treatment groups, but other important issues include dependency among observations nested within clusters (e.g. patients clustered within physicians or surgeons) and confounding due to cluster characteristics (e.g. physician or surgeon experience or training). Given the frequency with which these issues arise in medical research, as well as their relative complexity, methods for the analysis of clustered observational data are reviewed. We argue for estimating causal treatment effects using marginal models that either match or weight observations using a suitable distance metric (e.g. the propensity score). Simulation results demonstrated that methods incorporating clustering into calculation of the variance were generally more accurate than those that did not. Moreover, methods that account for cluster confounding when estimating the treatment effect were least biased and most accurate. Throughout the paper we illustrate the proposed methods in a medical device setting that compares the effectiveness of femoral heads used in total hip replacements. Whenever possible the clustered aspect of the data should be considered in the design of the study when constructing the distance measure or in the matching process, as well as in the analysis when estimating the variance of the treatment effect.
引用
收藏
页码:3142 / 3162
页数:21
相关论文
共 72 条
[1]   HETEROGENEITY IN SURVIVAL ANALYSIS [J].
AALEN, OO .
STATISTICS IN MEDICINE, 1988, 7 (11) :1121-1137
[2]   Calculating the number needed to treat for trials where the outcome is time to an event [J].
Altman, DG ;
Andersen, PK .
BRITISH MEDICAL JOURNAL, 1999, 319 (7223) :1492-1495
[3]  
[Anonymous], 2009, QUANTITATIVE APPL SO
[4]  
[Anonymous], SURVIVAL ANAL STATE
[5]   Propensity score matching with clustered data. An application to the estimation of the impact of caesarean section on the Apgar score [J].
Arpino, Bruno ;
Cannas, Massimo .
STATISTICS IN MEDICINE, 2016, 35 (12) :2074-2091
[6]   The specification of the propensity score in multilevel observational studies [J].
Arpino, Bruno ;
Mealli, Fabrizia .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (04) :1770-1780
[7]   Conditioning on the propensity score can result in biased estimation of common measures of treatment effect: A Monte Carlo study (p n/a) [J].
Austin, Peter C. ;
Grootendorst, Paul ;
Normand, Sharon-Lise T. ;
Anderson, Geoffrey M. .
STATISTICS IN MEDICINE, 2007, 26 (16) :3208-3210
[8]   A Tutorial on Multilevel Survival Analysis: Methods, Models and Applications [J].
Austin, Peter C. .
INTERNATIONAL STATISTICAL REVIEW, 2017, 85 (02) :185-203
[9]   The median hazard ratio: a useful measure of variance and general contextual effects in multilevel survival analysis [J].
Austin, Peter C. ;
Wagner, Philippe ;
Merlo, Juan .
STATISTICS IN MEDICINE, 2017, 36 (06) :928-938
[10]   Variance estimation when using inverse probability of treatment weighting (IPTW) with survival analysis [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2016, 35 (30) :5642-5655