Inverse probability weighted Cox model in multi-site studies without sharing individual-level data

被引:31
作者
Shu, Di [1 ,2 ]
Yoshida, Kazuki [3 ,4 ]
Fireman, Bruce H. [5 ]
Toh, Sengwee [1 ,2 ]
机构
[1] Harvard Med Sch, Dept Populat Med, 401 Pk Dr,Suite 401 East, Boston, MA 02215 USA
[2] Harvard Pilgrim Hlth Care Inst, 401 Pk Dr,Suite 401 East, Boston, MA 02215 USA
[3] Brigham & Womens Hosp, Dept Med, 75 Francis St, Boston, MA 02115 USA
[4] Harvard Med Sch, Boston, MA 02115 USA
[5] Kaiser Permanente Northern Calif, Div Res, Oakland, CA USA
基金
美国医疗保健研究与质量局;
关键词
Cox model; distributed data networks; inverse probability weighting; multi-site study; privacy protection; risk set; MARGINAL STRUCTURAL MODELS; PROPENSITY SCORE METHODS; CLUSTERED SURVIVAL-DATA; WEB SERVICE; PRIVACY; REGRESSION; INFERENCE; SAFETY; SECURE;
D O I
10.1177/0962280219869742
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
The inverse probability weighted Cox proportional hazards model can be used to estimate the marginal hazard ratio. In multi-site studies, it may be infeasible to pool individual-level datasets due to privacy and other considerations. We propose three methods for making inference on hazard ratios without the need for pooling individual-level datasets across sites. The first method requires a summary-level eight-column risk-set table to produce the same hazard ratio estimate and robust sandwich variance estimate as those from the corresponding pooled individual-level data analysis (reference analysis). The second and third methods, which are based on two bootstrap re-sampling strategies, require a summary-level four-column risk-set table and bootstrap-based risk-set tables from each site to produce the same hazard ratio and bootstrap variance estimates as those from their reference analyses. All three methods require only one file transfer between the data-contributing sites and the analysis center. We justify these methods theoretically, illustrate their use, and demonstrate their statistical performance using both simulated and real-world data.
引用
收藏
页码:1668 / 1681
页数:14
相关论文
共 44 条
[1]   Propensity score matching with clustered data. An application to the estimation of the impact of caesarean section on the Apgar score [J].
Arpino, Bruno ;
Cannas, Massimo .
STATISTICS IN MEDICINE, 2016, 35 (12) :2074-2091
[2]   The specification of the propensity score in multilevel observational studies [J].
Arpino, Bruno ;
Mealli, Fabrizia .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (04) :1770-1780
[3]   Variance estimation when using inverse probability of treatment weighting (IPTW) with survival analysis [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2016, 35 (30) :5642-5655
[4]   A Tutorial and Case Study in Propensity Score Analysis: An Application to Estimating the Effect of In-Hospital Smoking Cessation Counseling on Mortality [J].
Austin, Peter C. .
MULTIVARIATE BEHAVIORAL RESEARCH, 2011, 46 (01) :119-151
[5]   The Vaccine Safety Datalink: A Model for Monitoring Immunization Safety [J].
Baggs, James ;
Gee, Julianne ;
Lewis, Edwin ;
Fowler, Gabrielle ;
Benson, Patti ;
Lieu, Tracy ;
Naleway, Allison ;
Klein, Nicola P. ;
Baxter, Roger ;
Belongia, Edward ;
Glanz, Jason ;
Hambidge, Simon J. ;
Jacobsen, Steven J. ;
Jackson, Lisa ;
Nordin, Jim ;
Weintraub, Eric .
PEDIATRICS, 2011, 127 :S45-S53
[6]   Developing the Sentinel System - A National Resource for Evidence Development [J].
Behrman, Rachel E. ;
Benner, Joshua S. ;
Brown, Jeffrey S. ;
McClellan, Mark ;
Woodcock, Janet ;
Platt, Richard .
NEW ENGLAND JOURNAL OF MEDICINE, 2011, 364 (06) :498-499
[7]   Generating survival times to simulate Cox proportional hazards models [J].
Bender, R ;
Augustin, T ;
Blettner, M .
STATISTICS IN MEDICINE, 2005, 24 (11) :1713-1723
[8]  
BINDER DA, 1992, BIOMETRIKA, V79, P139, DOI 10.2307/2337154
[9]   Distributed Health Data Networks A Practical and Preferred Approach to Multi-Institutional Evaluations of Comparative Effectiveness, Safety, and Quality of Care [J].
Brown, Jeffrey S. ;
Holmes, John H. ;
Shah, Kiran ;
Hall, Ken ;
Lazarus, Ross ;
Platt, Richard .
MEDICAL CARE, 2010, 48 (06) :S45-S51
[10]   A random effects model for multivariate failure time data from multicenter clinical trials [J].
Cai, JW ;
Sen, PK ;
Zhou, HB .
BIOMETRICS, 1999, 55 (01) :182-189