Adding propensity scores to pure prediction models fails to improve predictive performance

被引:1
作者
Nowacki, Amy S. [1 ]
Wells, Brian J. [1 ]
Yu, Changhong [1 ]
Kattan, Michael W. [1 ]
机构
[1] Cleveland Clin, Dept Quantitat Hlth Sci, Cleveland, OH 44106 USA
来源
PEERJ | 2013年 / 1卷
关键词
Prediction; Propensity score; Calibration curve; Concordance index; Multivariable regression; MORTALITY; SURVIVAL; RISK;
D O I
10.7717/peerj.123
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background. Propensity score usage seems to be growing in popularity leading researchers to question the possible role of propensity scores in prediction modeling, despite the lack of a theoretical rationale. It is suspected that such requests are due to the lack of differentiation regarding the goals of predictive modeling versus causal inference modeling. Therefore, the purpose of this study is to formally examine the effect of propensity scores on predictive performance. Our hypothesis is that a multivariable regression model that adjusts for all covariates will perform as well as or better than those models utilizing propensity scores with respect to model discrimination and calibration. Methods. The most commonly encountered statistical scenarios for medical prediction (logistic and proportional hazards regression) were used to investigate this research question. Random cross-validation was performed 500 times to correct for optimism. The multivariable regression models adjusting for all covariates were compared with models that included adjustment for or weighting with the propensity scores. The methods were compared based on three predictive performance measures: (1) concordance indices; (2) Brier scores; and (3) calibration curves. Results. Multivariable models adjusting for all covariates had the highest average concordance index, the lowest average Brier score, and the best calibration. Propensity score adjustment and inverse probability weighting models without adjustment for all covariates performed worse than full models and failed to improve predictive performance with full covariate adjustment. Conclusion. Propensity score techniques did not improve prediction performance measures beyond multivariable adjustment. Propensity scores are not recommended if the analytical goal is pure prediction modeling.
引用
收藏
页数:12
相关论文
共 36 条
[1]   Cancer-Specific and Other-Cause Mortality After Radical Prostatectomy Versus Observation in Patients with Prostate Cancer: Competing-Risks Analysis of a Large North American Population-Based Cohort [J].
Abdollah, Firas ;
Sun, Maxine ;
Schmitges, Jan ;
Tian, Zhe ;
Jeldres, Claudio ;
Briganti, Alberto ;
Shariat, Shahrohk F. ;
Perrotte, Paul ;
Montorsi, Francesco ;
Karakiewicz, Pierre I. .
EUROPEAN UROLOGY, 2011, 60 (05) :920-930
[2]  
[Anonymous], BIOMETRICS
[3]  
[Anonymous], MATCHED SAMPLING CAU
[4]   A propensity analysis of the risk of vascular complications after cardiac catheterization procedures with the use of vascular closure devices [J].
Arora, Nipun ;
Matheny, Michael E. ;
Sepke, Carrie ;
Resnic, Frederic S. .
AMERICAN HEART JOURNAL, 2007, 153 (04) :606-611
[5]   A comparison of the ability of different propensity score models to balance measured variables between treated and untreated subjects: a Monte Carlo study [J].
Austin, Peter C. ;
Grootendorst, Paul ;
Anderson, Geoffrey M. .
STATISTICS IN MEDICINE, 2007, 26 (04) :734-753
[6]  
Brier G. W., 1950, Monthly weather review, V78, P1, DOI [DOI 10.1175/1520-0493(1950)078, DOI 10.1175/1520-0493(1950)078ANDLT
[7]  
0001:VOFEITANDGT
[8]  
2.0.CO
[9]  
2, 10.1175/1520-0493(1950)078()0001:VOFEIT()2.0.CO
[10]  
2, DOI 10.1175/1520-0493(1950)0782.0.CO