Models with interactions overestimated heterogeneity of treatment effects and were prone to treatment mistargeting

被引:54
作者
van Klaveren, David [1 ,2 ]
Balan, Theodor A. [3 ]
Steyerberg, Ewout W. [1 ,3 ]
Kent, David M. [2 ]
机构
[1] Erasmus Univ, Dept Publ Hlth, Med Ctr, Rotterdam, Netherlands
[2] Tufts Med Ctr, Predict Analyt & Comparat Effectiveness Ctr, Inst Clin Res & Hlth Policy Studies, Boston, MA 02111 USA
[3] Leiden Univ, Med Ctr, Dept Biomed Data Sci, Leiden, Netherlands
关键词
Personalized medicine; Heterogeneity of treatment effect; Treatment benefit; Prediction models; Regression analysis; Penalized regression analysis; INDIVIDUAL PATIENTS; VALIDATION; BENEFIT;
D O I
10.1016/j.jclinepi.2019.05.029
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives: We aimed to compare the performance of different regression modeling approaches for the prediction of heterogeneous treatment effects. Study Design and Setting: We simulated trial samples (n = 3,600; 80% power for a treatment odds ratio of 0.8) from a superpopulation (N = 1,000,000) with 12 binary risk predictors, both without and with six true treatment interactions. We assessed predictions of treatment benefit for four regression models: a "risk model" (with a constant effect of treatment assignment) and three "effect models" (including interactions of risk predictors with treatment assignment). Three novel performance measures were evaluated: calibration for benefit (i.e., observed vs. predicted risk difference in treated vs. untreated), discrimination for benefit, and prediction error for benefit. Results: The risk modeling approach was well-calibrated for benefit, whereas effect models were consistently overfit, even with doubled sample sizes. Penalized regression reduced miscalibration of the effect models considerably. In terms of discrimination and prediction error, the risk modeling approach was superior in the absence of true treatment effect interactions, whereas penalized regression was optimal in the presence of true treatment interactions. Conclusion: A risk modeling approach yields models consistently well calibrated for benefit. Effect modeling may improve discrimination for benefit in the presence of true interactions but is prone to overfitting. Hence, effect models-including only plausible interactions-should be fitted using penalized regression. (C) 2019 Elsevier Inc. All rights reserved.
引用
收藏
页码:72 / 83
页数:12
相关论文
共 22 条
[1]  
Abadie A, 2013, NATL BUREAU EC RES W
[2]   Events per variable (EPV) and the relative performance of different strategies for estimating the out-of-sample validity of logistic regression models [J].
Austin, Peter C. ;
Steyerberg, Ewout W. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (02) :796-808
[3]   Benefit and harm of intensive blood pressure treatment: Derivation and validation of risk models using data from the SPRINT and ACCORD trials [J].
Basu, Sanjay ;
Sussman, Jeremy B. ;
Rigdon, Joseph ;
Steimle, Lauren ;
Denton, Brian T. ;
Hayward, Rodney A. .
PLOS MEDICINE, 2017, 14 (10)
[4]   Three simple rules to ensure reasonably credible subgroup analyses [J].
Burke, James F. ;
Sussman, Jeremy B. ;
Kent, David M. ;
Hayward, Rodney A. .
BMJ-BRITISH MEDICAL JOURNAL, 2015, 351
[5]   Using Internally Developed Risk Models to Assess Heterogeneity in Treatment Effects in Clinical Trials [J].
Burke, James F. ;
Hayward, Rodney A. ;
Nelson, Jason P. ;
Kent, David M. .
CIRCULATION-CARDIOVASCULAR QUALITY AND OUTCOMES, 2014, 7 (01) :163-169
[6]   Multiplicity considerations in subgroup analysis [J].
Dmitrienko, Alex ;
Millen, Brian ;
Lipkovich, Ilya .
STATISTICS IN MEDICINE, 2017, 36 (28) :4446-4454
[7]   Anatomical and clinical characteristics to guide decision making between coronary artery bypass surgery and percutaneous coronary intervention for individual patients: development and validation of SYNTAX score II [J].
Farooq, Vasim ;
van Klaveren, David ;
Steyerberg, Ewout W. ;
Meliga, Emanuele ;
Vergouwe, Yvonne ;
Chieffo, Alaide ;
Kappetein, Arie Pieter ;
Colombo, Antonio ;
Holmes, David R., Jr. ;
Mack, Michael ;
Feldman, Ted ;
Morice, Marie-Claude ;
Stahle, Elisabeth ;
Onuma, Yoshinobu ;
Morel, Marie-angele ;
Garcia-Garcia, Hector M. ;
van Es, Gerrit Anne ;
Dawkins, Keith D. ;
Mohr, Friedrich W. ;
Serruys, Patrick W. .
LANCET, 2013, 381 (9867) :639-650
[8]  
Harrell FE, 2015, SPRINGER SER STAT, DOI 10.1007/978-3-319-19425-7
[9]   Limitations of applying summary results of clinical trials to individual patients - The need for risk stratification [J].
Kent, David M. ;
Hayward, Rodney A. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2007, 298 (10) :1209-1212
[10]   The stroke-thrombolytic predictive instrument - A predictive instrument for intravenous thrombolysis in acute ischemic stroke [J].
Kent, David M. ;
Selker, Harry P. ;
Ruthazer, Robin ;
Bluhmki, Erich ;
Hacke, Werner .
STROKE, 2006, 37 (12) :2957-2962