Penalized maximum likelihood estimation to directly adjust diagnostic and prognostic prediction models for overoptimism: a clinical example

被引:134
作者
Moons, KGM
Donders, ART
Steyerberg, EW
Harrell, FE
机构
[1] Univ Utrecht, Med Ctr, Julius Ctr Hlth Sci & Primary Care, NL-3508 TA Utrecht, Netherlands
[2] Univ Utrecht, Ctr Biostat, Utrecht, Netherlands
[3] Erasmus Univ, Med Ctr, Dept Publ Hlth, Rotterdam, Netherlands
[4] Vanderbilt Univ, Med Ctr, Dept Biostat, Nashville, TN USA
关键词
prediction research; overoptimism; overfitting; penalization; bootstrapping; shrinkage;
D O I
10.1016/j.jclinepi.2004.01.020
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background and objective: There is growing interest in developing prediction models. The accuracy of such models when applied in new patient samples is commonly lower than estimated from the development sample. This may be because of differences between the samples and/or because the developed model was overfitted (too optimistic). Various methods, including bootstrapping techniques exist for afterwards shrinking the regression coefficients and the model's discrimination and calibration for overoptimism. Penalized maximum likelihood estimation (PMLE) is a more rigorous method because adjustment for overfitting is directly built into the model development, instead of relying on shrinkage afterwards. PMLE has been described mainly in the statistical literature and is rarely applied to empirical data. Using empirical data, we illustrate the use of PMLE to develop a prediction model. Methods: The accuracy of the final PMLE model will be contrasted with the final models derived by ordinary stepwise logistic regression without and with shrinkage afterwards. The potential advantages and disadvantages of PMLE over the other two strategies are discussed. Results: PMLE leads to smaller prediction errors, provides for model reduction to a user-defined degree, and may differently shrink each predictor for overoptimism without sacrificing much discriminative accuracy of the model. Conclusion: PMLE is an easily applicable and promising method to directly adjust clinical prediction models for overoptimism. (C) 2004 Elsevier Inc. All rights reserved.
引用
收藏
页码:1262 / 1270
页数:9
相关论文
共 42 条
[1]   BOOTSTRAP INVESTIGATION OF THE STABILITY OF A COX REGRESSION-MODEL [J].
ALTMAN, DG ;
ANDERSEN, PK .
STATISTICS IN MEDICINE, 1989, 8 (07) :771-783
[2]  
Altman DG, 2000, STAT MED, V19, P453, DOI 10.1002/(SICI)1097-0258(20000229)19:4<453::AID-SIM350>3.3.CO
[3]  
2-X
[4]   Simplifying a prognostic model: a simulation study based on clinical data [J].
Ambler, G ;
Brady, AR ;
Royston, P .
STATISTICS IN MEDICINE, 2002, 21 (24) :3803-3822
[5]  
ATKINSON AC, 1980, BIOMETRIKA, V67, P413, DOI 10.1093/biomet/67.2.413
[6]   External validation of the SAPS II, APACHE II and APACHE III prognostic models in South England: a multicentre study [J].
Beck, DH ;
Smith, GB ;
Pappachan, JV ;
Millar, B .
INTENSIVE CARE MEDICINE, 2003, 29 (02) :249-256
[7]   External validation is necessary in, prediction research: A clinical example [J].
Bleeker, SE ;
Moll, HA ;
Steyerberg, EW ;
Donders, ART ;
Derksen-Lubsen, G ;
Grobbee, DE ;
Moons, KGM .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2003, 56 (09) :826-832
[8]   MODEL UNCERTAINTY, DATA MINING AND STATISTICAL-INFERENCE [J].
CHATFIELD, C .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1995, 158 :419-466
[9]   Importance of events per independent variable in proportional hazards analysis .1. Background, goals, and general strategy [J].
Concato, J ;
Peduzzi, P ;
Holford, TR ;
Feinstein, AR .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1995, 48 (12) :1495-1501
[10]  
Draper N. R., 1998, Applied Regression Analysis, DOI DOI 10.1002/9781118625590.CH15