Variable selection strategies and its importance in clinical prediction modelling

被引:650
作者
Chowdhury, Mohammad Ziaul Islam [1 ]
Turin, Tanvir C. [1 ,2 ]
机构
[1] Univ Calgary, Cumming Sch Med, Dept Community Hlth Sci, Calgary, AB, Canada
[2] Univ Calgary, Cumming Sch Med, Dept Family Med, Calgary, AB, Canada
关键词
SERIOUS BACTERIAL-INFECTION; REGRESSION-ANALYSIS; EVENTS; HYPERTENSION; FEVER; RISK; RULE;
D O I
10.1136/fmch-2019-000262
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Clinical prediction models are used frequently in clinical practice to identify patients who are at risk of developing an adverse outcome so that preventive measures can be initiated. A prediction model can be developed in a number of ways; however, an appropriate variable selection strategy needs to be followed in all cases. Our purpose is to introduce readers to the concept of variable selection in prediction modelling, including the importance of variable selection and variable reduction strategies. We will discuss the various variable selection techniques that can be applied during prediction model building (backward elimination, forward selection, stepwise selection and all possible subset selection), and the stopping rule/selection criteria in variable selection (p values, Akaike information criterion, Bayesian information criterion and Mallows' C-p statistic). This paper focuses on the importance of including appropriate variables, following the proper steps, and adopting the proper methods when selecting variables for prediction models.
引用
收藏
页数:7
相关论文
共 29 条
[1]   Model selection for ecologists: the worldviews of AIC and BIC [J].
Aho, Ken ;
Derryberry, DeWayne ;
Peterson, Teri .
ECOLOGY, 2014, 95 (03) :631-636
[2]  
[Anonymous], 2009, Clinical prediction models: A practical approach to development, validation, and updating, DOI DOI 10.1007/978-0-387-77244-8
[3]  
[Anonymous], 2010, J Target Meas Anal Mark, DOI [DOI 10.1057/JT.2009.26, 10.1057/jt.2009.26., 10.1057/jt.2009.26]
[4]   What you see may not be what you get: A brief, nontechnical introduction to overfitting in regression-type models [J].
Babyak, MA .
PSYCHOSOMATIC MEDICINE, 2004, 66 (03) :411-421
[5]   Validating and updating a prediction rule for serious bacterial infection in patients with fever without source [J].
Bleeker, S. E. ;
Derksen-Lubsen, G. ;
Grobbee, D. E. ;
T Donders, A. R. ;
Moons, K. G. M. ;
Moll, H. A. .
ACTA PAEDIATRICA, 2007, 96 (01) :100-104
[6]  
Bleeker SE, 2001, ACTA PAEDIATR, V90, P1226, DOI 10.1080/080352501317130236
[7]   Multimodel inference - understanding AIC and BIC in model selection [J].
Burnham, KP ;
Anderson, DR .
SOCIOLOGICAL METHODS & RESEARCH, 2004, 33 (02) :261-304
[8]   Prediction models for the risk of new-onset hypertension in ethnic Chinese in Taiwan [J].
Chien, K-L ;
Hsu, H-C ;
Su, T-C ;
Chang, W-T ;
Sung, F-C ;
Chen, M-F ;
Lee, Y-T .
JOURNAL OF HUMAN HYPERTENSION, 2011, 25 (05) :294-303
[9]  
Dang JT, 2019, SURG ENDOSC, V21, P1
[10]   The interpretation of Mallows's C-p-statistic [J].
Gilmour, SG .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1996, 45 (01) :49-56