Variable selection strategies and its importance in clinical prediction modelling

被引:559
作者
Chowdhury, Mohammad Ziaul Islam [1 ]
Turin, Tanvir C. [1 ,2 ]
机构
[1] Univ Calgary, Cumming Sch Med, Dept Community Hlth Sci, Calgary, AB, Canada
[2] Univ Calgary, Cumming Sch Med, Dept Family Med, Calgary, AB, Canada
关键词
SERIOUS BACTERIAL-INFECTION; REGRESSION-ANALYSIS; EVENTS; HYPERTENSION; FEVER; RISK; RULE;
D O I
10.1136/fmch-2019-000262
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Clinical prediction models are used frequently in clinical practice to identify patients who are at risk of developing an adverse outcome so that preventive measures can be initiated. A prediction model can be developed in a number of ways; however, an appropriate variable selection strategy needs to be followed in all cases. Our purpose is to introduce readers to the concept of variable selection in prediction modelling, including the importance of variable selection and variable reduction strategies. We will discuss the various variable selection techniques that can be applied during prediction model building (backward elimination, forward selection, stepwise selection and all possible subset selection), and the stopping rule/selection criteria in variable selection (p values, Akaike information criterion, Bayesian information criterion and Mallows' C-p statistic). This paper focuses on the importance of including appropriate variables, following the proper steps, and adopting the proper methods when selecting variables for prediction models.
引用
收藏
页数:7
相关论文
共 29 条
  • [1] Model selection for ecologists: the worldviews of AIC and BIC
    Aho, Ken
    Derryberry, DeWayne
    Peterson, Teri
    [J]. ECOLOGY, 2014, 95 (03) : 631 - 636
  • [2] [Anonymous], 2008, Clinical prediction models: a practical approach to development, validation, and updating
  • [3] What you see may not be what you get: A brief, nontechnical introduction to overfitting in regression-type models
    Babyak, MA
    [J]. PSYCHOSOMATIC MEDICINE, 2004, 66 (03): : 411 - 421
  • [4] Validating and updating a prediction rule for serious bacterial infection in patients with fever without source
    Bleeker, S. E.
    Derksen-Lubsen, G.
    Grobbee, D. E.
    T Donders, A. R.
    Moons, K. G. M.
    Moll, H. A.
    [J]. ACTA PAEDIATRICA, 2007, 96 (01) : 100 - 104
  • [5] Bleeker SE, 2001, ACTA PAEDIATR, V90, P1226, DOI 10.1080/080352501317130236
  • [6] Multimodel inference - understanding AIC and BIC in model selection
    Burnham, KP
    Anderson, DR
    [J]. SOCIOLOGICAL METHODS & RESEARCH, 2004, 33 (02) : 261 - 304
  • [7] Prediction models for the risk of new-onset hypertension in ethnic Chinese in Taiwan
    Chien, K-L
    Hsu, H-C
    Su, T-C
    Chang, W-T
    Sung, F-C
    Chen, M-F
    Lee, Y-T
    [J]. JOURNAL OF HUMAN HYPERTENSION, 2011, 25 (05) : 294 - 303
  • [8] Dang JT, 2019, SURG ENDOSC, V21, P1
  • [9] The interpretation of Mallows's C-p-statistic
    Gilmour, SG
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1996, 45 (01) : 49 - 56
  • [10] Guyon I., 2003, J MACH LEARN RES, V3, P1157