Vital signs-based deterioration prediction model assumptions can lead to losses in prediction performance

被引:4
作者
Blythe, Robin [1 ]
Parsons, Rex [1 ]
Barnett, Adrian G. [1 ]
McPhail, Steven M. [1 ,2 ]
White, Nicole M. [1 ,3 ]
机构
[1] Queensland Univ Technol, Fac Hlth, Australian Ctr Hlth Serv Innovat, Ctr Healthcare Transformat,Sch Publ Hlth & Social, 60 Musk Ave, Kelvin Grove, Qld 4059, Australia
[2] Metro South Hlth, Digital Hlth & Informat, 199 Ipswich Rd, Brisbane, Qld 4102, Australia
[3] 60 Musk Ave, Brisbane, Qld 4059, Australia
基金
澳大利亚国家健康与医学研究理事会;
关键词
Computerized medical record systems; Clinical prediction model; Missing data; Clinical deterioration; Early warning score; Area under curve; MISSING DATA; RISK; IMPUTATION; BIAS;
D O I
10.1016/j.jclinepi.2023.05.020
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective: Vital signs-based models are complicated by repeated measures per patient and frequently missing data. This paper inves-tigated the impacts of common vital signs modeling assumptions during clinical deterioration prediction model development.Study Design and Setting: Electronic medical record (EMR) data from five Australian hospitals (1 January 2019-31 December 2020) were used. Summary statistics for each observation's prior vital signs were created. Missing data patterns were investigated using boosted decision trees, then imputed with common methods. Two example models predicting in-hospital mortality were developed, as follows: lo-gistic regression and eXtreme Gradient Boosting. Model discrimination and calibration were assessed using the C-statistic and nonpara-metric calibration plots.Results: The data contained 5,620,641 observations from 342,149 admissions. Missing vitals were associated with observation fre-quency, vital sign variability, and patient consciousness. Summary statistics improved discrimination slightly for logistic regression and markedly for eXtreme Gradient Boosting. Imputation method led to notable differences in model discrimination and calibration. Model calibration was generally poor.Conclusion: Summary statistics and imputation methods can improve model discrimination and reduce bias during model develop-ment, but it is questionable whether these differences are clinically significant. Researchers should consider why data are missing during model development and how this may impact clinical utility.& COPY; 2023 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:106 / 115
页数:10
相关论文
共 49 条
  • [41] Van Buuren S, 1999, STAT MED, V18, P681, DOI 10.1002/(SICI)1097-0258(19990330)18:6<681::AID-SIM71>3.0.CO
  • [42] 2-R
  • [43] van Buuren S, 2011, J STAT SOFTW, V45, P1
  • [44] Regression shrinkage methods for clinical prediction models do not guarantee improved performance: Simulation study
    Van Calster, Ben
    van Smeden, Maarten
    De Cock, Bavo
    Steyerberg, Ewout W.
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (11) : 3166 - 3178
  • [45] Calibration: the Achilles heel of predictive analytics
    van Calster, Ben
    McLernon, David J.
    van Smeden, Maarten
    Wynants, Laure
    Steyerberg, Ewout W.
    [J]. BMC MEDICINE, 2019, 17 (01)
  • [46] Developing, validating, updating and judging the impact of prognostic models for respiratory diseases
    Van Royen, Florien S.
    Moons, Karel G. M.
    Geersing, Geert-Jan
    Van Smeden, Maarten
    [J]. EUROPEAN RESPIRATORY JOURNAL, 2022, 60 (03)
  • [47] Development and validation of a prediction model with missing predictor data: a practical approach
    Vergouwe, Yvonne
    Royston, Patrick
    Moons, Karel G. M.
    Altman, Douglas G.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2010, 63 (02) : 205 - 214
  • [48] Prediction models for diagnosis and prognosis of covid-19 infection: systematic review and critical appraisal
    Wynants, Laure
    Van Calster, Ben
    Collins, Gary S.
    Riley, Richard D.
    Heinze, Georg
    Schuit, Ewoud
    Albu, Elena
    Arshi, Banafsheh
    Bellou, Vanesa
    Bonten, Marc M. J.
    Dahly, Darren L.
    Damen, Johanna A.
    Debray, Thomas P. A.
    de Jong, Valentijn M. T.
    De Vos, Maarten
    Dhiman, Paula
    Ensor, Joie
    Gao, Shan
    Haller, Maria C.
    Harhay, Michael O.
    Henckaerts, Liesbet
    Heus, Pauline
    Hoogland, Jeroen
    Hudda, Mohammed
    Jenniskens, Kevin
    Kammer, Michael
    Kreuzberger, Nina
    Lohmann, Anna
    Levis, Brooke
    Luijken, Kim
    Ma, Jie
    Martin, Glen P.
    McLernon, David J.
    Andaur Navarro, Constanza L.
    Reitsma, Johannes B.
    Sergeant, Jamie C.
    Shi, Chunhu
    Skoetz, Nicole
    Smits, Luc J. M.
    Snell, Kym I. E.
    Sperrin, Matthew
    Spijker, Rene
    Steyerberg, Ewout W.
    Takada, Toshihiko
    Tzoulaki, Ioanna
    van Kuijk, Sander M. J.
    van Bussel, Bas C. T.
    van der Horst, Iwan C. C.
    Reeve, Kelly
    van Royen, Florien S.
    [J]. BMJ-BRITISH MEDICAL JOURNAL, 2020, 369
  • [49] Multiple imputation for time series data with Amelia package
    Zhang, Zhongheng
    [J]. ANNALS OF TRANSLATIONAL MEDICINE, 2016, 4 (03)