Comparison of Multimodal Deep Learning Approaches for Predicting Clinical Deterioration in Ward Patients: Observational Cohort Study

被引:0
作者
Kotula, Charles A. [1 ]
Martin, Jennie [1 ]
Carey, Kyle A. [2 ]
Edelson, Dana P. [2 ]
Dligach, Dmitriy [3 ]
Mayampurath, Anoop [1 ,4 ]
Afshar, Majid [1 ,4 ]
Churpek, Matthew M. [1 ,4 ]
机构
[1] Univ Wisconsin Madison, Dept Med, 610 Walnut St, Madison, WI 53792 USA
[2] Univ Chicago, Dept Med, Chicago, IL USA
[3] Loyola Univ Chicago, Dept Comp Sci, Chicago, IL USA
[4] Univ Wisconsin Madison, Dept Biostat & Med Informat, Madison, WI USA
基金
美国国家卫生研究院;
关键词
clinical deterioration; deep learning; time series; artificial intelligence; machine learning; EARLY WARNING SCORE; HOSPITAL MORTALITY; INTENSIVE-CARE; CARDIAC-ARREST; IDENTIFICATION; VALIDATION; SYSTEM; IMPACT; TEXT;
D O I
10.2196/75340
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Implementing machine learning models to identify clinical deterioration in the wards is associated with decreased morbidity and mortality. However, these models have high false positive rates and only use structured data. Objective: We aimed to compare models with and without information from clinical notes for predicting deterioration. Methods: Adults admitted to the wards at the University of Chicago (development cohort) and University of Wisconsin-Madison (external validation cohort) were included. Predictors consisted of structured and unstructured variables extracted from notes as concept unique identifiers (CUIs). We parameterized CUIs in 5 ways: standard tokenization (ST), International Classification of Diseases rollup using tokenization (ICDR-T), International Classification of Diseases rollup using binary variables (ICDR-BV), concept unique identifiers as SapBERT embedding (SE), and concept unique identifier clustering using SapBERT embeddings (CC). Each parameterization method combined with structured data and each structured data-only method were compared for predicting intensive care unit transfer or death in the next 24 hours using deep recurrent neural networks. Results: The development (University of Chicago) cohort included 284,302 patients, while the external validation (University of Wisconsin-Madison) cohort included 248,055 patients. In total, 4.9% (n=26,281) of patients experienced the outcome. The SE model achieved the highest area under the precision-recall curve (0.208), followed by CC (0.199) and the structured-only model (0.199), ICDR-BV (0.194), ICDR-T (0.166), and ST (0.158). The CC and structured-only models achieved the highest area under the receiver operating characteristic (0.870), followed by ICDR-T (0.867), ICDR-BV (0.866), ST (0.860), and SE (0.859). Regarding sensitivity and positive predictive value, the CC model achieved the greatest positive predictive value (12.53%) and sensitivity (52.15%) at the cutoff that flagged 5% of the observations in the test set. At the 15% cutoff, the ICDR-T, CC, and ICDR-BV models tied for the highest positive predictive value at 5.67%, while their sensitivities were 70.95%, 70.92%, and 70.86%, respectively. All models were well calibrated, achieving Brier scores in the range of 0.011-0.012. The modified integrated gradients method revealed that CUIs corresponding to terms such as "NPO-nothing by mouth," "chemotherapy," "transplanted tissue," and "dialysis procedure" were most predictive of deterioration. Conclusions: A multimodal model combining structured data with embeddings using SapBERT had the highest area under the precision-recall curve, but performance was similar between models with and without CUIs. Although the addition of CUIs from notes to structured data did not meaningfully improve model performance for predicting clinical deterioration, models using CUIs could provide clinicians with relevant information and additional clinical context for supporting decision-making.
引用
收藏
页数:14
相关论文
共 46 条
[1]   Deployment of Real-time Natural Language Processing and Deep Learning Clinical Decision Support in the Electronic Health Record: Pipeline Implementation for an Opioid Misuse Screener in Hospitalized Adults [J].
Afshar, Majid ;
Adelaine, Sabrina ;
Resnik, Felice ;
Mundt, Marlon P. ;
Long, John ;
Leaf, Margaret ;
Ampian, Theodore ;
Wills, Graham J. ;
Schnapp, Benjamin ;
Chao, Michael ;
Brown, Randy ;
Joyce, Cara ;
Sharma, Brihat ;
Dligach, Dmitriy ;
Burnside, Elizabeth S. ;
Mahoney, Jane ;
Churpek, Matthew M. ;
Patterson, Brian W. ;
Liao, Frank .
JMIR MEDICAL INFORMATICS, 2023, 11
[2]  
[Anonymous], 2024, UMLS knowledge sources: file downloads-release 2024AA
[3]  
[Anonymous], 2024, SMPH (public)/department of medicine/UW-ICU-data-science-lab-public/ctakesprocessing. GitLab
[4]   Delayed Rapid Response Team Activation Is Associated With Increased Hospital Mortality, Morbidity, and Length of Stay in a Tertiary Care Institution [J].
Barwise, Amelia ;
Thongprayoon, Charat ;
Gajic, Ognjen ;
Jensen, Jeffrey ;
Herasevich, Vitaly ;
Pickering, Brian W. .
CRITICAL CARE MEDICINE, 2016, 44 (01) :54-63
[5]   Recognising clinical instability in hospital patients before cardiac arrest or unplanned admission to intensive care - A pilot study in a tertiary-care hospital [J].
Buist, MD ;
Jarmolowski, E ;
Burton, PR ;
Bernard, SA ;
Waxman, BP ;
Anderson, J .
MEDICAL JOURNAL OF AUSTRALIA, 1999, 171 (01) :22-25
[6]  
Che Zhengping, 2016, AMIA Annu Symp Proc, V2016, P371
[7]   Using recurrent neural network models for early detection of heart failure onset [J].
Choi, Edward ;
Schuetz, Andy ;
Stewart, Walter F. ;
Sun, Jimeng .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (02) :361-370
[8]   Multicenter Development and Prospective Validation of eCARTv5: A Gradient-Boosted Machine-Learning Early Warning Score [J].
Churpek, Matthew M. ;
Carey, Kyle A. ;
Snyder, Ashley ;
Winslow, Christopher J. ;
Gilbert, Emily ;
Shah, Nirav S. ;
Patterson, Brian W. ;
Afshar, Majid ;
Weiss, Alan ;
Amin, Devendra N. ;
Rhodes, Deborah J. ;
Edelson, Dana P. .
CRITICAL CARE EXPLORATIONS, 2025, 7 (04) :e1232
[9]   Association between intensive care unit transfer delay and hospital mortality: A multicenter investigation [J].
Churpek, Matthew M. ;
Wendlandt, Blair ;
Zadravecz, Frank J. ;
Adhikari, Richa ;
Winslow, Christopher ;
Edelson, Dana P. .
JOURNAL OF HOSPITAL MEDICINE, 2016, 11 (11) :757-762
[10]   Comparison of Natural Language Processing of Clinical Notes With a Validated Risk-Stratification Tool to Predict Severe Maternal Morbidity [J].
Clapp, Mark A. ;
Kim, Ellen ;
James, Kaitlyn E. ;
Perlis, Roy H. ;
Kaimal, Anjali J. ;
McCoy, Thomas H. ;
Easter, Sarah Rae .
JAMA NETWORK OPEN, 2022, 5 (10) :E2234924