Development and external validation of multimodal postoperative acute kidney injury risk machine learning models

Cited by: 2
Authors
Karway, George K. [1 ]
Koyner, Jay L. [2 ]
Caskey, John [1 ]
Spicer, Alexandra B. [1 ]
Carey, Kyle A. [2 ]
Gilbert, Emily R. [3 ]
Dligach, Dmitriy [4 ]
Mayampurath, Anoop [1 ,5 ]
Afshar, Majid [1 ,5 ]
Churpek, Matthew M. [1 ,5 ,6 ]
Affiliations
[1] Univ Wisconsin, Dept Med, 600 Highland Ave, Madison, WI 53792 USA
[2] Univ Chicago, Dept Med, Sect Nephrol, Chicago, IL 60637 USA
[3] Loyola Univ Chicago, Dept Med, Chicago, IL 60153 USA
[4] Loyola Univ Chicago, Dept Comp Sci, Chicago, IL 60626 USA
[5] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53726 USA
[6] Univ Wisconsin, Dept Biostat & Med Informat, 600 Highland Ave, Madison, WI 53792 USA
Keywords
multimodal models; artificial intelligence; intensive care unit; machine learning; acute kidney injury; natural language processing; ELECTRONIC MEDICAL-RECORDS; DE-IDENTIFICATION; PREDICTION; MORTALITY; INFORMATION; TEXT; DISCRIMINATION; CALIBRATION; DIAGNOSIS; OUTCOMES
DOI
10.1093/jamiaopen/ooad109
Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Abstract
Objectives: To develop and externally validate machine learning models using structured and unstructured electronic health record data to predict postoperative acute kidney injury (AKI) across inpatient settings.

Materials and Methods: Data for adult postoperative admissions to the Loyola University Medical Center (2009-2017) were used for model development, and admissions to the University of Wisconsin-Madison (2009-2020) were used for validation. Structured features included demographics, vital signs, laboratory results, and nurse-documented scores. Unstructured text from clinical notes was converted into concept unique identifiers (CUIs) using the clinical Text Analysis and Knowledge Extraction System. The primary outcome was the development of Kidney Disease: Improving Global Outcomes (KDIGO) stage 2 AKI within 7 days after leaving the operating room. We derived unimodal extreme gradient boosting machines (XGBoost) and elastic net logistic regression (GLMNET) models using structured-only data and multimodal models combining structured data with CUI features. Models were compared using the area under the receiver operating characteristic curve (AUROC), with DeLong's test for statistical differences.

Results: The study cohort included 138,389 adult patient admissions (mean [SD] age 58 [16] years; 11,506 [8%] African-American; and 70,826 [51%] female) across the 2 sites. Of those, 2959 (2.1%) developed stage 2 AKI or higher. Across all data types, XGBoost outperformed GLMNET (mean AUROC 0.81 [95% confidence interval (CI), 0.80-0.82] vs 0.78 [95% CI, 0.77-0.79]). The multimodal XGBoost model incorporating CUIs parameterized as term frequency-inverse document frequency (TF-IDF) showed the highest discrimination (AUROC 0.82 [95% CI, 0.81-0.83]), exceeding that of the unimodal models (AUROC 0.79 [95% CI, 0.78-0.80]).

Discussion: A multimodality approach with structured data and TF-IDF weighting of CUIs increased model performance over structured data-only models.

Conclusion: These findings highlight the predictive power of CUIs when merged with structured data for clinical prediction models, which may improve the detection of postoperative AKI.

Acute kidney injury (AKI) after an operation, called postoperative AKI, is common in hospitalized patients and associated with increased morbidity and mortality. Early detection of high-risk patients could facilitate timely treatment and improve outcomes. Although a few studies have developed machine learning (ML) models to identify patients with postoperative AKI, these are primarily limited to structured data (eg, laboratory values) and ignore predictors from clinical notes. Further, models built from clinical notes are often not externally validated because doing so risks leaking protected health information.

Given these limitations in the field, we developed and externally validated ML models to predict postoperative AKI using structured data and information from clinical notes. To preserve patient privacy, we used concept unique identifiers (CUIs), which are de-identified medical terms from clinical notes. We compared unimodal models with structured data to multimodal models with CUIs plus structured data, as well as different approaches to modeling the CUI data. We found that multimodal models significantly improved model performance compared to unimodal models. We also found that normalizing CUI data based on term frequency yielded the highest performance. In conclusion, using CUIs to account for information in clinical notes adds significant value for predicting postoperative AKI.
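The multimodal feature construction and the AUROC comparison described in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' pipeline: the abstract does not state which TF-IDF variant was used, so the smoothed inverse document frequency below is an assumption, and the CUI codes, structured values, and function names (`tfidf_cui_features`, `auroc`) are hypothetical placeholders.

```python
import math
from collections import Counter


def tfidf_cui_features(docs):
    """Turn per-admission CUI lists into TF-IDF rows (assumed smoothed-IDF variant)."""
    vocab = sorted({cui for doc in docs for cui in doc})
    n_docs = len(docs)
    # Document frequency: number of admissions whose notes mention each CUI.
    df = Counter(cui for doc in docs for cui in set(doc))
    # Smoothed IDF, ln((1 + n) / (1 + df)) + 1; an assumption, not taken from the paper.
    idf = {cui: math.log((1 + n_docs) / (1 + df[cui])) + 1.0 for cui in vocab}
    rows = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        rows.append([(counts[cui] / total) * idf[cui] if total else 0.0 for cui in vocab])
    return vocab, rows


def auroc(labels, scores):
    """AUROC as the probability that a positive case outranks a negative one (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: placeholder CUI codes and two made-up structured features.
notes_cuis = [
    ["C0000001", "C0000002", "C0000001"],
    ["C0000002"],
    ["C0000003", "C0000002"],
]
structured = [[71.0, 1.4], [45.0, 0.8], [63.0, 1.1]]

vocab, cui_rows = tfidf_cui_features(notes_cuis)
# Multimodal input: structured columns followed by the TF-IDF CUI columns.
multimodal = [s + c for s, c in zip(structured, cui_rows)]
```

In a real workflow, the `multimodal` rows would be passed to a classifier such as XGBoost or an elastic net model, and `auroc` would be applied to the predicted risks on the external validation cohort.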
Pages: 9