An early sepsis prediction model utilizing machine learning and unbalanced data processing in a clinical context

Cited by: 1
Authors
Zhou, Luyao [1]
Shao, Min [2]
Wang, Cui [2]
Wang, Yu [1]
Affiliations
[1] Anhui Med Univ, Sch Biomed Engn, Hefei 230032, Peoples R China
[2] Anhui Med Univ, Affiliated Hosp 1, Dept Crit Care Med, Hefei, Peoples R China
Keywords
Machine learning; Prediction model; Sepsis; Data imbalance; Shapley additive explanation; Clinical decision; Biomarkers; Diagnosis
DOI
10.1016/j.pmedr.2024.102841
Chinese Library Classification (CLC) number
R1 [Preventive medicine, hygiene]
Discipline classification codes
1004; 120402
Abstract
Background: Early and accurate diagnosis of sepsis is essential to reduce mortality. However, despite the growing number of related studies, sepsis is still diagnosed in the traditional way in China, which may delay treatment to some extent.
Methods: The study included 2,385 patients, 364 of them with sepsis, collected from the First Affiliated Hospital of Anhui Medical University and partner hospitals from April to July 2022. External validation was conducted using the MIMIC-III database (over 60,000 patients from 2001 to 2012) and the eICU Collaborative Research Database (139,000 patients from 2014 to 2015). Multiple algorithm models, together with SHapley Additive exPlanations (SHAP) analysis, were applied to explore the main risk factors for accurate prediction of sepsis. For data processing, multiple imputation was used to fill in missing data and the Synthetic Minority Oversampling Technique (SMOTE) was used to balance the classes.
Results: Eighteen diagnostic features are used in the predictive model for early sepsis. The Random Forest model performs best among all the models, with an Area Under the Curve (AUC) of 87% and an F1-score (F1) of 77%. Moreover, the interpretation from the SHAP analysis is generally consistent with current clinical practice.
Conclusion: The study revealed the relationship between these 18 clinical features and diagnostic outcomes. The results indicate that patients whose Systolic Blood Pressure, Albumin, and Heart Rate values exceed certain thresholds have a high likelihood of developing sepsis.
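The abstract names SMOTE as the balancing method but does not describe its implementation. With 364 sepsis cases among 2,385 patients (about 15%), the positive class is clearly the minority. The following is a minimal sketch of the core SMOTE idea only, under stated assumptions: the function name `smote`, its parameters, and the toy feature values are hypothetical and are not taken from the study. In practice a library implementation such as `imblearn.over_sampling.SMOTE` would typically be used, and oversampling is normally applied to the training split only.

```python
import math
import random


def smote(minority, n_synthetic, k=5, seed=0):
    """Create n_synthetic points by interpolating between minority samples
    and their k nearest minority-class neighbours (the core SMOTE idea)."""
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_synthetic):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base point.
        others = [j for j in range(len(minority)) if j != i]
        others.sort(key=lambda j: math.dist(base, minority[j]))
        neighbour = minority[rng.choice(others[:k])]
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical toy data: [heart rate, systolic blood pressure]; not from the study.
sepsis = [[118.0, 92.0], [125.0, 88.0], [110.0, 95.0], [130.0, 85.0]]
non_sepsis_count = 10

# Generate just enough synthetic minority rows to match the majority class size.
new_rows = smote(sepsis, n_synthetic=non_sepsis_count - len(sepsis), k=3)
balanced_sepsis = sepsis + new_rows
print(len(balanced_sepsis), non_sepsis_count)
```

Because each synthetic row is a convex combination of two real minority rows, it always lies on the line segment between them, so no feature value falls outside the range observed in the minority class.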
Pages: 8