An early sepsis prediction model utilizing machine learning and unbalanced data processing in a clinical context

被引:1
作者
Zhou, Luyao [1 ]
Shao, Min [2 ]
Wang, Cui [2 ]
Wang, Yu [1 ]
机构
[1] Anhui Med Univ, Sch Biomed Engn, Hefei 230032, Peoples R China
[2] Anhui Med Univ, Affiliated Hosp 1, Dept Crit Care Med, Hefei, Peoples R China
关键词
Machine learning; Prediction model; Sepsis; Data imbalance; Shapley additive explanation; Clinical decision; BIOMARKERS; DIAGNOSIS;
D O I
10.1016/j.pmedr.2024.102841
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Early and accurate diagnoses of sepsis patients are essential to reduce the mortality. However, the sepsis is still diagnosed in a traditional way in China despite the increasing number of related studies, which may to some extent lead to delays in the treatment. Methods: The study included 2,385 patients, including 364 with sepsis, collected from the First Affiliated Hospital of Anhui Medical University and partner hospitals from April to July 2022. External validation was conducted using the MIMIC-III database (over 60,000 patients from 2001 to 2012) and the eICU Collaborative Research Database (139,000 patients from 2014 to 2015). Multiple algorithm models, along with the SHapley Additive exPlanations (SHAP) analysis, are applied to explore the main risk factors for the accurate prediction of the sepsis. Multiple Imputations for filling missing data and the Synthetic Minority Oversampling (SMOTE) balancing method for balancing data are used for the data processing. Result: Eighteen diagnostic features are used in the predictive model for early sepsis. The Random Forest model has the best performance among all the models, with an Area Under the Curve (AUC) of 87% and an F1-score (F1) of 77%. Moreover, the interpretation from the SHAP analysis is generally consistent with the current clinical situation. Conclusion: The study revealed the relationship between these 18 clinical features and diagnostic outcomes. The results indicate that patients with laboratory values of Systolic Blood Pressure, Albumin, and Heart Rate exceeding certain thresholds are at a high likelihood of developing sepsis.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] A simplified machine learning model utilizing platelet-related genes for predicting poor prognosis in sepsis
    Diao, Yingying
    Zhao, Yan
    Li, Xinyao
    Li, Baoyue
    Huo, Ran
    Han, Xiaoxu
    FRONTIERS IN IMMUNOLOGY, 2023, 14
  • [42] Tensor learning of pointwise mutual information from EHR data for early prediction of sepsis
    Nesaragi, Naimahmed
    Patidar, Shivnarayan
    Aggarwal, Vaneet
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 134
  • [43] Deep Learning from Heterogeneous Sequences of Sparse Medical Data for Early Prediction of Sepsis
    Ul Alam, Mahbub
    Henriksson, Aron
    Valik, John Karlsson
    Ward, Logan
    Naucler, Pontus
    Dalianis, Hercules
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 5: HEALTHINF, 2020, : 45 - 55
  • [44] Early detection of sepsis using machine learning algorithms
    El-Aziz, Rasha M. Abd
    Rayan, Alanazi
    ALEXANDRIA ENGINEERING JOURNAL, 2025, 111 : 47 - 56
  • [45] A machine learning model for the early prediction of ovarian cancer using real world data
    de la Oliva Roque, Victor Manuel
    Esteban-Medina, Alberto
    Alejos Collado, Laura
    Louceras Munecas, Carlos
    Munoyerro-Muniz, Dolores
    Villegas, Roman
    Dopazo Blazquez, Joaquin
    FEBS OPEN BIO, 2024, 14 : 14 - 14
  • [46] Integrative Prediction Model for Radiation Pneumonitis: Genetic and Clinical-Pathological Factors Utilizing Machine Learning
    Choi, S. H.
    Kim, E.
    Seol, M. Y.
    Chung, Y.
    Yoon, H. I.
    JOURNAL OF THORACIC ONCOLOGY, 2024, 19 (10) : S488 - S488
  • [47] An 8-gene machine learning model improves clinical prediction of severe dengue progression
    Liu, Yiran E.
    Saul, Sirle
    Rao, Aditya Manohar
    Robinson, Makeda Lucretia
    Agudelo Rojas, Olga Lucia
    Maria Sanz, Ana
    Verghese, Michelle
    Solis, Daniel
    Sibai, Mamdouh
    Huang, Chun Hong
    Sahoo, Malaya Kumar
    Margarita Gelvez, Rosa
    Bueno, Nathalia
    Estupinan Cardenas, Maria Isabel
    Villar Centeno, Luis Angel
    Rojas Garrido, Elsa Marina
    Rosso, Fernando
    Donato, Michele
    Pinsky, Benjamin A.
    Einav, Shirit
    Khatri, Purvesh
    GENOME MEDICINE, 2022, 14 (01)
  • [48] Machine learning for early prediction of sepsis-associated acute brain injury
    Ge, Chenglong
    Deng, Fuxing
    Chen, Wei
    Ye, Zhiwen
    Zhang, Lina
    Ai, Yuhang
    Zou, Yu
    Peng, Qianyi
    FRONTIERS IN MEDICINE, 2022, 9
  • [49] Machine learning model to predict sepsis in ICU patients with intracerebral hemorrhage
    Lei Tang
    Ye Li
    Ji Zhang
    Feng Zhang
    Qiaoling Tang
    Xiangbin Zhang
    Sai Wang
    Yupeng Zhang
    Siyuan Ma
    Ran Liu
    Lei Chen
    Junyi Ma
    Xuelun Zou
    Tianxing Yao
    Rongmei Tang
    Huifang Zhou
    Lianxu Wu
    Yexiang Yi
    Yi Zeng
    Duolao Wang
    Le Zhang
    Scientific Reports, 15 (1)
  • [50] Application of Machine Learning for Clinical Subphenotype Identification in Sepsis
    Hu, Chang
    Li, Yiming
    Wang, Fengyun
    Peng, Zhiyong
    INFECTIOUS DISEASES AND THERAPY, 2022, 11 (05) : 1949 - 1964