Machine learning-driven prediction of hospital admissions using gradient boosting and GPT-2

被引:0
|
作者
Zhang, Xingyu [1 ]
Wang, Hairong [3 ]
Yu, Guan [4 ]
Zhang, Wenbin [2 ]
机构
[1] Univ Pittsburgh, Sch Hlth & Rehabil Sci, Dept Commun Sci & Disorders, Pittsburgh, PA USA
[2] Florida Int Univ, Knight Fdn Sch Comp & Informat Sci, Miami, FL USA
[3] Georgia Inst Technol, Coll Comp, Atlanta, GA USA
[4] Univ Pittsburgh, Sch Publ Hlth, Dept Biostat & Hlth Data Sci, Pittsburgh, PA USA
来源
DIGITAL HEALTH | 2025年 / 11卷
关键词
Hospital admission prediction; emergency department; machine learning; natural language processing; TRIAGE;
D O I
10.1177/20552076251331319
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Accurately predicting hospital admissions from the emergency department (ED) is essential for improving patient care and resource allocation. This study aimed to predict hospital admissions by integrating both structured clinical data and unstructured text data using machine learning models. Methods: Data were obtained from the 2021 National Hospital Ambulatory Medical Care Survey-Emergency Department (NHAMCS-ED), including adult patients aged 18 years and older. Structured data included demographics, visit characteristics, vital signs, and medical history, while unstructured data consisted of free-text chief complaints and injury descriptions. A Gradient Boosting Classifier (GBC) was applied to structured data, while a fine-tuned GPT-2 model processed the unstructured text. A combined model was created by averaging the outputs of both models. Model performance was evaluated using 5-fold cross-validation, assessing accuracy, precision, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC-ROC). Results: Among the 13,115 patients, 2264 (17.3%) were admitted to the hospital. The combined model outperformed the individual structured and unstructured models, achieving an accuracy of 75.8%, precision of 39.5%, sensitivity of 75.8%, and specificity of 75.8%. In comparison, the structured data model achieved 73.8% accuracy, while the unstructured model reached 64.6%. The combined model had the highest AUC, indicating superior performance. Conclusions: Combining structured and unstructured data using machine learning significantly improves the prediction of hospital admissions from the ED. This integrated approach can enhance decision-making and optimize ED operations.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Machine learning-driven catalyst design, synthesis and performance prediction for CO2 hydrogenation
    Asif, Muhammad
    Yao, Chengxi
    Zuo, Zitu
    Bilal, Muhammad
    Zeb, Hassan
    Lee, Seungjae
    Wang, Ziyang
    Kim, Taesung
    JOURNAL OF INDUSTRIAL AND ENGINEERING CHEMISTRY, 2025, 144 : 32 - 47
  • [2] Machine learning-based prediction of hospital prolonged length of stay admission at emergency department: a Gradient Boosting algorithm analysis
    Zeleke, Addisu Jember
    Palumbo, Pierpaolo
    Tubertini, Paolo
    Miglio, Rossella
    Chiari, Lorenzo
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 6
  • [3] Machine Learning-Driven Metabolic Syndrome Prediction: An International Cohort Validation Study
    Li, Zhao
    Wu, Wenzhong
    Kang, Hyunsik
    HEALTHCARE, 2024, 12 (24)
  • [4] Machine Learning-Driven Trust Prediction for MEC-based IoT Services
    Abeysekara, Prabath
    Dong, Hai
    Qin, A. K.
    2019 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2019), 2019, : 188 - 192
  • [5] An Improved Machine Learning-Driven Framework for Cryptocurrencies Price Prediction With Sentimental Cautioning
    Zubair, Muhammad
    Ali, Jaffar
    Alhussein, Musaed
    Hassan, Shoaib
    Aurangzeb, Khursheed
    Umair, Muhammad
    IEEE ACCESS, 2024, 12 : 51395 - 51418
  • [6] Machine learning-based prediction of CFST columns using gradient tree boosting algorithm
    Vu, Quang-Viet
    Truong, Viet-Hung
    Thai, Huu-Tai
    COMPOSITE STRUCTURES, 2021, 259
  • [7] Machine Learning-Driven Scattering Efficiency Prediction in Passive Daytime Radiative Cooling
    Shi, Changmin
    Zheng, Jiayu
    Wang, Ying
    Gan, Chenjie
    Zhang, Liwen
    Sheldon, Brian W.
    ATMOSPHERE, 2025, 16 (01)
  • [8] Machine Learning-Driven Prediction of Comorbidities and Mortality in Adults With Type 1 Diabetes
    Andersen, Jonas Dahl
    Stoltenberg, Carsten Wridt
    Jensen, Morten Hasselstrom
    Vestergaard, Peter
    Hejlesen, Ole
    Hangaard, Stine
    JOURNAL OF DIABETES SCIENCE AND TECHNOLOGY, 2024,
  • [9] Machine learning-driven solar irradiance prediction: advancing renewable energy in Rajasthan
    Tandon, Aayushi
    Awasthi, Amit
    Pattnayak, Kanhu Charan
    Tandon, Aditya
    Choudhury, Tanupriya
    Kotecha, Ketan
    DISCOVER APPLIED SCIENCES, 2025, 7 (02)
  • [10] Machine learning-driven prediction of average localization error in wireless sensor networks
    Lan Zhang
    Yunfeng Zhao
    International Journal of System Assurance Engineering and Management, 2025, 16 (4) : 1468 - 1484