A Machine Learning Model to Predict Length of Stay and Mortality among Diabetes and Hypertension Inpatients

被引:5
作者
Barsasella, Diana [1 ,2 ,3 ]
Bah, Karamo [1 ,2 ]
Mishra, Pratik [4 ]
Uddin, Mohy [5 ]
Dhar, Eshita [1 ,2 ]
Suryani, Dewi Lena [3 ]
Setiadi, Dedi [3 ]
Masturoh, Imas [3 ]
Sugiarti, Ida [3 ]
Jonnagaddala, Jitendra [6 ]
Syed-Abdul, Shabbir [1 ,2 ,7 ]
机构
[1] Taipei Med Univ, Coll Med Sci & Technol, Grad Inst Biomed Informat, Taipei 106, Taiwan
[2] Taipei Med Univ, Coll Med Sci & Technol, Int Ctr Hlth Informat Technol ICHIT, Taipei 106, Taiwan
[3] Minist Hlth Tasikmalaya, Dept Med Record & Hlth Informat, Hlth Polytech, Tasikmalaya 46115, West Java, Indonesia
[4] CGD Hlth Pty Ltd, Throsby, ACT 2914, Australia
[5] Minist Natl Guard Hlth Affairs, Res Qual Management Sect, King Abdullah Int Med Res Ctr, Riyadh 11481, Saudi Arabia
[6] Univ New South Wales, Sch Populat Hlth, Kensington, NSW 2033, Australia
[7] Taipei Med Univ, Coll Nursing, Sch Gerontol & Long Term Care, Taipei 106, Taiwan
来源
MEDICINA-LITHUANIA | 2022年 / 58卷 / 11期
基金
欧盟地平线“2020”;
关键词
predictive modeling; external validation; length of stay; mortality; type; 2; diabetes; hypertension; machine learning; HOSPITALIZED-PATIENTS; SEVERITY INDEX; OLDER-ADULTS; MELLITUS; HEALTH; TAIWAN; RISK; DISEASE; COMPLICATIONS;
D O I
10.3390/medicina58111568
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background and Objectives: Taiwan is among the nations with the highest rates of Type 2 Diabetes Mellitus (T2DM) and Hypertension (HTN). As more cases are reported each year, there is a rise in hospital admissions for people seeking medical attention. This creates a burden on hospitals and affects the overall management and administration of the hospitals. Hence, this study aimed to develop a machine learning (ML) model to predict the Length of Stay (LoS) and mortality among T2DM and HTN inpatients. Materials and Methods: Using Taiwan's National Health Insurance Research Database (NHIRD), this cohort study consisted of 58,618 patients, where 25,868 had T2DM, 32,750 had HTN, and 6419 had both T2DM and HTN. We analyzed the data with different machine learning models for the prediction of LoS and mortality. The evaluation was done by plotting descriptive statistical graphs, feature importance, precision-recall curve, accuracy plots, and AUC. The training and testing data were set at a ratio of 8:2 before applying ML algorithms. Results: XGBoost showed the best performance in predicting LoS (R2 0.633; RMSE 0.386; MAE 0.123), and RF resulted in a slightly lower performance (R2 0.591; RMSE 0.401; MAE 0.027). Logistic Regression (LoR) performed the best in predicting mortality (CV Score 0.9779; Test Score 0.9728; Precision 0.9432; Recall 0.9786; AUC 0.97 and AUPR 0.93), closely followed by Ridge Classifier (CV Score 0.9736; Test Score 0.9692; Precision 0.9312; Recall 0.9463; AUC 0.94 and AUPR 0.89). Conclusions: We developed a robust prediction model for LoS and mortality of T2DM and HTN inpatients. Linear Regression showed the best performance for LoS, and Logistic Regression performed the best in predicting mortality. The results showed that ML algorithms can not only help healthcare professionals in data-driven decision-making but can also facilitate early intervention and resource planning.
引用
收藏
页数:27
相关论文
共 46 条
[1]   Automated detection of premature delivery using empirical mode and wavelet packet decomposition techniques with uterine electromyogram signals [J].
Acharya, U. Rajendra ;
Sudarshan, Vidya K. ;
Rong, Soon Qing ;
Tan, Zechariah ;
Lim, Choo Min ;
Koh, Joel E. W. ;
Nayak, Sujatha ;
Bhandary, Sulatha V. .
COMPUTERS IN BIOLOGY AND MEDICINE, 2017, 85 :33-42
[2]  
[Anonymous], 1985, WHO TECH REP SER, P1
[3]  
[Anonymous], HYPERTENSION
[4]  
Awad A, 2017, HEALTH SERV MANAG RE, V30, P105, DOI 10.1177/0951484817696212
[5]   Predicting length of stay and mortality among hospitalized patients with type 2 diabetes mellitus and hypertension [J].
Barsasella, Diana ;
Gupta, Srishti ;
Malwade, Shwetambara ;
Aminin ;
Susanti, Yanti ;
Tirmadi, Budi ;
Mutamakin, Agus ;
Jonnagaddala, Jitendra ;
Syed-Abdul, Shabbir .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 154
[6]   Diabetes and vascular disease: pathophysiology, clinical consequences, and medical therapy: part II [J].
Beckman, Joshua A. ;
Paneni, Francesco ;
Cosentino, Francesco ;
Creager, Mark A. .
EUROPEAN HEART JOURNAL, 2013, 34 (31) :2444-U31
[7]   The Lancet NCDI Poverty Commission: bridging a gap in universal health coverage for the poorest billion [J].
Bukhman, Gene ;
Mocumbi, Ana O. ;
Atun, Rifat ;
Becker, Anne E. ;
Bhutta, Zulfiqar ;
Binagwaho, Agnes ;
Clinton, Chelsea ;
Coates, Matthew M. ;
Dain, Katie ;
Ezzati, Majid ;
Gottlieb, Gary ;
Gupta, Indrani ;
Gupta, Neil ;
Hyder, Adnan A. ;
Jain, Yogesh ;
Kruk, Margaret E. ;
Makani, Julie ;
Marx, Andrew ;
Miranda, J. Jaime ;
Norheim, Ole F. ;
Nugent, Rachel ;
Roy, Nobhojit ;
Stefan, Cristina ;
Wallis, Lee ;
Mayosi, Bongani .
LANCET, 2020, 396 (10256) :991-1044
[8]   Type 2 diabetes [J].
Chatterjee, Sudesna ;
Khunti, Kamlesh ;
Davies, Melanie J. .
LANCET, 2017, 389 (10085) :2239-2251
[9]  
Chawla NV, 2010, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, SECOND EDITION, P875, DOI 10.1007/978-0-387-09823-4_45
[10]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)