DEVELOPMENT AND VALIDATION OF MACHINE LEARNING MODELS TO PREDICT UNPLANNED HOSPITALIZATIONS OF PATIENTS WITH DIABETES WITHIN THE NEXT 12 MONTHS

Cited by: 0
Authors
Andreychenko, Anna E. [1]
Ermak, Andrey D. [1]
Gavrilov, Denis V. [1]
Novitskiy, Roman E. [1]
Gusev, Alexander V. [2, 3]
Affiliations
[1] K SkAI LLC, Petrozavodsk, Russia
[2] Fed Res Inst Hlth Org & Informat, Moscow, Russia
[3] Res & Pract Clin Ctr Diagnost & Telemed Technol, Moscow, Russia
Source
DIABETES MELLITUS | 2024 / Vol. 27 / No. 02
Keywords
diabetes mellitus; hospitalization; predictive models; machine learning; artificial intelligence
DOI
10.14341/DM13065
Chinese Library Classification number
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract
BACKGROUND: The incidence of diabetes mellitus (DM) has been increasing steadily for several decades, both in the Russian Federation and worldwide. Steady population growth and the current epidemiological characteristics of DM lead to enormous economic costs and significant social losses throughout the world. The disease often progresses with the development of specific complications, which significantly increases the likelihood of hospitalization. Developing and deploying a machine learning model that predicts inpatient hospitalizations of patients with DM would make it possible to personalize medical care and optimize the load on the entire healthcare system.
AIM: To develop and validate models that predict unplanned hospitalizations of patients with diabetes due to the disease itself and its complications, using machine learning algorithms and data from real clinical practice.
MATERIALS AND METHODS: The study included 170,141 depersonalized electronic health records of 23,742 patients with diabetes. Anamnestic, constitutional, clinical, instrumental and laboratory data widely used in routine medical practice were considered as potential predictors, 33 features in total. Logistic regression (LR), gradient boosting methods (LightGBM, XGBoost, CatBoost), decision tree-based methods (RandomForest and ExtraTrees), and a neural network-based algorithm (Multi-layer Perceptron) were compared. External validation was performed on data from a separate region of the Russian Federation.
RESULTS: The LightGBM model showed the best results and the best stability on external validation data, with an AUC of 0.818 (95% CI 0.802-0.834) in internal testing and 0.802 (95% CI 0.773-0.832) in external validation.
CONCLUSION: The metrics of the best model were superior to those reported in previously published studies. External validation showed that the model is relatively stable on new data from another region, which indicates that it could be applied in real clinical practice.
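The abstract reports each AUC with a 95% confidence interval but does not say how the intervals were obtained. As an illustration only, the sketch below computes an AUC and a percentile-bootstrap interval in plain Python. The bootstrap method, the function names `roc_auc` and `bootstrap_auc_ci`, and the parameters `n_boot`, `alpha` and `seed` are all assumptions made for this example, not details taken from the study.

```python
import random


def roc_auc(labels, scores):
    """AUC as the probability that a random positive case scores higher
    than a random negative case (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def bootstrap_auc_ci(labels, scores, n_boot=1000, alpha=0.05, seed=0):
    """Point estimate plus a percentile-bootstrap (1 - alpha) interval.

    Hypothetical helper: the resampling scheme is an assumption, not the
    procedure used in the paper.
    """
    point = roc_auc(labels, scores)  # also fails early on single-class input
    rng = random.Random(seed)
    n = len(labels)
    aucs = []
    while len(aucs) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # resample contains one class only, AUC is undefined
        aucs.append(roc_auc(ys, [scores[i] for i in idx]))
    aucs.sort()
    lower = aucs[int((alpha / 2) * n_boot)]
    upper = aucs[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return point, lower, upper


if __name__ == "__main__":
    # Toy data, invented for the example: 1 = hospitalized, 0 = not hospitalized.
    y = [0, 0, 1, 1, 0, 1, 0, 1]
    p = [0.10, 0.40, 0.35, 0.80, 0.20, 0.70, 0.55, 0.60]
    auc, lo, hi = bootstrap_auc_ci(y, p, n_boot=500)
    print(f"AUC {auc:.3f} (95% CI {lo:.3f}-{hi:.3f})")
```

The pairwise AUC loop is quadratic in the number of records, which is fine for a toy example; on a data set the size of the one described above, a rank-based implementation would be the more practical choice.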
Pages: 142-157
Number of pages: 16