Multivariable prediction model of complications derived from diabetes mellitus using machine learning on scarce highly unbalanced data

被引:0
作者
Colmenares-Mejia, Claudia C. [1 ]
Rincon-Acuna, Juan C. [2 ,3 ]
Cely, Andres [1 ,4 ]
Gonzalez-Velez, Abel E. [5 ]
Castillo, Andrea [6 ]
Murcia, Jossie [7 ]
Isaza-Ruget, Mario A. [8 ]
机构
[1] Fdn Univ Sanitas, Bogota, DC, Colombia
[2] Univ Santander, Campus Lagos del Cacique, Bucaramanga, Santander, Colombia
[3] Keralty, Corp Data Management, Bogota, DC, Colombia
[4] Univ Nacl Colombia, Bogota, DC, Colombia
[5] Univ Hosp Torrejon, Prevent Med Serv, Torrejon De Ardoz, Spain
[6] EPS Sanitas, Direcc Gest Conocimiento, Bogota, DC, Colombia
[7] Fdn Univ Sanitas, Inst Gerencia & Gest Sanitaria, Bogota, DC, Colombia
[8] Fdn Univ Sanitas, Res Grp INPAC, Bogota, DC, Colombia
关键词
Complications; Diabetes mellitus; Machine learning; Predictive analytics; Risk predictions;
D O I
10.1007/s13410-023-01264-7
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
BackgroundDiabetes mellitus (DM) increases the risk complications in addition to mortality. Quantifying the risk of complications using artificial intelligence could be a way to design comprehensive patient healthcare programs.ObjectivePredicting the probability of macro and microvascular complications in patients with DM through Machine Learning.MethodsRetrospective cohort study. Based on an outpatient follow-up program for diabetic patients, 64,081 records and 287 variables were identified, with highly unbalanced data. Predictive models for chronic kidney disease (CKD), lower extremity amputation (LEA), coronary heart disease (CHD), and early mortality (MOR) were developed. An exhaustive computational method was conducted to find the best combination between machine learning (ML) algorithms and sampling method.ResultsThe best model was determined by assessing its performance through the heuristics obtained from a comprehensive analysis of the accuracy and F1 values for ML, sampling, and dataset. Regarding each complication, 99.9% accuracy was obtained for LEA, 94.3% for CHD, 97.4% for MOR, and 98.8% for CKD. F1 was assessed to identify false positives, with 84.5% for CKD, 63.6% for MOR, 46.2% for LEA, and 44.8% for CHD.ConclusionsThis ML model can be applied to predict CHD, CKD, and MOR. The success of ML predictions lies in the clinical definition of initial variables and their simplification for obtaining variables based on which the algorithms can identify patients that are likely to develop a complication. For clinical application of this system, it is necessary to assess the cross performance of metrics, as found here (accuracy higher 95% and F1-Score higher than 80%).
引用
收藏
页码:528 / 538
页数:11
相关论文
共 50 条
  • [31] Prediction of progression from pre-diabetes to diabetes: Development and validation of a machine learning model
    Cahn, Avivit
    Shoshan, Avi
    Sagiv, Tal
    Yesharim, Rachel
    Goshen, Ran
    Shalev, Varda
    Raz, Itamar
    DIABETES-METABOLISM RESEARCH AND REVIEWS, 2020, 36 (02)
  • [32] Suicide Prediction Using Machine Learning Techniques in Screening and Clinician-Derived Data
    Hack, Laura
    Jovanovic, Tanja
    Carter, Sierra
    Ressler, Kerry
    Smith, Alicia
    BIOLOGICAL PSYCHIATRY, 2017, 81 (10) : S361 - S361
  • [33] Nonalcoholic fatty liver disease and early prediction of gestational diabetes mellitus using machine learning methods
    Lee, Seung Mi
    Hwangbo, Suhyun
    Norwitz, Errol R.
    Koo, Ja Nam
    Oh, Ig Hwan
    Choi, Eun Saem
    Jung, Young Mi
    Kim, Sun Min
    Kim, Byoung Jae
    Kim, Sang Youn
    Kim, Gyoung Min
    Kim, Won
    Joo, Sae Kyung
    Shin, Sue
    Park, Chan-Wook
    Park, Taesung
    Park, Joong Shin
    CLINICAL AND MOLECULAR HEPATOLOGY, 2022, 28 (01) : 105 - 116
  • [34] Machine-learning algorithms in screening for type 2 diabetes mellitus: Data from Fasa Adults Cohort Study
    Karmand, Hanieh
    Andishgar, Aref
    Tabrizi, Reza
    Sadeghi, Alireza
    Pezeshki, Babak
    Ravankhah, Mahdi
    Taherifard, Erfan
    Ahmadizar, Fariba
    ENDOCRINOLOGY DIABETES & METABOLISM, 2024, 7 (02)
  • [35] PREDICTION OF TYPE 2 DIABETES MELLITUS USING FEATURE SELECTION-BASED MACHINE LEARNING ALGORITHMS
    Yilmaz, Atinc
    HEALTH PROBLEMS OF CIVILIZATION, 2022, 16 (02) : 128 - 139
  • [36] A novel prediction model of the risk of pancreatic cancer among diabetes patients using multiple clinical data and machine learning
    Chen, Shih-Min
    Phuc, Phan Thanh
    Nguyen, Phung-Anh
    Burton, Whitney
    Lin, Shwu-Jiuan
    Lin, Weei-Chin
    Lu, Christine Y.
    Hsu, Min-Huei
    Cheng, Chi-Tsun
    Hsu, Jason C.
    CANCER MEDICINE, 2023, 12 (19): : 19987 - 19999
  • [37] A Hybrid Model for Prediction of Diabetes Using Machine Learning Classification Algorithms and Random Projection
    Poornima, V.
    RamyaDevi, R.
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 139 (03) : 1437 - 1449
  • [38] A patient network-based machine learning model for disease prediction: The case of type 2 diabetes mellitus
    Haohui Lu
    Shahadat Uddin
    Farshid Hajati
    Mohammad Ali Moni
    Matloob Khushi
    Applied Intelligence, 2022, 52 : 2411 - 2422
  • [39] A patient network-based machine learning model for disease prediction: The case of type 2 diabetes mellitus
    Lu, Haohui
    Uddin, Shahadat
    Hajati, Farshid
    Moni, Mohammad Ali
    Khushi, Matloob
    APPLIED INTELLIGENCE, 2022, 52 (03) : 2411 - 2422
  • [40] Using Big Data-machine learning models for diabetes prediction and flight delays analytics
    Nibareke, Therence
    Laassiri, Jalal
    JOURNAL OF BIG DATA, 2020, 7 (01)