Multivariable prediction model of complications derived from diabetes mellitus using machine learning on scarce highly unbalanced data

被引:0
|
作者
Colmenares-Mejia, Claudia C. [1 ]
Rincon-Acuna, Juan C. [2 ,3 ]
Cely, Andres [1 ,4 ]
Gonzalez-Velez, Abel E. [5 ]
Castillo, Andrea [6 ]
Murcia, Jossie [7 ]
Isaza-Ruget, Mario A. [8 ]
机构
[1] Fdn Univ Sanitas, Bogota, DC, Colombia
[2] Univ Santander, Campus Lagos del Cacique, Bucaramanga, Santander, Colombia
[3] Keralty, Corp Data Management, Bogota, DC, Colombia
[4] Univ Nacl Colombia, Bogota, DC, Colombia
[5] Univ Hosp Torrejon, Prevent Med Serv, Torrejon De Ardoz, Spain
[6] EPS Sanitas, Direcc Gest Conocimiento, Bogota, DC, Colombia
[7] Fdn Univ Sanitas, Inst Gerencia & Gest Sanitaria, Bogota, DC, Colombia
[8] Fdn Univ Sanitas, Res Grp INPAC, Bogota, DC, Colombia
关键词
Complications; Diabetes mellitus; Machine learning; Predictive analytics; Risk predictions;
D O I
10.1007/s13410-023-01264-7
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
BackgroundDiabetes mellitus (DM) increases the risk complications in addition to mortality. Quantifying the risk of complications using artificial intelligence could be a way to design comprehensive patient healthcare programs.ObjectivePredicting the probability of macro and microvascular complications in patients with DM through Machine Learning.MethodsRetrospective cohort study. Based on an outpatient follow-up program for diabetic patients, 64,081 records and 287 variables were identified, with highly unbalanced data. Predictive models for chronic kidney disease (CKD), lower extremity amputation (LEA), coronary heart disease (CHD), and early mortality (MOR) were developed. An exhaustive computational method was conducted to find the best combination between machine learning (ML) algorithms and sampling method.ResultsThe best model was determined by assessing its performance through the heuristics obtained from a comprehensive analysis of the accuracy and F1 values for ML, sampling, and dataset. Regarding each complication, 99.9% accuracy was obtained for LEA, 94.3% for CHD, 97.4% for MOR, and 98.8% for CKD. F1 was assessed to identify false positives, with 84.5% for CKD, 63.6% for MOR, 46.2% for LEA, and 44.8% for CHD.ConclusionsThis ML model can be applied to predict CHD, CKD, and MOR. The success of ML predictions lies in the clinical definition of initial variables and their simplification for obtaining variables based on which the algorithms can identify patients that are likely to develop a complication. For clinical application of this system, it is necessary to assess the cross performance of metrics, as found here (accuracy higher 95% and F1-Score higher than 80%).
引用
收藏
页码:528 / 538
页数:11
相关论文
共 50 条
  • [1] Prediction of complications in diabetes mellitus using machine learning models with transplanted topic model features
    Han, Benedict Choonghyun
    Kim, Jimin
    Choi, Jinwook
    BIOMEDICAL ENGINEERING LETTERS, 2024, 14 (01) : 163 - 171
  • [2] Prediction of complications in diabetes mellitus using machine learning models with transplanted topic model features
    Benedict Choonghyun Han
    Jimin Kim
    Jinwook Choi
    Biomedical Engineering Letters, 2024, 14 : 163 - 171
  • [3] Diabetes mellitus prediction and diagnosis from a data preprocessing and machine learning perspective
    Olisah, Chollette C.
    Smith, Lyndon
    Smith, Melvyn
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 220
  • [4] Cardiovascular complications in a diabetes prediction model using machine learning: a systematic review
    Kee, Ooi Ting
    Harun, Harmiza
    Mustafa, Norlaila
    Murad, Nor Azian Abdul
    Chin, Siok Fong
    Jaafar, Rosmina
    Abdullah, Noraidatulakma
    CARDIOVASCULAR DIABETOLOGY, 2023, 22 (01)
  • [5] Cardiovascular complications in a diabetes prediction model using machine learning: a systematic review
    Ooi Ting Kee
    Harmiza Harun
    Norlaila Mustafa
    Nor Azian Abdul Murad
    Siok Fong Chin
    Rosmina Jaafar
    Noraidatulakma Abdullah
    Cardiovascular Diabetology, 22
  • [6] Prediction model for gestational diabetes mellitus using the XG Boost machine learning algorithm
    Hu, Xiaoqi
    Hu, Xiaolin
    Yu, Ya
    Wang, Jia
    FRONTIERS IN ENDOCRINOLOGY, 2023, 14
  • [7] Prediction of Diabetes Mellitus Progression Using Supervised Machine Learning
    Chauhan, Apoorva S.
    Varre, Mathew S.
    Izuora, Kenneth
    Trabia, Mohamed B.
    Dufek, Janet S.
    SENSORS, 2023, 23 (10)
  • [8] An early prediction model for gestational diabetes mellitus created using machine learning algorithms
    Yang, Zhifen
    Shi, Xiaoyue
    Wang, Shengpu
    Du, Lijia
    Zhang, Xiaoying
    Zhang, Kun
    Zhang, Yongqiang
    Ma, Jinlong
    Zheng, Rui
    INTERNATIONAL JOURNAL OF GYNECOLOGY & OBSTETRICS, 2025,
  • [9] Predicting complications of diabetes mellitus using advanced machine learning algorithms
    Ljubic, Branimir
    Hai, Ameen Abdel
    Stanojevic, Marija
    Diaz, Wilson
    Polimac, Daniel
    Pavlovski, Martin
    Obradovic, Zoran
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (09) : 1343 - 1351
  • [10] IDMPF: intelligent diabetes mellitus prediction framework using machine learning
    Ismail, Leila
    Materwala, Huned
    APPLIED COMPUTING AND INFORMATICS, 2025, 21 (1/2) : 78 - 89