Predicting Diabetic Retinopathy and Nephropathy Complications Using Machine Learning Techniques

被引:0
作者
Manjunath, D. R. [1 ]
Lohith, J. J. [2 ]
Selva Kumar, S. [3 ]
Das, Abhijit [4 ]
机构
[1] BMS Coll Engn, Dept CSE IoT&CS, Bengaluru 560019, India
[2] Nagarjuna Coll Engn & Technol, Dept CSE AI&ML, Bengaluru 562110, India
[3] BMS Coll Engn, Dept CSE, Bengaluru 560019, India
[4] Manipal Acad Higher Educ, Manipal Inst Technol, Dept Informat Technol, Manipal 560064, India
关键词
Diabetes; Accuracy; Machine learning; Machine learning algorithms; Data models; Support vector machines; Random forests; Predictive models; Medical diagnostic imaging; Classification algorithms; Diabetic retinopathy; diabetic nephropathy; machine learning; XGBoost; clinical prediction; feature importance; RISK; MODELS;
D O I
10.1109/ACCESS.2025.3562483
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Diabetes and its complications, especially Diabetic Retinopathy (DR) and Diabetic Nephropathy (DN) is a big challenge to the global healthcare system and needs accurate predictive models to help in early diagnosis and intervention. In this study we used a dataset from a reputed medical center in India with 767 patient records and 22 attributes including demographic details, clinical markers and treatment plans. We used a suite of advanced machine learning algorithms-Random Forest, XGBoost, LightGBM, CatBoost, Neural Networks and ensemble approaches like Voting and Stacking Classifiers to see their performance on original, oversampled and undersampled datasets. Through feature engineering, sampling strategies and hyperparameter tuning the models performed well on all the datasets. Surprisingly the models performed well even on the original imbalanced dataset which can be attributed to the power of the models and hyperparameter tuning. Ensemble methods like Voting and Stacking Classifiers performed better and achieved near perfect metrics (AUC = 1.0) in oversampled datasets. Hyperparameter tuning further improved the performance, reduced RMSE and log loss and increased accuracy and recall in all the configurations. This shows the importance of model optimization in real world clinical datasets which are imbalanced and noisy. This paper shows the possibility of machine learning based frameworks in diabetic complication management by predicting accurately and in time. These models can be integrated into clinical decision support systems (CDSS) to give insights to clinicians, improve patient outcomes through personalized interventions and optimize resource allocation. Future work will be to validate this on different populations, include longitudinal patient data and integrate real time electronic health records (EHR) for deployment in hospitals.
引用
收藏
页码:70228 / 70253
页数:26
相关论文
共 42 条
[1]   Performance of the Garvan Fracture Risk Calculator in Individuals with Diabetes: A Registry-Based Cohort Study [J].
Agarwal, Arnav ;
Leslie, William D. ;
Nguyen, Tuan, V ;
Morin, Suzanne N. ;
Lix, Lisa M. ;
Eisman, John A. .
CALCIFIED TISSUE INTERNATIONAL, 2022, 110 (06) :658-665
[2]  
Agrawal P., 2015, Int. Res. J. Eng. Tech., V2
[3]   AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION [J].
ALTMAN, NS .
AMERICAN STATISTICIAN, 1992, 46 (03) :175-185
[4]   A Novel Proposal for Deep Learning-Based Diabetes Prediction: Converting Clinical Data to Image Data [J].
Aslan, Muhammet Fatih ;
Sabanci, Kadir .
DIAGNOSTICS, 2023, 13 (04)
[5]   Risk factors for type 2 diabetes mellitus: An exposure-wide umbrella review of meta-analyses [J].
Bellou, Vanesa ;
Belbasis, Lazaros ;
Tzoulaki, Ioanna ;
Evangelou, Evangelos .
PLOS ONE, 2018, 13 (03)
[6]   Prediction and diagnosis of future diabetes risk: a machine learning approach [J].
Birjais, Roshan ;
Mourya, Ashish Kumar ;
Chauhan, Ritu ;
Kaur, Harleen .
SN APPLIED SCIENCES, 2019, 1 (09)
[7]  
Breiman L., 2001, MACH LEARN, V45, P5
[8]   Lung cancer risk of airborne particles for Italian population [J].
Buonanno, G. ;
Giovinco, G. ;
Morawska, L. ;
Stabile, L. .
ENVIRONMENTAL RESEARCH, 2015, 142 :443-451
[9]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[10]   Machine learning algorithms for predicting the risk of fracture in patients with diabetes in China [J].
Chu, Sijia ;
Jiang, Aijun ;
Chen, Lyuzhou ;
Zhang, Xi ;
Shen, Xiurong ;
Zhou, Wan ;
Ye, Shandong ;
Chen, Chao ;
Zhang, Shilu ;
Zhang, Li ;
Chen, Yang ;
Miao, Ya ;
Wang, Wei .
HELIYON, 2023, 9 (07)