A Cloud-Based Optimized Ensemble Model for Risk Prediction of Diabetic Progression-An Azure Machine Learning Perspective

Times cited: 0
Authors
Daliya, V. K. [1]
Ramesh, T. K. [1]
Affiliations
[1] Amrita Vishwa Vidyapeetham, Dept Elect & Commun Engn, Amrita Sch Engn, Bengaluru 560035, India
Source
IEEE ACCESS | 2025 / Vol. 13
Keywords
Diabetes; Diseases; Predictive models; Glucose; Prediction algorithms; Optimization; Machine learning; Classification algorithms; Nearest neighbor methods; Accuracy; Diabetic prediction; ensemble learning; KNN; LightGBM; voting classifier; azure cloud; azure machine learning; GLUCOSE; REGRESSION; SYSTEM;
DOI
10.1109/ACCESS.2025.3528033
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Machine learning has proven highly beneficial for predictive analysis in healthcare, particularly for diseases such as diabetes. This study introduces an optimized ensemble algorithm based on Light Gradient-Boosting Machine (LightGBM) and K-Nearest Neighbour (KNN) for predicting the progression of Type 2 diabetes, classifying it as high or low risk from patient health parameters and serum measurements. The model pairs LightGBM, a fast and efficient gradient boosting framework, with KNN, which classifies data points by proximity. Optimization techniques such as 10-fold cross-validation and grid search are applied to get the best results from the ensemble. The optimized LightGBM and KNN models are combined through a voting classifier that uses soft voting to determine the final class, so the ensemble draws on the predictive capabilities of both methods. The experiment is implemented in Microsoft's Azure cloud using the Azure Machine Learning service, which brings the advantages of cloud computing in scalability and security and allows potential integration into IoT-based smart healthcare systems, including remote patient monitoring. The ensemble achieves an area under the receiver operating characteristic curve (ROC AUC) of 83.2%, indicating good classification efficiency, and an accuracy of 75%. Compared with other classification and ensemble models, the proposed model performs better. The ensemble is also tested with some metaheuristic optimization methods, which produced comparable scores. The method's effectiveness is validated against another risk prediction dataset, supporting its reliability. The model's accurate predictions can aid individuals in understanding disease progression risks and guide medical professionals in intervention strategies.
Pages: 11560 - 11575
Page count: 16