A Cloud-Based Optimized Ensemble Model for Risk Prediction of Diabetic Progression-An Azure Machine Learning Perspective

被引:0
|
作者
Daliya, V. K. [1 ]
Ramesh, T. K. [1 ]
机构
[1] Amrita Vishwa Vidyapeetham, Dept Elect & Commun Engn, Amrita Sch Engn, Bengaluru 560035, India
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Diabetes; Diseases; Predictive models; Glucose; Prediction algorithms; Optimization; Machine learning; Classification algorithms; Nearest neighbor methods; Accuracy; Diabetic prediction; ensemble learning; KNN; LightGBM; voting classifier; azure cloud; azure machine learning; GLUCOSE; REGRESSION; SYSTEM;
D O I
10.1109/ACCESS.2025.3528033
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The application of Machine Learning for predictive analysis in healthcare, particularly for diseases like diabetes, has proven highly beneficial. This study introduces an optimized Light Gradient-Boosting Machine (Light GBM) and K-Nearest Neighbour (KNN) based ensemble algorithm for predicting diabetic progression of Type 2 Diabetes, classifying it as high or low risk, using patient health parameters and serum measurements. Our model uses LightGBM, a rapid and efficient gradient boosting framework, coupled with KNN, which uses proximity to classify data points. The proposed model uses various optimization techniques, such as 10 fold cross validation, grid search method etc. to get the best results out of the ensemble model. As the model combines optimized version of LightGBM and KNN through a voting classifier which uses soft voting technique to find the final class, it utilizes the predictive capabilities of both the methods in an effective manner. The experiment is performed and implemented in Microsoft's Azure cloud, using Azure Machine Learning service, that leverages the advantages of cloud computing with respect to scalability, security and its potential integration possibilities into IoT-based smart healthcare systems.This aspect highlights its versatility and impact with respect to remote monitoring of patients as well. The ensemble achieves an 83.2% Area Under the Curve (AUC) of Receiver Operating Characteristics (ROC) score, indicating good classification efficiency. It produced 75% accuracy as well. The proposed model is compared with other classification and ensemble models, showcasing its superiority against other models.The ensemble is also tested with some meta heuristic optimization methods, which produced comparable scores. The method's effectiveness is validated against another risk prediction dataset, proving its reliability. The model's accurate predictions can aid individuals in understanding disease progression risks and guide medical professionals in intervention strategies.
引用
收藏
页码:11560 / 11575
页数:16
相关论文
共 50 条
  • [21] Water quality fluctuations prediction and Debi estimation based on stochastic optimized weighted ensemble learning machine
    Poursaeid, Mojtaba
    Poursaeed, Amir Hossein
    Shabanlou, Saeid
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2024, 188 : 1160 - 1174
  • [22] Development of a machine learning-based model for the prediction and progression of diabetic kidney disease: A single centred retrospective study
    Nayak, Sandhya
    Amin, Ashwini
    Reghunath, Swetha R.
    Thunga, Girish
    Acharya, U. Dinesh
    Shivashankara, K. N.
    Attur, Ravindra Prabhu
    Acharya, Leelavathi D.
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2024, 190
  • [23] A Cloud-Based Software Defect Prediction System Using Data and Decision-Level Machine Learning Fusion
    Aftab, Shabib
    Abbas, Sagheer
    Ghazal, Taher M.
    Ahmad, Munir
    Hamadi, Hussam Al
    Yeun, Chan Yeob
    Khan, Muhammad Adnan
    MATHEMATICS, 2023, 11 (03)
  • [24] Cloud-based battery failure prediction and early warning using multi-source signals and machine learning
    Zhang, Xiaoxi
    Pan, Yongjun
    Cao, Yangzheng
    Liu, Binghe
    Yu, Xinxin
    JOURNAL OF ENERGY STORAGE, 2024, 93
  • [25] Ensemble Gain Ratio Feature Selection (EGFS) Model with Machine Learning and Data Mining Algorithms for Disease Risk Prediction
    Pasha, Syed Javeed
    Mohamed, E. Syed
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT-2020), 2020, : 590 - 596
  • [26] Fatigue Life Prediction of GLARE Composites Using Regression Tree Ensemble-Based Machine Learning Model
    Sai, Wei
    Chai, Gin Boay
    Srikanth, Narasimalu
    ADVANCED THEORY AND SIMULATIONS, 2020, 3 (06)
  • [27] Ensemble-Based Risk Scoring with Extreme Learning Machine for Prediction of Adverse Cardiac Events
    Liu, Nan
    Sakamoto, Jeffrey Tadashi
    Cao, Jiuwen
    Koh, Zhi Xiong
    Ho, Andrew Fu Wah
    Lin, Zhiping
    Ong, Marcus Eng Hock
    COGNITIVE COMPUTATION, 2017, 9 (04) : 545 - 554
  • [28] Ensemble-Based Risk Scoring with Extreme Learning Machine for Prediction of Adverse Cardiac Events
    Nan Liu
    Jeffrey Tadashi Sakamoto
    Jiuwen Cao
    Zhi Xiong Koh
    Andrew Fu Wah Ho
    Zhiping Lin
    Marcus Eng Hock Ong
    Cognitive Computation, 2017, 9 : 545 - 554
  • [29] Construction of disability risk prediction model for the elderly based on machine learning
    Jing Chen
    Yifei Ren
    Jie Ding
    Qingqing Hu
    Jiajia Xu
    Jun Luo
    Zhaowen Wu
    Ting Chu
    Scientific Reports, 15 (1)
  • [30] Prediction and analysis of risk factors for diabetic retinopathy based on machine learning and interpretable models
    Wang, Xu
    Wang, Weijie
    Ren, Huiling
    Li, Xiaoying
    Wen, Yili
    HELIYON, 2024, 10 (09)