CLASSIFICATION OF DIABETES USING ENSEMBLE MACHINE LEARNING TECHNIQUES

被引:0
|
作者
Ashisha G.R. [1 ]
Mary X.A. [2 ]
Raja J.M. [3 ]
机构
[1] Electronics and Instrumentation Engineering, Karunya Institute of Technology and Sciences, Coimbatore
[2] Robotics Engineering, Karunya Institute of Technology and Sciences, Coimbatore
[3] Computer Science Engineering, Karunya Institute of Technology and Sciences, Coimbatore
来源
Scalable Computing | 2024年 / 25卷 / 04期
关键词
Diabetes; Ensemble Voting Classifier; Gradient Boost; Machine Learning; Random Over Sampling;
D O I
10.12694/scpe.v25i4.2873
中图分类号
学科分类号
摘要
Diabetes is a widespread chronic condition that impacts people all over the globe and requires a clear and timely diagnosis. Untreated diabetes leads to retinopathy, nephropathy, and damage to the nervous system. In this context, Machine Learning (ML) might be used to detect health problems early, diagnose them, and track their progress. Ensemble techniques are a promising approach that combines many classifiers to improve forecast accuracy and resilience. This study investigates the categorization of diabetes using an ensemble machine learning technique known as a voting classifier. Using a variety of classifiers, including Light Gradient Boosting Machine (LightGBM), Gradient Boost classifier (GBC), and Random Forest (RF). The predictions are aggregated using voting methods to get a final classification result. The research is carried out using two benchmarking datasets: the Pima Indian Diabetes Dataset (PIDD) and the German Dataset. The Boruta technique is used to choose the best attributes from the datasets, while the Random Over Sampling approach balances the range of classes and eliminates abnormal data using the interquartile range approach. The findings showed that the combination of the Boruta feature selection algorithm and ensemble Voting Classifier performed better for both PIDD and German datasets with an accuracy of 93% and 90% respectively. These algorithms are evaluated and the maximum accuracy is produced using the combination of the Boruta feature selection algorithm and ensemble Voting Classifier. This research helps medical professionals in the early prediction of diabetes, reducing physician’s time. © 2024 SCPE.
引用
收藏
页码:3172 / 3180
页数:8
相关论文
共 50 条
  • [21] Crop Yield Prediction Using Ensemble Machine Learning Techniques
    P. Kuppan
    V. Vishwa Priya
    SN Computer Science, 5 (8)
  • [22] Performance prediction of roadheaders using ensemble machine learning techniques
    Seker, Sadi Evren
    Ocak, Ibrahim
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (04): : 1103 - 1116
  • [23] Performance prediction of roadheaders using ensemble machine learning techniques
    Sadi Evren Seker
    Ibrahim Ocak
    Neural Computing and Applications, 2019, 31 : 1103 - 1116
  • [24] Brain Tumor Classification Using an Ensemble of Deep Learning Techniques
    Patro, S. Gopal Krishna
    Govil, Nikhil
    Saxena, Surabhi
    Kishore Mishra, Brojo
    Taha Zamani, Abu
    Ben Miled, Achraf
    Parveen, Nikhat
    Elshafie, Hashim
    Hamdan, Mosab
    IEEE ACCESS, 2024, 12 : 162094 - 162106
  • [25] Breast Tumor Classification Using an Ensemble Machine Learning Method
    Assiri, Adel S.
    Nazir, Saima
    Velastin, Sergio A.
    JOURNAL OF IMAGING, 2020, 6 (06)
  • [26] Extreme learning machine and ensemble techniques for classification of rolling element bearing defects
    Upadhyay N.
    Chourasiya S.K.
    Life Cycle Reliability and Safety Engineering, 2022, 11 (2) : 189 - 201
  • [27] A Robust Ensemble Machine Learning Model with Advanced Voting Techniques for Comment Classification
    Shiplu, Ariful Islam
    Rahman, Md Mostafizer
    Watanobe, Yutaka
    BIG DATA ANALYTICS IN ASTRONOMY, SCIENCE, AND ENGINEERING, BDA 2023, 2024, 14516 : 141 - 159
  • [28] Stacked Ensemble-Based Type-2 Diabetes Prediction Using Machine Learning Techniques
    Rahim M.A.
    Hossain M.A.
    Hossain M.N.
    Shin J.
    Yun K.S.
    Annals of Emerging Technologies in Computing, 2023, 7 (01) : 30 - 39
  • [29] An ensemble learning approach for diabetes prediction using boosting techniques
    Ganie, Shahid Mohammad
    Pramanik, Pijush Kanti Dutta
    Malik, Majid Bashir
    Mallik, Saurav
    Qin, Hong
    FRONTIERS IN GENETICS, 2023, 14
  • [30] Early Prediction of Diabetes Using an Ensemble of Machine Learning Models
    Dutta, Aishwariya
    Hasan, Md Kamrul
    Ahmad, Mohiuddin
    Awal, Md Abdul
    Islam, Md Akhtarul
    Masud, Mehedi
    Meshref, Hossam
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (19)