CLASSIFICATION OF DIABETES USING ENSEMBLE MACHINE LEARNING TECHNIQUES

被引:0
|
作者
Ashisha G.R. [1 ]
Mary X.A. [2 ]
Raja J.M. [3 ]
机构
[1] Electronics and Instrumentation Engineering, Karunya Institute of Technology and Sciences, Coimbatore
[2] Robotics Engineering, Karunya Institute of Technology and Sciences, Coimbatore
[3] Computer Science Engineering, Karunya Institute of Technology and Sciences, Coimbatore
来源
Scalable Computing | 2024年 / 25卷 / 04期
关键词
Diabetes; Ensemble Voting Classifier; Gradient Boost; Machine Learning; Random Over Sampling;
D O I
10.12694/scpe.v25i4.2873
中图分类号
学科分类号
摘要
Diabetes is a widespread chronic condition that impacts people all over the globe and requires a clear and timely diagnosis. Untreated diabetes leads to retinopathy, nephropathy, and damage to the nervous system. In this context, Machine Learning (ML) might be used to detect health problems early, diagnose them, and track their progress. Ensemble techniques are a promising approach that combines many classifiers to improve forecast accuracy and resilience. This study investigates the categorization of diabetes using an ensemble machine learning technique known as a voting classifier. Using a variety of classifiers, including Light Gradient Boosting Machine (LightGBM), Gradient Boost classifier (GBC), and Random Forest (RF). The predictions are aggregated using voting methods to get a final classification result. The research is carried out using two benchmarking datasets: the Pima Indian Diabetes Dataset (PIDD) and the German Dataset. The Boruta technique is used to choose the best attributes from the datasets, while the Random Over Sampling approach balances the range of classes and eliminates abnormal data using the interquartile range approach. The findings showed that the combination of the Boruta feature selection algorithm and ensemble Voting Classifier performed better for both PIDD and German datasets with an accuracy of 93% and 90% respectively. These algorithms are evaluated and the maximum accuracy is produced using the combination of the Boruta feature selection algorithm and ensemble Voting Classifier. This research helps medical professionals in the early prediction of diabetes, reducing physician’s time. © 2024 SCPE.
引用
收藏
页码:3172 / 3180
页数:8
相关论文
共 50 条
  • [31] An exploration on text classification using machine learning techniques
    Athanasios, Tzimourtas
    Spyros, Bakalakos
    Panagiota, Tselenti
    Athanasios, Voulodimos
    25TH PAN-HELLENIC CONFERENCE ON INFORMATICS WITH INTERNATIONAL PARTICIPATION (PCI2021), 2021, : 247 - 249
  • [32] Classification of Sentimental Reviews Using Machine Learning Techniques
    Tripathy, Abinash
    Agrawal, Ankit
    Rath, Santanu Kumar
    3RD INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTING 2015 (ICRTC-2015), 2015, 57 : 821 - 829
  • [33] ONLINE NEWS CLASSIFICATION USING MACHINE LEARNING TECHNIQUES
    Ahmed, Jeelani
    Ahmed, Muqeem
    IIUM ENGINEERING JOURNAL, 2021, 22 (02): : 210 - 225
  • [34] Classification of yoga pose using machine learning techniques
    Palanimeera, J.
    Ponmozhi, K.
    MATERIALS TODAY-PROCEEDINGS, 2021, 37 : 2930 - 2933
  • [35] Classification of Time Signals Using Machine Learning Techniques
    Jadoon, Ishfaq Ahmad
    Logofatu, Doina
    Islam, Mohammad Nahin
    24TH INTERNATIONAL CONFERENCE ON ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EAAAI/EANN 2023, 2023, 1826 : 85 - 96
  • [36] Classification of WatSan Technologies Using Machine Learning Techniques
    Al Nuaimi, Hala
    Abdelmagid, Mohamed
    Bouabid, Ali
    Chrysikopoulos, Constantinos V. V.
    Maalouf, Maher
    WATER, 2023, 15 (15)
  • [37] Patient Discharge Classification Using Machine Learning Techniques
    Gramaje A.
    Thabtah F.
    Abdelhamid N.
    Ray S.K.
    Annals of Data Science, 2021, 8 (04) : 755 - 767
  • [38] ECG beat classification using machine learning techniques
    Jambukia, Shweta H.
    Dabhi, Vipul K.
    Prajapati, Harshadkumar B.
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2018, 26 (01) : 32 - 53
  • [39] Classification of cardiac arrhythmia using machine learning techniques
    Firyulina, M. A.
    Kashirina, I. L.
    APPLIED MATHEMATICS, COMPUTATIONAL SCIENCE AND MECHANICS: CURRENT PROBLEMS, 2020, 1479
  • [40] Patient care classification using machine learning techniques
    Melhem, Shatha
    Al-Aiad, Ahmad
    Al-Ayyad, Muhammad Saleh
    2021 12TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2021, : 57 - 62