Diabetes prediction model using machine learning techniques

Cited by: 0
Authors
Sandip Kumar Singh Modak
Vijay Kumar Jha
Affiliations
[1] Sarla Birla University, Department of Computer Science & Engineering
[2] Birla Institute of Technology, Department of Computer Science & Engineering
Source
Multimedia Tools and Applications | 2024 / Vol. 83
Keywords
Diabetes; Machine learning; SVM; Random Forest and Naïve Bayes
DOI
Not available
Abstract
Diabetes has emerged as a significant global health concern, contributing to severe complications such as kidney disease, vision loss, and coronary issues. Machine learning algorithms have shown promise in supporting accurate disease diagnosis and treatment, easing the burden on healthcare professionals. Diabetes forecasting has evolved rapidly and offers the potential for early intervention and patient empowerment. To this end, our study presents a diabetes prediction model that employs a range of machine learning techniques, including Logistic Regression, SVM, Naïve Bayes, and Random Forest. In addition to these foundational techniques, we use ensemble learning to further improve prediction accuracy and robustness. Specifically, we explore the ensemble methods XGBoost, LightGBM, CatBoost, AdaBoost, and Bagging. These techniques combine predictions from multiple base learners to yield a more precise and resilient final prediction. The proposed framework is developed and trained in Python on a real-world dataset sourced from Kaggle. We evaluate the methodology using performance metrics including the confusion matrix, sensitivity, and accuracy. Among the ensemble techniques tested, CatBoost is the most effective, with an accuracy of 95.4% compared with 94.3% for XGBoost. CatBoost also achieves a higher AUC-ROC score (0.99) than XGBoost (0.98).
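The evaluation metrics named in the abstract (confusion matrix, sensitivity, accuracy, AUC-ROC) and the idea of combining predictions from several base learners can be illustrated with a minimal Python sketch. This is not the authors' code: the labels and scores are made-up toy values, the function names are hypothetical, and simple probability averaging (soft voting) stands in for the boosting and bagging methods the paper actually evaluates.

```python
from typing import Dict, List, Sequence


def soft_vote(per_model_scores: Sequence[Sequence[float]]) -> List[float]:
    """Average the predicted probabilities of several base learners."""
    n_models = len(per_model_scores)
    return [sum(column) / n_models for column in zip(*per_model_scores)]


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, int]:
    """Count outcomes for binary labels, where 1 means 'diabetic'."""
    counts = {"tp": 0, "fp": 0, "tn": 0, "fn": 0}
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            counts["tp"] += 1
        elif truth == 0 and pred == 1:
            counts["fp"] += 1
        elif truth == 0 and pred == 0:
            counts["tn"] += 1
        else:
            counts["fn"] += 1
    return counts


def accuracy(cm: Dict[str, int]) -> float:
    """Fraction of all cases classified correctly."""
    return (cm["tp"] + cm["tn"]) / sum(cm.values())


def sensitivity(cm: Dict[str, int]) -> float:
    """Fraction of true positive cases that the model detects (recall)."""
    return cm["tp"] / (cm["tp"] + cm["fn"])


def auc_roc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random
    negative, counting ties as one half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data (assumed for illustration only, not taken from the paper).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
model_a = [0.9, 0.2, 0.3, 0.8, 0.3, 0.9, 0.7, 0.1]
model_b = [0.8, 0.1, 0.6, 0.6, 0.4, 0.3, 0.9, 0.2]

ensemble_scores = soft_vote([model_a, model_b])
y_pred = [1 if s >= 0.5 else 0 for s in ensemble_scores]  # 0.5 threshold is an assumption
cm = confusion_matrix(y_true, y_pred)

print("confusion matrix:", cm)
print("accuracy:", accuracy(cm))
print("sensitivity:", sensitivity(cm))
print("AUC-ROC:", auc_roc(y_true, ensemble_scores))
```

Accuracy and sensitivity depend on the chosen decision threshold, whereas AUC-ROC is computed from the ranking of scores alone, which is one reason the two kinds of metric are usually reported side by side.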
Pages: 38523-38549
Page count: 26