Diabetes Prediction: Optimization of Machine Learning through Feature Selection and Dimensionality Reduction

被引:1
作者
Aouragh, Abd Allah [1 ]
Bahaj, Mohamed [1 ]
Toufik, Fouad [2 ]
机构
[1] Hassan 1st Univ, Fac Sci & Tech, MIET Lab, Settat, Morocco
[2] Mohammed V Univ, Higher Sch Technol, Comp Sci Lab, Sale, Morocco
关键词
diabetes; machine learning; balancing; feature selection; dimensionality reduction; grid search;
D O I
10.3991/ijoe.v20i08.47765
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Diabetes, a pervasive global health concern, presents diagnostic challenges due to its nuanced onset and far-reaching implications. Traditional diagnostic approaches, reliant on time-consuming assessments, necessitate a paradigm shift towards more efficient methodol- ogies. In response, this study introduces a diagnostic support system leveraging the power of optimized machine learning algorithms. Addressing class imbalance within a dataset comprising 768 records, our methodology intricately weaves together feature selection, dimensionality reduction techniques, and grid search optimization. Specifically, the Extra Trees model, fine-tuned via grid search, emerges as the most potent, showcasing remarkable performance metrics: an accuracy score of 92.5%, an F1 -score of 93.7%, and an AUC-ROC of 92.47%. These findings underscore the pivotal role of machine learning in reshaping diabetes diagnosis, offering transformative possibilities for global healthcare enhancement.
引用
收藏
页码:100 / 114
页数:15
相关论文
共 31 条
  • [1] Al-Zebari A., 2019, 2019 1st International Informatics and Software Engineering Conference (UBMYK), P1
  • [2] Alanazi N., 2023, J Health Inf Developing Ctries, V17, P01
  • [3] Early Diagnosis of Diabetes: A Comparison of Machine Learning Methods
    Alzboon, Mowafaq Salem
    Al-Batah, Mohammad Subhi
    Alqaraleh, Muhyeeddin
    Abuashour, Ahmad
    Bader, Ahmad Fuad Hamadah
    [J]. INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (15) : 144 - 165
  • [5] Aouragh Abd Allah, 2022, 2022 IEEE 3rd International Conference on Electronics, Control, Optimization and Computer Science (ICECOCS), P1, DOI 10.1109/ICECOCS55148.2022.9983211
  • [6] Aouragh A. A., 2023, 2023 IEEE INT C ADV, P1, DOI [10.1109/ADACIS59737.2023.10424089, DOI 10.1109/ADACIS59737.2023.10424089]
  • [7] Balabaeva Ksenia, 2021, Computational Science - ICCS 2021. 21st International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12744), P623, DOI 10.1007/978-3-030-77967-2_51
  • [8] Hyperparameter optimization: Foundations, algorithms, best practices, and open challenges
    Bischl, Bernd
    Binder, Martin
    Lang, Michel
    Pielok, Tobias
    Richter, Jakob
    Coors, Stefan
    Thomas, Janek
    Ullmann, Theresa
    Becker, Marc
    Boulesteix, Anne-Laure
    Deng, Difan
    Lindauer, Marius
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2023, 13 (02)
  • [9] Extremely Randomized Trees-Based Scheme for Stealthy Cyber-Attack Detection in Smart Grid Networks
    Camana, Mario R.
    Ahmed, Saeed
    Garcia, Carla E.
    Koo, Insoo
    [J]. IEEE ACCESS, 2020, 8 : 19921 - 19933
  • [10] Mobile Applications for Diabetes Self-Care and Approach to Machine Learning
    Cedeno-Moreno, Denis
    Vargas-Lombardo, Miguel
    [J]. INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2020, 16 (08) : 25 - 38