Diabetes Prediction: Optimization of Machine Learning through Feature Selection and Dimensionality Reduction

被引:2
作者
Aouragh, Abd Allah [1 ]
Bahaj, Mohamed [1 ]
Toufik, Fouad [2 ]
机构
[1] Hassan 1st Univ, Fac Sci & Tech, MIET Lab, Settat, Morocco
[2] Mohammed V Univ, Higher Sch Technol, Comp Sci Lab, Sale, Morocco
关键词
diabetes; machine learning; balancing; feature selection; dimensionality reduction; grid search;
D O I
10.3991/ijoe.v20i08.47765
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Diabetes, a pervasive global health concern, presents diagnostic challenges due to its nuanced onset and far-reaching implications. Traditional diagnostic approaches, reliant on time-consuming assessments, necessitate a paradigm shift towards more efficient methodol- ogies. In response, this study introduces a diagnostic support system leveraging the power of optimized machine learning algorithms. Addressing class imbalance within a dataset comprising 768 records, our methodology intricately weaves together feature selection, dimensionality reduction techniques, and grid search optimization. Specifically, the Extra Trees model, fine-tuned via grid search, emerges as the most potent, showcasing remarkable performance metrics: an accuracy score of 92.5%, an F1 -score of 93.7%, and an AUC-ROC of 92.47%. These findings underscore the pivotal role of machine learning in reshaping diabetes diagnosis, offering transformative possibilities for global healthcare enhancement.
引用
收藏
页码:100 / 114
页数:15
相关论文
共 31 条
[1]  
Al-Zebari A., 2019, 2019 1 INT INF SOFTW, P1
[2]  
Alanazi N., 2023, J Health Inf Developing Ctries, V17, P01
[3]   Early Diagnosis of Diabetes: A Comparison of Machine Learning Methods [J].
Alzboon, Mowafaq Salem ;
Al-Batah, Mohammad Subhi ;
Alqaraleh, Muhyeeddin ;
Abuashour, Ahmad ;
Bader, Ahmad Fuad Hamadah .
INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (15) :144-165
[5]  
Aouragh Abd Allah, 2022, 2022 IEEE 3rd International Conference on Electronics, Control, Optimization and Computer Science (ICECOCS), P1, DOI 10.1109/ICECOCS55148.2022.9983211
[6]  
Aouragh A. A., 2023, 2023 IEEE INT C ADV, P1, DOI [10.1109/ADACIS59737.2023.10424089, DOI 10.1109/ADACIS59737.2023.10424089]
[7]  
Balabaeva Ksenia, 2021, Computational Science - ICCS 2021. 21st International Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12744), P623, DOI 10.1007/978-3-030-77967-2_51
[8]   Hyperparameter optimization: Foundations, algorithms, best practices, and open challenges [J].
Bischl, Bernd ;
Binder, Martin ;
Lang, Michel ;
Pielok, Tobias ;
Richter, Jakob ;
Coors, Stefan ;
Thomas, Janek ;
Ullmann, Theresa ;
Becker, Marc ;
Boulesteix, Anne-Laure ;
Deng, Difan ;
Lindauer, Marius .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2023, 13 (02)
[9]   Extremely Randomized Trees-Based Scheme for Stealthy Cyber-Attack Detection in Smart Grid Networks [J].
Camana, Mario R. ;
Ahmed, Saeed ;
Garcia, Carla E. ;
Koo, Insoo .
IEEE ACCESS, 2020, 8 :19921-19933
[10]   Mobile Applications for Diabetes Self-Care and Approach to Machine Learning [J].
Cedeno-Moreno, Denis ;
Vargas-Lombardo, Miguel .
INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2020, 16 (08) :25-38