A Hybrid Machine-Learning Model Based on Global and Local Learner Algorithms for Diabetes Mellitus Prediction

被引:9
作者
Rufo, Derara Duba [1 ]
Debelee, Taye Girma [2 ]
Negera, Worku Gachena [3 ]
机构
[1] Dilla Univ, Dilla, Snnpr, Ethiopia
[2] Addis Ababa Sci & Technol Univ, Addis Ababa, Ethiopia
[3] Ethiopian Artificial Intelligence Ctr, Addis Ababa, Ethiopia
关键词
Diabetes Prediction; Data Mining; Synthetic Minority Oversampling; Global Learning; Local Learning; Stacking; CLASSIFICATION;
D O I
10.4028/www.scientific.net/JBBBE.54.65
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Health is a critical condition for living things even before the technology exists. Nowadays the healthcare domain provides a lot of scope for research as it has extremely evolved. The most researched areas of health sectors include diabetes mellitus (DM), breast cancer, brain tumor, etc. DM is a severe chronic disease that affects human health and has a high rate throughout the world. Early prediction of DM is important to reduce its risk and even avoid it. In this study, we propose a DM prediction model based on global and local learner algorithms. The proposed global and local learners stacking (GLLS) model; combines the prediction algorithms from two largely different but complementary machine learning paradigms, specifically XGBoost and NB from global learning whereas KNN and SVM (with RBF kernel) from local learning and aggregates them by stacking ensemble technique using LR as meta-learner. The effectiveness of the GLLS model was proved by comparing several performance measures and the results of different contrast experiments. The evaluation results on UCI Pima Indian diabetes data-set (PIDD) indicates the model has achieved the better prediction performance of 99.5%, 99.5%, 99.5%, 99.1%, and 100% in terms of accuracy, AUC, F1 score, sensitivity, and specificity respectively, compared to other research results mentioned in the literature. Moreover, to better validate the GLLS model performance, three additional medical data sets; Messidor, WBC, ILPD, are considered and the model also achieved an accuracy of 82.1%, 98.6%, and 89.3% respectively. Experimental results proved the effectiveness and superiority of our proposed GLLS model.
引用
收藏
页码:65 / 88
页数:24
相关论文
共 46 条
[31]  
Mitchell T.M., 2017, Machine Learning
[32]   Predicting Diabetes Onset: an Ensemble Supervised Learning Approach [J].
Nnamoko, Nonso ;
Hussain, Abir ;
England, David .
2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, :554-560
[33]   Automatic pectoral muscle removal in mammograms [J].
Rahimeto, Samuel ;
Debelee, Taye Girma ;
Yohannes, Dereje ;
Schwenker, Friedhelm .
EVOLVING SYSTEMS, 2021, 12 (02) :519-526
[34]  
Raschka S, 2017, Scikit-Learn, and TensorFlow, V3
[35]  
Raza K., 2019, U-Healthcare Monitoring Systems, P179, DOI [10.1016/B978-0-12-815370-3.00008-6, DOI 10.1016/B978-0-12-815370-3.00008-6]
[36]   Global and regional diabetes prevalence estimates for 2019 and projections for 2030 and 2045: Results from the International Diabetes Federation Diabetes Atlas, 9th edition [J].
Saeedi, Pouya ;
Petersohn, Inga ;
Salpea, Paraskevi ;
Malanda, Belma ;
Karuranga, Suvi ;
Unwin, Nigel ;
Colagiuri, Stephen ;
Guariguata, Leonor ;
Motala, Ayesha A. ;
Ogurtsova, Katherine ;
Shaw, Jonathan E. ;
Bright, Dominic ;
Williams, Rhys ;
Almutairi, Reem ;
Montoya, Pablo Aschner ;
Basit, Abdul ;
Besancon, Stephane ;
Bommer, Christian ;
Borgnakke, Wenche ;
Boyko, Edward ;
Chan, Juliana ;
Divakar, Hema ;
Esteghamati, Alireza ;
Forouhi, Nita ;
Franco, Laercio ;
Gregg, Edward ;
Hassanein, Mohamed ;
Ke, Calvin ;
Levitt, Dinky ;
Lim, Lee-Ling ;
Ogle, Graham D. ;
Owens, David ;
Pavkov, Meda ;
Pearson-Stuttard, Jonathan ;
Ramachandran, Ambady ;
Rathmann, Wolfgang ;
Riaz, Musarrat ;
Simmons, David ;
Sinclair, Alan ;
Sobngwi, Eugene ;
Thomas, Rebecca ;
Ward, Heather ;
Wild, Sarah ;
Yang, Xilin ;
Yuen, Lili ;
Zhang, Ping .
DIABETES RESEARCH AND CLINICAL PRACTICE, 2019, 157
[37]   Diagnosis of diabetes type-II using hybrid machine learning based ensemble model [J].
Sarwar A. ;
Ali M. ;
Manhas J. ;
Sharma V. .
International Journal of Information Technology, 2020, 12 (2) :419-428
[38]  
Sisodia Deepti, 2018, Procedia Computer Science, V132, P1578, DOI 10.1016/j.procs.2018.05.122
[39]  
Srivastava Y, 2019, PROCEEDINGS 2019 AMITY INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AICAI), P321, DOI [10.1109/aicai.2019.8701307, 10.1109/AICAI.2019.8701307]
[40]  
Vigneswari D, 2019, INT CONF ADVAN COMPU, P84, DOI [10.1109/ICACCS.2019.8728388, 10.1109/icaccs.2019.8728388]