A Hybrid Machine-Learning Model Based on Global and Local Learner Algorithms for Diabetes Mellitus Prediction

被引:9
作者
Rufo, Derara Duba [1 ]
Debelee, Taye Girma [2 ]
Negera, Worku Gachena [3 ]
机构
[1] Dilla Univ, Dilla, Snnpr, Ethiopia
[2] Addis Ababa Sci & Technol Univ, Addis Ababa, Ethiopia
[3] Ethiopian Artificial Intelligence Ctr, Addis Ababa, Ethiopia
关键词
Diabetes Prediction; Data Mining; Synthetic Minority Oversampling; Global Learning; Local Learning; Stacking; CLASSIFICATION;
D O I
10.4028/www.scientific.net/JBBBE.54.65
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Health is a critical condition for living things even before the technology exists. Nowadays the healthcare domain provides a lot of scope for research as it has extremely evolved. The most researched areas of health sectors include diabetes mellitus (DM), breast cancer, brain tumor, etc. DM is a severe chronic disease that affects human health and has a high rate throughout the world. Early prediction of DM is important to reduce its risk and even avoid it. In this study, we propose a DM prediction model based on global and local learner algorithms. The proposed global and local learners stacking (GLLS) model; combines the prediction algorithms from two largely different but complementary machine learning paradigms, specifically XGBoost and NB from global learning whereas KNN and SVM (with RBF kernel) from local learning and aggregates them by stacking ensemble technique using LR as meta-learner. The effectiveness of the GLLS model was proved by comparing several performance measures and the results of different contrast experiments. The evaluation results on UCI Pima Indian diabetes data-set (PIDD) indicates the model has achieved the better prediction performance of 99.5%, 99.5%, 99.5%, 99.1%, and 100% in terms of accuracy, AUC, F1 score, sensitivity, and specificity respectively, compared to other research results mentioned in the literature. Moreover, to better validate the GLLS model performance, three additional medical data sets; Messidor, WBC, ILPD, are considered and the model also achieved an accuracy of 82.1%, 98.6%, and 89.3% respectively. Experimental results proved the effectiveness and superiority of our proposed GLLS model.
引用
收藏
页码:65 / 88
页数:24
相关论文
共 46 条
[1]   Detection of Bacterial Wilt on Enset Crop Using Deep Learning Approach [J].
Afework, Yidnekachew Kibru ;
Debelee, Taye Girma .
INTERNATIONAL JOURNAL OF ENGINEERING RESEARCH IN AFRICA, 2020, 51 :131-146
[2]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[3]  
Alam T. M., 2019, Informatics in Medicine Unlocked, V16
[4]   Predicting diabetes mellitus using SMOTE and ensemble machine learning approach: The Henry Ford ExercIse Testing (FIT) project [J].
Alghamdi, Manal ;
Al-Mallah, Mouaz ;
Keteylan, Steven ;
Brawner, Clinton ;
Ehrman, Jonathan ;
Sakr, Sherif .
PLOS ONE, 2017, 12 (07)
[5]  
Bhavana N., 2018, INT J FUTURE REVOLUT, V4, P463
[6]   Enhanced Region Growing for Brain Tumor MR Image Segmentation [J].
Biratu, Erena Siyoum ;
Schwenker, Friedhelm ;
Debelee, Taye Girma ;
Kebede, Samuel Rahimeto ;
Negera, Worku Gachena ;
Molla, Hasset Tamirat .
JOURNAL OF IMAGING, 2021, 7 (02)
[7]   Prediction and diagnosis of future diabetes risk: a machine learning approach [J].
Birjais, Roshan ;
Mourya, Ashish Kumar ;
Chauhan, Ritu ;
Kaur, Harleen .
SN APPLIED SCIENCES, 2019, 1 (09)
[8]  
Chawla NV, 2010, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, SECOND EDITION, P875, DOI 10.1007/978-0-387-09823-4_45
[9]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[10]  
Chhabra G., 2017, Indian Journal of Science and Technology, V10, P1, DOI DOI 10.17485/ijst/2017/v10i19/110646