PREDICTION OF TYPE 2 DIABETES MELLITUS USING FEATURE SELECTION-BASED MACHINE LEARNING ALGORITHMS

Cited by: 2
Authors
Yilmaz, Atinc [1, 2]
Affiliations
[1] Beykent Univ, Dept Comp Engn, Istanbul, Turkey
[2] Beykent Univ, Dept Comp Engn, Hadim Koruyolu Cd 19, TR-34398 Istanbul, Turkey
Keywords
feature selection; health information system; type 2 diabetes; machine learning; nursing care; RISK; MODEL
DOI
10.5114/hpc.2022.114541
Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Abstract
Background. The aim of this study is to develop and evaluate a machine learning model for the early diagnosis of type 2 diabetes, so that treatment can begin in the early stages of the disease.

Material and methods. A hybrid machine learning model was developed and applied to the Early-stage diabetes risk prediction dataset from the UCI database. The prediction success of the proposed model was compared with that of other machine learning models. Pearson's correlation and SelectKBest feature selection methods were employed to examine the relationships between the dataset input parameters and the results.

Results. Of the 520 patients included in the dataset, 320 were diagnosed with diabetes and 328 (63.08%) were males. The most commonly observed diabetes diagnosis criterion was obesity (n=482, 83.08%). The strongest feature detected with Pearson's correlation was polyuria, whereas the strongest feature detected with SelectKBest was polydipsia. With Pearson's correlation-based feature selection, the most successful machine learning method was the proposed hybrid method, with an accuracy of 97.28%. Using SelectKBest feature selection, the same model predicted type 2 diabetes with an accuracy of 95.16%.

Conclusions. Early detection of type 2 diabetes will allow for prompter and more effective treatment of the patient. Thus, use of the proposed model may help to improve the quality of patient care and lower the number of deaths caused by this disease.
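The abstract names Pearson's correlation and SelectKBest as the feature selection methods but does not say which scoring function was paired with SelectKBest or how features were encoded. The following is only a minimal plain-Python sketch of the general idea: score each feature by the absolute value of its Pearson correlation with the diagnosis and keep the k highest-scoring ones. The function names `pearson` and `select_k_best` are hypothetical, and the toy values are invented for illustration; they are not taken from the UCI dataset or from the paper. In practice, scikit-learn's `SelectKBest` class provides this kind of top-k selection.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / sqrt(var_x * var_y)


def select_k_best(features, target, k):
    """Rank features by |Pearson r| against the target and keep the top k.

    Assumption: absolute Pearson correlation is used as the score; the
    source does not state which score function the study used.
    """
    scores = {name: abs(pearson(col, target)) for name, col in features.items()}
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:k], scores


# Invented toy data (1 = symptom present / diabetic, 0 = absent / not diabetic).
toy_features = {
    "polyuria":   [1, 1, 1, 0, 0, 0, 1, 0],
    "polydipsia": [1, 1, 0, 0, 0, 0, 1, 1],
    "obesity":    [0, 1, 0, 1, 0, 1, 1, 0],
}
toy_target = [1, 1, 1, 0, 0, 0, 1, 0]

selected, scores = select_k_best(toy_features, toy_target, k=2)
print(selected)
for name, score in scores.items():
    print(f"{name}: {score:.2f}")
```

The selected feature names would then be used to subset the input columns before training a classifier. Different scoring functions can rank features differently, which is consistent with the abstract reporting a different strongest feature for each method.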
Pages: 128-139
Number of pages: 12