Performance analysis of data mining classification algorithms for early prediction of diabetes mellitus 2

被引:1
作者
Devi, R. Delshi Howsalya [1 ]
Vijayalakshmi, P. R. [1 ]
机构
[1] KLN Coll Engn, Dept Comp Sci & Engn, Madurai, Tamil Nadu, India
关键词
diabetes mellitus; classification; support vector machine; SVM; AdaBoost; naive Bayes; NB; J48; random tree; random forest; OneR; data mining; SUPPORT VECTOR MACHINES; RULE EXTRACTION; DIAGNOSIS; SYSTEM;
D O I
10.1504/IJBET.2021.116097
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Diabetes mellitus (DM) generally referred to as diabetes. It is a group of metabolic infection in which there are high blood sugar levels over a prolonged period. Data mining is used for predicting various diseases. From many methods of data mining, classification is one of the main techniques. The classification techniques are used to classify the hidden information in all areas including medical diagnostic field. In this research work, we compare the machine learning classifiers (naive Bayes, J48 decision tree, OneR, AdaBoost, random forest, random tree and support vector machines) to classify the patients into diabetic and non-diabetic mellitus. These algorithms have been tested with data samples downloaded from UCI. The performances of the algorithms have been considered in both the cases, i.e., data samples with noisy data and data samples set without noisy data. Results are evaluated in terms of accuracy, sensitivity, and specificity. Experimental results suggested that, support vector machine (SVM) classifier is the best classifier for predicting diabetes mellitus 2.
引用
收藏
页码:148 / 171
页数:24
相关论文
共 40 条
[1]  
[Anonymous], 2014, International Journal of Computer Science and Information Technologies
[2]   MD-Logic Artificial Pancreas System A pilot study in adults with type 1 diabetes [J].
Atlas, Eran ;
Nimri, Revital ;
Miller, Shahar ;
Grunberg, Eli A. ;
Phillip, Moshe .
DIABETES CARE, 2010, 33 (05) :1072-1076
[3]  
Barakat NH, 2007, IEEE T KNOWL DATA EN, V19, P729, DOI [10.1109/TKDE.2007.1023, 10.1109/TKDE.2007.1023.]
[4]   Intelligible Support Vector Machines for Diagnosis of Diabetes Mellitus [J].
Barakat, Nahla H. ;
Bradley, Andrew P. ;
Barakat, Mohamed Nabil H. .
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2010, 14 (04) :1114-1120
[5]   Bayesian analysis of blood glucose time series from diabetes home monitoring [J].
Bellazzi, R ;
Magni, P ;
De Nicolao, G .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2000, 47 (07) :971-975
[6]   Predictive data mining in clinical medicine: Current issues and guidelines [J].
Bellazzi, Riccardo ;
Zupan, Blaz .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2008, 77 (02) :81-97
[7]   The phantom burster model for pancreatic β-cells [J].
Bertram, R ;
Previte, J ;
Sherman, A ;
Kinard, TA ;
Satin, LS .
BIOPHYSICAL JOURNAL, 2000, 79 (06) :2880-2892
[8]  
Bhonde A., 2016, INT J INNOVATIVE RES, V4, P7497
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]  
Breiman L., 2007, Random Forest