Diagnosis of Diabetes by Applying Data Mining Classification Techniques Comparison of Three Data Mining Algorithms

被引:0
作者
Daghistani, Tahani [1 ]
Alshammari, Riyad [1 ]
机构
[1] King Saud Bin Abdulaziz Univ Hlth Sci KSAU HS, King Abdullah Int Med Res Ctr, Hlth Informat Dept, Coll Publ Hlth & Hlth Informat, Riyadh, Saudi Arabia
关键词
Diabetes; Data mining; Self-Organizing Map; Decision tree; Classification;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Health care data are often huge, complex and heterogeneous because it contains different variable types and missing values as well. Nowadays, knowledge from such data is a necessity. Data mining can be utilized to extract knowledge by constructing models from health care data such as diabetic patient data sets. In this research, three data mining algorithms, namely Self-Organizing Map (SOM), C4.5 and RandomForest, are applied on adult population data from Ministry of National Guard Health Affairs (MNGHA), Saudi Arabia to predict diabetic patients using 18 risk factors. RandomForest achieved the best performance compared to other data mining classifiers.
引用
收藏
页码:329 / 332
页数:4
相关论文
共 23 条
[1]  
Al Jarullah Asma A., 2011, 2011 International Conference on Innovations in Information Technology (IIT), P303, DOI 10.1109/INNOVATIONS.2011.5893838
[2]   Effects of supervised Self Organising Maps parameters on classification performance [J].
Ballabio, Davide ;
Vasighi, Mandi ;
Filzmoser, Peter .
ANALYTICA CHIMICA ACTA, 2013, 765 :45-53
[3]   Intelligible Support Vector Machines for Diagnosis of Diabetes Mellitus [J].
Barakat, Nahla H. ;
Bradley, Andrew P. ;
Barakat, Mohamed Nabil H. .
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2010, 14 (04) :1114-1120
[4]  
Central Department of Statistics & Information (CDSI), 2016, STAT YB
[5]   Predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from Kuwait-a cohort study [J].
Farran, Bassam ;
Channanath, Arshad Mohamed ;
Behbehani, Kazem ;
Thanaraj, Thangavel Alphonse .
BMJ OPEN, 2013, 3 (05)
[6]   A fuzzy classification system based on Ant Colony Optimization for diabetes disease diagnosis [J].
Ganji, Mostafa Fathi ;
Abadeh, Mohammad Saniee .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) :14650-14659
[7]   Feature selection and classification model construction on type 2 diabetic patients' data [J].
Huang, Yue ;
McCullagh, Paul ;
Black, Norman ;
Harper, Roy .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2007, 41 (03) :251-262
[8]  
Johannes R. S., 1988, J HOPKINS APL TECH D, V10, P262
[9]   Cost-Effectiveness of Interventions to Prevent and Control Diabetes Mellitus: A Systematic Review [J].
Li, Rui ;
Zhang, Ping ;
Barker, Lawrence E. ;
Chowdhury, Farah M. ;
Zhang, Xuanping .
DIABETES CARE, 2010, 33 (08) :1872-1894
[10]  
Lin Jau-Huei, 2006, AMIA Annu Symp Proc, P489