Machine learning algorithms for diabetes detection: a comparative evaluation of performance of algorithms

被引:6
作者
Saxena, Surabhi [1 ]
Mohapatra, Debashish [2 ]
Padhee, Subhransu [3 ]
Sahoo, Goutam Kumar [4 ]
机构
[1] Koneru Lakshmaiah Educ Fdn, Dept BCA, Guntur, Andhra Pradesh, India
[2] Natl Inst Technol Rourkela, Dept Elect Engn, Rourkela, Odisha, India
[3] Sambalpur Univ Inst Informat Technol, Dept Elect & Elect Engn, Burla, Odisha, India
[4] Natl Inst Technol Rourkela, Dept Elect & Commun Engn, Rourkela, Odisha, India
关键词
Machine learning; PIMA Indian diabetes dataset; ensemble; Stacked ensemble; CLINICAL MEDICINE; FEATURE-SELECTION; ENSEMBLES; CLASSIFICATION; CLASSIFIERS; MODELS;
D O I
10.1007/s12065-021-00685-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently machine learning algorithms are widely used for the prediction of different attributes, and these algorithms find widespread applications in a variety of domains. Machine learning in health care has been one of the core areas of research where machine learning models are used on the medical datasets to predict different attributes. This work provides a comparative evaluation of different classical as well as ensemble machine learning models, which are used to predict the risk of diabetes from two different datasets, i.e., PIMA Indian diabetes dataset and early-stage diabetes risk prediction dataset. From the comparative analysis, it is found that the superlearner model provides the best accuracy i.e. 86% for PIMA Indian diabetes dataset, and it provides 97% accuracy for diabetes risk prediction dataset.
引用
收藏
页码:587 / 603
页数:17
相关论文
共 40 条
[1]   Particularities of data mining in medicine: lessons learned from patient medical time series data analysis [J].
Aljawarneh, Shadi ;
Anguera, Aurea ;
Atwood, John William ;
Lara, Juan A. ;
Lizcano, David .
EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2019, 2019 (01)
[2]   Hybrid Cascade Forward Neural Network with Elman Neural Network for Disease Prediction [J].
Alkhasawneh, Mutasem Sh .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (11) :9209-9220
[3]   A survey of cross-validation procedures for model selection [J].
Arlot, Sylvain ;
Celisse, Alain .
STATISTICS SURVEYS, 2010, 4 :40-79
[4]   Predictive data mining in clinical medicine: Current issues and guidelines [J].
Bellazzi, Riccardo ;
Zupan, Blaz .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2008, 77 (02) :81-97
[5]   Predictive data mining in clinical medicine: a focus on selected methods and applications [J].
Bellazzi, Riccardo ;
Ferrazzi, Fulvia ;
Sacchi, Lucia .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (05) :416-430
[6]  
Ben-Gal I, 2005, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, P131, DOI 10.1007/0-387-25465-X_7
[7]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[8]   Gestational diabetes from A to Z [J].
Dirar, AbdelHameed Mirghani ;
Doupis, John .
WORLD JOURNAL OF DIABETES, 2017, 8 (12) :489-511
[9]   Artificial Intelligence (AI) based machine learning models predict glucose variability and hypoglycaemia risk in patients with type 2 diabetes on a multiple drug regimen who fast during ramadan (The PROFAST - IT Ramadan study) [J].
Elhadd, Tarik ;
Mall, Raghvendra ;
Bashir, Mohammed ;
Palotti, Joao ;
Fernandez-Luque, Luis ;
Farooq, Faisal ;
Al Mohanadi, Dabia ;
Dabbous, Zainab ;
Malik, Rayaz A. ;
Abou-Samra, Abdul Badi .
DIABETES RESEARCH AND CLINICAL PRACTICE, 2020, 169
[10]  
Fayyad U, 1996, AI MAG, V17, P37