Pima Indians diabetes mellitus classification based on machine learning (ML) algorithms

被引:72
作者
Chang, Victor [1 ]
Bailey, Jozeene [2 ]
Xu, Qianwen Ariel [2 ]
Sun, Zhili [3 ]
机构
[1] Aston Univ, Aston Business Sch, Dept Operat & Informat Management, Birmingham, W Midlands, England
[2] Teesside Univ, Sch Comp & Digital Technol, Cybersecur Informat Syst & AI Res Grp, Middlesbrough, Cleveland, England
[3] Univ Surrey, Inst Commun Syst ICS, 5G & 6G Innovat Ctr 5G & 6GIC, Guildford, Surrey, England
关键词
Diabetes mellitus; The Internet of Medical Things (IoMT); Machine learning; Interpretable artificial intelligence; DIAGNOSIS; INTERNET;
D O I
10.1007/s00521-022-07049-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an e-diagnosis system based on machine learning (ML) algorithms to be implemented on the Internet of Medical Things (IoMT) environment, particularly for diagnosing diabetes mellitus (type 2 diabetes). However, the ML applications tend to be mistrusted because of their inability to show the internal decision-making process, resulting in slow uptake by end-users within certain healthcare sectors. This research delineates the use of three interpretable supervised ML models: Naive Bayes classifier, random forest classifier, and J48 decision tree models to be trained and tested using the Pima Indians diabetes dataset in R programming language. The performance of each algorithm is analyzed to determine the one with the best accuracy, precision, sensitivity, and specificity. An assessment of the decision process is also made to improve the model. It can be concluded that a Naive Bayes model works well with a more fine-tuned selection of features for binary classification, while random forest works better with more features.
引用
收藏
页码:16157 / 16173
页数:17
相关论文
共 36 条
[1]  
Abdar M, 2019, 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), P26, DOI [10.1109/ccoms.2019.8821633, 10.1109/CCOMS.2019.8821633]
[2]  
Adeniyi E, 2021, IOT HEALTHCARE AMBIE, P103
[3]  
Akbar R, 2020, INT C INF TECH SYST, P8, DOI [10.1109/icitsi50517.2020.9264938, 10.1109/ICITSI50517.2020.9264938]
[4]   IoMT-SAF: Internet of Medical Things Security Assessment Framework [J].
Alsubaei, Faisal ;
Abuhussein, Abdullah ;
Shandilya, Vivek ;
Shiva, Sajjan .
INTERNET OF THINGS, 2019, 8
[5]   An explainable XGBoost-based approach towards assessing the risk of cardiovascular disease in patients with Type 2 Diabetes Mellitus [J].
Athanasiou, Maria ;
Sfrintzeri, Konstantina ;
Zarkogianni, Konstantia ;
Thanopoulou, Anastasia C. ;
Nikita, Konstantina S. .
2020 IEEE 20TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE 2020), 2020, :859-864
[6]  
Burdi F, 2016, INT CONF INFORM COMM, P281, DOI [10.1109/ICT4M.2016.064, 10.1109/ICT4M.2016.57]
[7]  
Cheng D., 2020, SOLID STATE TECHNOL, V63, P2780
[8]  
Daniati Erna, 2019, 2019 International Conference of Artificial Intelligence and Information Technology (ICAIIT). Proceedings, P238, DOI 10.1109/ICAIIT.2019.8834474
[9]  
Divya K., 2021, COGNITIVE INTERNET M, P145, DOI [10.1007/978-3-030-55833-8_9, DOI 10.1007/978-3-030-55833-8_9]
[10]  
Gomathi S., 2015, 2015 International Conference on Advanced Computing and Communication Systems (ICACCS). Proceedings, P1, DOI 10.1109/ICACCS.2015.7324054