Identification of People with Diabetes Treatment through Lipids Profile Using Machine Learning Algorithms

被引:4
作者
Alcala-Rmz, Vanessa [1 ]
Galvan-Tejada, Carlos E. [1 ]
Garcia-Hernandez, Alejandra [1 ]
Valladares-Salgado, Adan [2 ]
Cruz, Miguel [2 ]
Galvan-Tejada, Jorge I. [1 ]
Celaya-Padilla, Jose M. [1 ]
Luna-Garcia, Huizilopoztli [1 ]
Gamboa-Rosales, Hamurabi [1 ]
机构
[1] Univ Autonoma Zacatecas, Unidad Acad Ingn Elect, Jardin Juarez 147, Zacatecas 98000, Zacatecas, Mexico
[2] Inst Mexicano Seguro Social, Hosp Especialidades, Ctr Medico Nacl Siglo 21, Unidad Invest Med Bioquim, Av Cuauhtemoc 330, Mexico City 06720, DF, Mexico
关键词
type; 2; diabetes; diabetic treatment; logistic regression; random forest; K-nearest neighbor; decision trees; computer-aided diagnosis; statistical analysis; TYPE-2; MEXICO;
D O I
10.3390/healthcare9040422
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Diabetes incidence has been a problem, because according with the World Health Organization and the International Diabetes Federation, the number of people with this disease is increasing very fast all over the world. Diabetic treatment is important to prevent the development of several complications, also lipid profile monitoring is important. For that reason the aim of this work is the implementation of machine learning algorithms that are able to classify cases, that corresponds to patients diagnosed with diabetes that have diabetes treatment, and controls that refers to subjects who do not have diabetes treatment but some of them have diabetes, bases on lipids profile levels. Logistic regression, K-nearest neighbor, decision trees and random forest were implemented, all of them were evaluated with accuracy, sensitivity, specificity and AUC-ROC curve metrics. Artificial neural network obtain an acurracy of 0.685 and an AUC value of 0.750, logistic regression achieve an accuracy of 0.729 and an AUC value of 0.795, K-nearest neighbor gets an accuracy of 0.669 and an AUC value of 0.709, on the other hand, decision tree reached an accuracy pg 0.691 and a AUC value of 0.683, finally random forest achieve an accuracy of 0.704 and an AUC curve of 0.776. The performance of all models was statistically significant, but the best performance model for this problem corresponds to logistic regression.
引用
收藏
页数:14
相关论文
共 43 条
[1]  
Abedin M.M., 2019, P 7 INT C DAT SCI SD
[2]   Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research [J].
Agatonovic-Kustrin, S ;
Beresford, R .
JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2000, 22 (05) :717-727
[3]  
Alam F., 2020, SMART INFRASTRUCTURE, P135
[4]   Identification of Diabetic Patients through Clinical and Para-Clinical Features in Mexico: An Approach Using Deep Neural Networks [J].
Alcala-Rmz, Vanessa ;
Zanella-Calzada, Laura A. ;
Galvan-Tejada, Carlos E. ;
Garcia-Hernandez, Alejandra ;
Cruz, Miguel ;
Valladares-Salgado, Adan ;
Galvan-Tejada, Jorge I. ;
Gamboa-Rosales, Hamurabi .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2019, 16 (03)
[5]  
Almatrooshi F., 2020, P THE 25 INT C 3D WE, P1
[6]  
Amer Diabet Assoc, 2014, DIABETES CARE, V37, pS81, DOI [10.2337/dc14-S081, 10.2337/dc11-S011, 10.2337/dc13-S011, 10.2337/dc11-S062, 10.2337/dc13-S067, 10.2337/dc10-S062, 10.2337/dc10-S011, 10.2337/dc12-s064, 10.2337/dc12-s011]
[7]  
Assegie TA, 2019, Int J Electr Comput Eng (IJECE), V9, P4446
[8]   Glycemic control and lipid outcomes in children and adolescents with type 2 diabetes [J].
Barr, Mary Margaret ;
Aslibekyan, Stella ;
Ashraf, Ambika P. .
PLOS ONE, 2019, 14 (07)
[9]  
Carlini N., 2017, P 2017 IEEE S SEC PR
[10]  
Chollet F., Keras: Deep Learning Library for Theano and Tensorflow