Metabolic Syndrome and Development of Diabetes Mellitus: Predictive Modeling Based on Machine Learning Techniques

被引:33
作者
Perveen, Sajida [1 ]
Shahbaz, Muhammad [1 ,2 ]
Keshavjee, Karim [2 ,3 ]
Guergachi, Aziz [2 ,4 ,5 ]
机构
[1] Univ Engn & Technol, Dept Comp Sci & Engn, Lahore 54890, Pakistan
[2] Ryerson Univ, Res Lab Adv Syst Modelling, Toronto, ON M5B 2K3, Canada
[3] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON M5S, Canada
[4] Ryerson Univ, Ted Rogers Sch Informat Technol Management, Toronto, ON M5B 2K3, Canada
[5] York Univ, Dept Math & Stat, Toronto, ON M3J 1P3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Metabolic syndrome; decision tree; the National Heart; Lungs and Blood Institute and American Heart Association (NHLBI) and American Heart Association (AHA); diagnostic prediction; data sampling methods; K-medoids; random under sampling; over sampling; CORONARY-HEART-DISEASE; CARDIOVASCULAR-DISEASE; INDEPENDENT ASSOCIATIONS; PERFORMANCE ANALYSIS; SYNDROME SEVERITY; RISK; CLASSIFICATION; SCORE; PIOGLITAZONE; CHOLESTEROL;
D O I
10.1109/ACCESS.2018.2884249
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of this inductive research was to investigate: 1) the relationship between diabetes mellitus and individual risk factors of metabolic syndrome (MetS), in a non-conservative setting; 2) the prediction of future onset of diabetes using relevant risk factors of MetS; and 3) to investigate the relative performance of machine learning methods when data sampling techniques are used to generate balanced training sets. The dataset used in this research contains 667 907 records for a period ranging from 2003 to 2013. Quantifying the contribution of individual risk factors of MetS in the development of diabetes in a non-conservative setting logistic regression analysis was performed. Our analyses contradict the view that diabetes is commonly associated with low levels of high-density lipoprotein (HDL). Instead, our results demonstrate that the increased levels of HDL are positively correlated with diabetes onset, particularly in women. We also proposed J48 decision tree and Naive Bayes methods for prediction of future onset of diabetes using relevant risk factors obtained from logistic regression analysis, over balanced and unbalanced datasets. The results demonstrated the supremacy of Naive Bayes with K-medoids under-sampling technique as compared to random under-sampling, oversampling, and no sampling. It is achieved on average 79% receiver operating characteristic performance with the increased true positive rate. The results of this paper suggest further research to clarify the pathophysiological significance of HDL and pathways in the development of diabetes.
引用
收藏
页码:1365 / 1375
页数:11
相关论文
共 50 条
[1]   Metabolic syndrome and hyperglycemia: Congruence and divergence [J].
Alexander, Charles M. ;
Landsman, Pamela B. ;
Grundy, Scott M. .
AMERICAN JOURNAL OF CARDIOLOGY, 2006, 98 (07) :982-985
[2]   HDL cholesterol, very low levels of LDL cholesterol, and cardiovascular events [J].
Barter, Philip ;
Gotto, Antonio M. ;
LaRosa, John C. ;
Maroni, Jaman ;
Szarek, Michael ;
Grundy, Scott M. ;
Kastelein, John J. P. ;
Bittner, Vera ;
Fruchart, Jean-Charles .
NEW ENGLAND JOURNAL OF MEDICINE, 2007, 357 (13) :1301-1310
[3]   Uric acid and risk of new-onset metabolic syndrome, impaired fasting glucose and diabetes mellitus in a general Italian population: data from the Pressioni Arteriose Monitorate E Loro Associazioni study [J].
Bombelli, Michele ;
Quarti-Trevano, Fosca ;
Tadic, Marijana ;
Facchetti, Rita ;
Cuspidi, Cesare ;
Mancia, Giuseppe ;
Grassi, Guido .
JOURNAL OF HYPERTENSION, 2018, 36 (07) :1492-1498
[4]   Reporting and Methods in Clinical Prediction Research: A Systematic Review [J].
Bouwmeester, Walter ;
Zuithoff, Nicolaas P. A. ;
Mallett, Susan ;
Geerlings, Mirjam I. ;
Vergouwe, Yvonne ;
Steyerberg, Ewout W. ;
Altman, Douglas G. ;
Moons, Karel G. M. .
PLOS MEDICINE, 2012, 9 (05)
[5]   A systematic study of the class imbalance problem in convolutional neural networks [J].
Buda, Mateusz ;
Maki, Atsuto ;
Mazurowski, Maciej A. .
NEURAL NETWORKS, 2018, 106 :249-259
[6]   The metabolic syndrome as a predictor of incident diabetes mellitus in Mauritius [J].
Cameron, A. J. ;
Zimmet, P. Z. ;
Soderberg, S. ;
Alberti, K. G. M. M. ;
Sicree, R. ;
Tuomilehto, J. ;
Chitson, P. ;
Shaw, J. E. .
DIABETIC MEDICINE, 2007, 24 (12) :1460-1469
[7]  
Canlas Jr R. D., 2009, TECH REP
[8]   Metabolic syndrome and the risk of adverse cardiovascular events after an acute coronary syndrome [J].
Cavallari, Ilaria ;
Cannon, Christopher P. ;
Braunwald, Eugene ;
Goodrich, Erica L. ;
Im, KyungAh ;
Lukas, Mary Ann ;
O'Donoghue, Michelle L. .
EUROPEAN JOURNAL OF PREVENTIVE CARDIOLOGY, 2018, 25 (08) :830-838
[9]  
Ceska Richard, 2007, Diab Vasc Dis Res, V4 Suppl 3, pS2
[10]   High-density lipoprotein cholesterol in diabetes: Is higher always better? [J].
Costacou, Tina ;
Evans, Rhobert W. ;
Orchard, Trevor J. .
JOURNAL OF CLINICAL LIPIDOLOGY, 2011, 5 (05) :387-394