Metabolic Syndrome and Development of Diabetes Mellitus: Predictive Modeling Based on Machine Learning Techniques

被引:32
|
作者
Perveen, Sajida [1 ]
Shahbaz, Muhammad [1 ,2 ]
Keshavjee, Karim [2 ,3 ]
Guergachi, Aziz [2 ,4 ,5 ]
机构
[1] Univ Engn & Technol, Dept Comp Sci & Engn, Lahore 54890, Pakistan
[2] Ryerson Univ, Res Lab Adv Syst Modelling, Toronto, ON M5B 2K3, Canada
[3] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON M5S, Canada
[4] Ryerson Univ, Ted Rogers Sch Informat Technol Management, Toronto, ON M5B 2K3, Canada
[5] York Univ, Dept Math & Stat, Toronto, ON M3J 1P3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Metabolic syndrome; decision tree; the National Heart; Lungs and Blood Institute and American Heart Association (NHLBI) and American Heart Association (AHA); diagnostic prediction; data sampling methods; K-medoids; random under sampling; over sampling; CORONARY-HEART-DISEASE; CARDIOVASCULAR-DISEASE; INDEPENDENT ASSOCIATIONS; PERFORMANCE ANALYSIS; SYNDROME SEVERITY; RISK; CLASSIFICATION; SCORE; PIOGLITAZONE; CHOLESTEROL;
D O I
10.1109/ACCESS.2018.2884249
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of this inductive research was to investigate: 1) the relationship between diabetes mellitus and individual risk factors of metabolic syndrome (MetS), in a non-conservative setting; 2) the prediction of future onset of diabetes using relevant risk factors of MetS; and 3) to investigate the relative performance of machine learning methods when data sampling techniques are used to generate balanced training sets. The dataset used in this research contains 667 907 records for a period ranging from 2003 to 2013. Quantifying the contribution of individual risk factors of MetS in the development of diabetes in a non-conservative setting logistic regression analysis was performed. Our analyses contradict the view that diabetes is commonly associated with low levels of high-density lipoprotein (HDL). Instead, our results demonstrate that the increased levels of HDL are positively correlated with diabetes onset, particularly in women. We also proposed J48 decision tree and Naive Bayes methods for prediction of future onset of diabetes using relevant risk factors obtained from logistic regression analysis, over balanced and unbalanced datasets. The results demonstrated the supremacy of Naive Bayes with K-medoids under-sampling technique as compared to random under-sampling, oversampling, and no sampling. It is achieved on average 79% receiver operating characteristic performance with the increased true positive rate. The results of this paper suggest further research to clarify the pathophysiological significance of HDL and pathways in the development of diabetes.
引用
收藏
页码:1365 / 1375
页数:11
相关论文
共 50 条
  • [1] Predictive models for diabetes mellitus using machine learning techniques
    Lai, Hang
    Huang, Huaxiong
    Keshavjee, Karim
    Guergachi, Aziz
    Gao, Xin
    BMC ENDOCRINE DISORDERS, 2019, 19 (01)
  • [2] Predictive models for diabetes mellitus using machine learning techniques
    Hang Lai
    Huaxiong Huang
    Karim Keshavjee
    Aziz Guergachi
    Xin Gao
    BMC Endocrine Disorders, 19
  • [3] Predictive modeling for the development of diabetes mellitus using key factors in various machine learning approaches
    Tanaka, Marenao
    Akiyama, Yukinori
    Mori, Kazuma
    Hosaka, Itaru
    Kato, Kenichi
    Endo, Keisuke
    Ogawa, Toshifumi
    Sato, Tatsuya
    Suzuki, Toru
    Yano, Toshiyuki
    Ohnishi, Hirofumi
    Hanawa, Nagisa
    Furuhashi, Masato
    DIABETES EPIDEMIOLOGY AND MANAGEMENT, 2024, 13
  • [4] A Predictive Model for Diabetes Mellitus Using Machine Learning Techniques (A Study in Nigeria)
    Evwiekpaefe, Abraham Eseoghene
    Abdulkadir, Nafisat
    AFRICAN JOURNAL OF INFORMATION SYSTEMS, 2023, 15 (01):
  • [5] Diabetes Mellitus Disease Prediction and Type Classification Involving Predictive Modeling Using Machine Learning Techniques and Classifiers
    Ahamed, B. Shamreen
    Arya, Meenakshi S.
    Sangeetha, S. K. B.
    Auxilia Osvin, Nancy V.
    APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2022, 2022
  • [6] Machine Learning Techniques for Diabetes Mellitus Based on Lifestyle Predictors
    Ansari, Gufran Ahmad
    Bhat, Salliah Shafi
    Ansari, Mohd Dilshad
    RECENT ADVANCES IN ELECTRICAL & ELECTRONIC ENGINEERING, 2024,
  • [7] The Use of Machine Learning Techniques to Determine the Predictive Value of Inflammatory Biomarkers in the Development of Type 2 Diabetes Mellitus
    Garcia-Carretero, Rafael
    Vigil-Medina, Luis
    Barquero-Perez, Oscar
    METABOLIC SYNDROME AND RELATED DISORDERS, 2021, 19 (04) : 240 - 248
  • [8] Predicting Diabetes Mellitus With Machine Learning Techniques
    Zou, Quan
    Qu, Kaiyang
    Luo, Yamei
    Yin, Dehui
    Ju, Ying
    Tang, Hua
    FRONTIERS IN GENETICS, 2018, 9
  • [9] Predictive Supervised Machine Learning Models for Diabetes Mellitus
    Muhammad L.J.
    Algehyne E.A.
    Usman S.S.
    SN Computer Science, 2020, 1 (5)
  • [10] Machine learning-based predictive model for prevention of metabolic syndrome
    Shin, Hyunseok
    Shim, Simon
    Oh, Sejong
    PLOS ONE, 2023, 18 (06):