Metabolic Syndrome and Development of Diabetes Mellitus: Predictive Modeling Based on Machine Learning Techniques

被引:32
|
作者
Perveen, Sajida [1 ]
Shahbaz, Muhammad [1 ,2 ]
Keshavjee, Karim [2 ,3 ]
Guergachi, Aziz [2 ,4 ,5 ]
机构
[1] Univ Engn & Technol, Dept Comp Sci & Engn, Lahore 54890, Pakistan
[2] Ryerson Univ, Res Lab Adv Syst Modelling, Toronto, ON M5B 2K3, Canada
[3] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON M5S, Canada
[4] Ryerson Univ, Ted Rogers Sch Informat Technol Management, Toronto, ON M5B 2K3, Canada
[5] York Univ, Dept Math & Stat, Toronto, ON M3J 1P3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Metabolic syndrome; decision tree; the National Heart; Lungs and Blood Institute and American Heart Association (NHLBI) and American Heart Association (AHA); diagnostic prediction; data sampling methods; K-medoids; random under sampling; over sampling; CORONARY-HEART-DISEASE; CARDIOVASCULAR-DISEASE; INDEPENDENT ASSOCIATIONS; PERFORMANCE ANALYSIS; SYNDROME SEVERITY; RISK; CLASSIFICATION; SCORE; PIOGLITAZONE; CHOLESTEROL;
D O I
10.1109/ACCESS.2018.2884249
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of this inductive research was to investigate: 1) the relationship between diabetes mellitus and individual risk factors of metabolic syndrome (MetS), in a non-conservative setting; 2) the prediction of future onset of diabetes using relevant risk factors of MetS; and 3) to investigate the relative performance of machine learning methods when data sampling techniques are used to generate balanced training sets. The dataset used in this research contains 667 907 records for a period ranging from 2003 to 2013. Quantifying the contribution of individual risk factors of MetS in the development of diabetes in a non-conservative setting logistic regression analysis was performed. Our analyses contradict the view that diabetes is commonly associated with low levels of high-density lipoprotein (HDL). Instead, our results demonstrate that the increased levels of HDL are positively correlated with diabetes onset, particularly in women. We also proposed J48 decision tree and Naive Bayes methods for prediction of future onset of diabetes using relevant risk factors obtained from logistic regression analysis, over balanced and unbalanced datasets. The results demonstrated the supremacy of Naive Bayes with K-medoids under-sampling technique as compared to random under-sampling, oversampling, and no sampling. It is achieved on average 79% receiver operating characteristic performance with the increased true positive rate. The results of this paper suggest further research to clarify the pathophysiological significance of HDL and pathways in the development of diabetes.
引用
收藏
页码:1365 / 1375
页数:11
相关论文
共 50 条
  • [41] Predicting metabolic syndrome: Machine learning techniques for improved preventive medicine
    Goldman, Orit
    Ben-Assuli, Ofir
    Ababa, Shimon
    Rogowski, Ori
    Berliner, Shlomo
    HEALTH INFORMATICS JOURNAL, 2025, 31 (01)
  • [42] Predictive factors of response to liraglutide in patients with type 2 diabetes mellitus and metabolic syndrome
    Song, Jinfang
    Li, Na
    Zhuang, Yongru
    Chen, Ya
    Zhang, Chu
    Zhu, Jian
    FRONTIERS IN ENDOCRINOLOGY, 2024, 15
  • [43] A cohort study on the predictive capability of body composition for diabetes mellitus using machine learning
    Nematollahi, Mohammad Ali
    Askarinejad, Amir
    Asadollahi, Arefeh
    Bazrafshan, Mehdi
    Sarejloo, Shirin
    Moghadami, Mana
    Sasannia, Sarvin
    Farjam, Mojtaba
    Homayounfar, Reza
    Pezeshki, Babak
    Amini, Mitra
    Roshanzamir, Mohamad
    Alizadehsani, Roohallah
    Bazrafshan, Hanieh
    Drissi, Hamed Bazrafshan
    Tan, Ru-San
    Acharya, U. Rajendra
    Islam, Mohammed Shariful Sheikh
    JOURNAL OF DIABETES AND METABOLIC DISORDERS, 2024, 23 (01) : 189 - 198
  • [44] Comparison of Machine Learning Techniques for Predictive Modeling of High-Speed Links
    Ma, Hanzhi
    Li, Er-Ping
    Cangellaris, Andreas C.
    Chen, Xu
    2019 IEEE 28TH CONFERENCE ON ELECTRICAL PERFORMANCE OF ELECTRONIC PACKAGING AND SYSTEMS (EPEPS 2019), 2019,
  • [45] Predictive Modeling of Crop Yield in Precision Agriculture Using Machine Learning Techniques
    Raj, G. Bhupal
    EswararaoBoddepalli
    Veena, C. H.
    Manjunatha
    Singla, Atul
    Dhanraj, JoshuvaArockia
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [46] Predictive modeling of photovoltaic system cleaning schedules using machine learning techniques
    Abuzaid, Haneen
    Awad, Mahmoud
    Shamayleh, Abdulrahim
    Alshraideh, Hussam
    RENEWABLE ENERGY, 2025, 239
  • [47] Unraveling the Factors Determining Development of Type 2 Diabetes in Women With a History of Gestational Diabetes Mellitus Through Machine-Learning Techniques
    Ilari, Ludovica
    Piersanti, Agnese
    Goebl, Christian
    Burattini, Laura
    Kautzky-Willer, Alexandra
    Tura, Andrea
    Morettini, Micaela
    FRONTIERS IN PHYSIOLOGY, 2022, 13
  • [48] Advancing sustainable mobility: Dynamic predictive modeling of charging cycles in electric vehicles using machine learning techniques and predictive application development
    Paneru, Biplov
    Mainali, Durga Prasad
    Paneru, Bishwash
    Sapkota, Sanjog Chhetri
    SYSTEMS AND SOFT COMPUTING, 2024, 6
  • [49] Machine Learning-Based Predictive Modeling of Postpartum Depression
    Shin, Dayeon
    Lee, Kyung Ju
    Adeluwa, Temidayo
    Hur, Junguk
    JOURNAL OF CLINICAL MEDICINE, 2020, 9 (09) : 1 - 14
  • [50] Non-invasive Diabetes Mellitus Detection System using Machine Learning Techniques
    Prabha, Anju
    Yadav, Jyoti
    Rani, Asha
    Singh, Vijander
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 948 - 953