Using a robust model to detect the association between anthropometric factors and T2DM: machine learning approaches

被引:1
|
作者
Hosseini, Nafiseh [1 ,2 ]
Tanzadehpanah, Hamid [3 ,4 ,5 ]
Mansoori, Amin [6 ]
Sabzekar, Mostafa [7 ]
Ferns, Gordon A. [8 ]
Esmaily, Habibollah [9 ,10 ]
Ghayour-Mobarhan, Majid [1 ,4 ]
机构
[1] Mashhad Univ Med Sci, Int UNESCO Ctr Hlth Related Basic Sci & Human Nutr, Mashhad 9919991766, Iran
[2] Mashhad Univ Med Sci, Fac Med, Dept Med Informat, Mashhad, Iran
[3] Mashhad Univ Med Sci, Antimicrobial Resistance Res Ctr, Mashhad, Iran
[4] Mashhad Univ Med Sci, Metab Syndrome Res Ctr, Mashhad, Iran
[5] Mashhad Univ Med Sci, Basic Sci Res Inst, Mashhad, Iran
[6] Ferdowsi Univ Mashhad, Sch Math Sci, Dept Appl Math, Mashhad, Iran
[7] Birjand Univ Technol, Dept Comp Engn, Birjand, Iran
[8] Brighton & Sussex Med Sch, Div Med Educ, Brighton, England
[9] Mashhad Univ Med Sci, Sch Hlth, Dept Biostat, Mashhad, Iran
[10] Mashhad Univ Med Sci, Social Determinants Hlth Res Ctr, Mashhad, Iran
关键词
TYPE-2; DIABETES-MELLITUS;
D O I
10.1186/s12911-025-02887-y
中图分类号
R-058 [];
学科分类号
摘要
BackgroundThe aim of this study was to evaluate the potential models to determine the most important anthropometric factors associated with type 2 diabetes mellitus (T2DM).MethodA dataset derived from the Mashhad Stroke and heart atherosclerotic disorders (MASHAD) study comprising 9354 subject aged 65 - 35. 25% (2336 people) of subjects were diabetic and 75% (7018 people) where non-diabetic was used for the analysis of 10 anthropometric factors and age that were measured in all patients. A K-nearest neighbor (KNN) model was used to assess the association between T2DM and selected factors. The model was evaluated using accuracy, sensitivity, specificity, precision and f1-measure parameters. The receiver operating characteristic (ROC) curve and factor importance analysis were also determined. The performance of the KNN model was compared with Artificial neural network (ANN) and support vector machine (SVM) models.ResultAfter feature selection analysis and assessing multicollinearity, six factors (Mid-arm Circumference (MAC), Waist Circumference (WC), Body Roundness Index (BRI), Body Adiposity Index (BAI), Body Mass Index (BMI), age) were used in the final model. BRI, BAI and MAC factors in males and BMI, BRI, and MAC factors in females were found to have the greatest association with T2DM. The accuracy of the KNN model was approximately 93% for both genders. The best K (number of neighbors) for the model was 4 which had the lowest error rate. The area under the ROC curve (AUC) was 0.985 for men and 0.986 for women. The KNN model achieved the best result of the models explored.ConclusionThe KNN model had a high accuracy (93%) for predicting the association between anthropometric factors and T2DM. Selecting the K parameter (nearest neighbor) has an essential impact on reducing the error rate. Feature selection analysis reduces the dimensions of the KNN model and increases the accuracy of final results.
引用
收藏
页数:10
相关论文
共 15 条
  • [1] Association between circulating betatrophin levels and T2DM: A meta-analysis
    Zhan, Yuquan
    MINERVA ENDOCRINOLOGY, 2024,
  • [2] Association between sociodemographic status and the T2DM-related risks in China: implication for reducing T2DM disease burden
    Huang, Xin
    He, Yinhui
    Xu, Haiyan
    Shen, Yuyan
    Pan, Xiaowen
    Wu, Junyun
    Chen, Kai
    FRONTIERS IN PUBLIC HEALTH, 2024, 11
  • [3] The mediating effect of TyG and its derived indices in the association between OSAHS and atherosclerosis in patients with T2DM
    Ding, Ling
    Jiang, Xiaohong
    SLEEP AND BREATHING, 2024, 28 (06) : 2469 - 2479
  • [4] Association Between Thyroid Function and Body Composition in Type 2 Diabetes Mellitus (T2DM) Patients: Does Sex Have a Role?
    Chen, Lu
    Zhang, Min
    Xiang, Shoukui
    Jiang, Xiaohong
    Gu, Hongliu
    Sha, Qi
    Qu, Meidi
    Xu, Ting
    MEDICAL SCIENCE MONITOR, 2021, 27
  • [5] Predictive models and determinants of mortality among T2DM patients in a tertiary hospital in Ghana, how do machine learning techniques perform?
    Kpene, Godsway Edem
    Lokpo, Sylvester Yao
    Darfour-Oduro, Sandra A.
    BMC ENDOCRINE DISORDERS, 2025, 25 (01)
  • [6] Association Between Waist Circumference and Serum Triacylglcerol Status in Type 2 Diabetes Mellitus (T2DM): A Study Performed on a Rural Population in Puducherry
    Srinivasan, Arunkumar
    Joshi, Prabal
    Raghavan, Srinivasan A.
    JOURNAL OF CLINICAL AND DIAGNOSTIC RESEARCH, 2013, 7 (04) : 631 - 634
  • [7] The association between Toxoplasma gondii infection and hypertensive disorders in T2DM patients: a case-control study in the Han Chinese population
    Han, Yajing
    Nie, Lihong
    Ye, Xiaohong
    Zhou, Zixing
    Huang, Shiqi
    Zeng, Chengli
    Guo, Congcong
    Ou, Meiling
    Xiao, Di
    Zhang, Baohuan
    Huang, Chuican
    Ye, Xingguang
    Jing, Chunxia
    Yang, Guang
    PARASITOLOGY RESEARCH, 2018, 117 (03) : 689 - 695
  • [8] Association between organic cation transporter genetic polymorphisms and metformin response and intolerance in T2DM individuals: a systematic review and meta-analysis
    Peng, Aiyu
    Gong, Chunmei
    Xu, Yuanfei
    Liang, Xiongshun
    Chen, Xiaoping
    Hong, Wenxu
    Yan, Junxia
    FRONTIERS IN PUBLIC HEALTH, 2023, 11
  • [9] Lack of association between single nucleotide polymorphisms in TCF7L2 and T2DM in the Chinese Yao population A case-control study
    Sun, Shi-Yu
    Huang, Run-Ze
    Huang, Huang
    Zhang, Ming-Qi
    Sun, Hui-Lin
    MEDICINE, 2021, 100 (12) : E25326
  • [10] Investigating the association of CD36 gene polymorphisms (rs1761667 and rs1527483) with T2DM and dyslipidemia: Statistical analysis, machine learning based prediction, and meta-analysis
    Hatmal, Ma'mon M.
    Alshaer, Walhan
    Mahmoud, Ismail S.
    Al-Hatamleh, Mohammad A. I.
    Al-Ameer, Hamzeh J.
    Abuyaman, Omar
    Zihlif, Malek
    Mohamud, Rohimah
    Darras, Mais
    Al Shhab, Mohammad
    Abu-Raideh, Rand
    Ismail, Hilweh
    Al-Hamadi, Ali
    Abdelhay, Ali
    PLOS ONE, 2021, 16 (10):