Study on risk factors of impaired fasting glucose and development of a prediction model based on Extreme Gradient Boosting algorithm

被引:0
作者
Cui, Qiyuan [1 ]
Pu, Jianhong [1 ]
Li, Wei [2 ]
Zheng, Yun [1 ]
Lin, Jiaxi [3 ]
Liu, Lu [3 ]
Xue, Peng [4 ]
Zhu, Jinzhou [3 ]
He, Mingqing [1 ]
机构
[1] Soochow Univ, Dept Geriatr, Affiliated Hosp 1, Suzhou, Jiangsu, Peoples R China
[2] Nanjing Univ, Phys Examinat Ctr, Affiliated Suzhou Hosp, Med Sch, Suzhou, Jiangsu, Peoples R China
[3] Soochow Univ, Dept Gastroenterol, Affiliated Hosp 1, Suzhou, Jiangsu, Peoples R China
[4] Nanjing Univ, Dept Endocrinol, Affiliated Suzhou Hosp, Med Sch, Suzhou, Jiangsu, Peoples R China
来源
FRONTIERS IN ENDOCRINOLOGY | 2024年 / 15卷
关键词
impaired fasting glucose; prediction model; artificial intelligence; cohort study; middle-aged and elderly people; DIAGNOSIS; TOLERANCE;
D O I
10.3389/fendo.2024.1368225
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective The aim of this study was to develop and validate a machine learning-based model to predict the development of impaired fasting glucose (IFG) in middle-aged and older elderly people over a 5-year period using data from a cohort study.Methods This study was a retrospective cohort study. The study population was 1855 participants who underwent consecutive physical examinations at the First Affiliated Hospital of Soochow University between 2018 and 2022.The dataset included medical history, physical examination, and biochemical index test results. The cohort was randomly divided into a training dataset and a validation dataset in a ratio of 8:2. The machine learning algorithms used in this study include Extreme Gradient Boosting (XGBoost), Support Vector Machines (SVM), Naive Bayes, Decision Trees (DT), and traditional Logistic Regression (LR). Feature selection, parameter optimization, and model construction were performed in the training set, while the validation set was used to evaluate the predictive performance of the models. The performance of these models is evaluated by an area under the receiver operating characteristic (ROC) curves (AUC), calibration curves and decision curve analysis (DCA). To interpret the best-performing model, the Shapley Additive exPlanation (SHAP) Plots was used in this study.Results The training/validation dataset consists of 1,855 individuals from the First Affiliated Hospital of Soochow University, yielded significant variables following selection by the Boruta algorithm and logistic multivariate regression analysis. These significant variables included systolic blood pressure (SBP), fatty liver, waist circumference (WC) and serum creatinine (Scr). The XGBoost model outperformed the other models, demonstrating an AUC of 0.7391 in the validation set.Conclusions The XGBoost model was composed of SBP, fatty liver, WC and Scr may assist doctors with the early identification of IFG in middle-aged and elderly people.
引用
收藏
页数:13
相关论文
共 30 条
  • [1] Contributions of β-cell dysfunction and insulin resistance to the pathogenesis of impaired glucose tolerance and impaired fasting glucose
    Abdul-Ghani, MA
    Tripathy, D
    DeFronzo, RA
    [J]. DIABETES CARE, 2006, 29 (05) : 1130 - 1139
  • [2] Abdullah K., 2019, Int J Diabetes Metab, V25, P39, DOI [10.1159/000500913, DOI 10.1159/000500913]
  • [3] Alberti KGMM, 1998, DIABETIC MED, V15, P539, DOI 10.1002/(SICI)1096-9136(199807)15:7<539::AID-DIA668>3.0.CO
  • [4] 2-S
  • [5] The effect of dietary creatine supplementation on skeletal muscle metabolism in congestive heart failure
    Andrews, R
    Greenhaff, P
    Curtis, S
    Perry, A
    Cowley, AJ
    [J]. EUROPEAN HEART JOURNAL, 1998, 19 (04) : 617 - 622
  • [6] Hybridized sine cosine algorithm with convolutional neural networks dropout regularization application
    Bacanin, Nebojsa
    Zivkovic, Miodrag
    Al-Turjman, Fadi
    Venkatachalam, K.
    Trojovsky, Pavel
    Strumberger, Ivana
    Bezdan, Timea
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [7] Performance of a Novel Chaotic Firefly Algorithm with Enhanced Exploration for Tackling Global Optimization Problems: Application for Dropout Regularization
    Bacanin, Nebojsa
    Stoean, Ruxandra
    Zivkovic, Miodrag
    Petrovic, Aleksandar
    Rashid, Tarik A.
    Bezdan, Timea
    [J]. MATHEMATICS, 2021, 9 (21)
  • [8] An Interpretable Prediction Model for Identifying N7-Methylguanosine Sites Based on XGBoost and SHAP
    Bi, Yue
    Xiang, Dongxu
    Ge, Zongyuan
    Li, Fuyi
    Jia, Cangzhi
    Song, Jiangning
    [J]. MOLECULAR THERAPY-NUCLEIC ACIDS, 2020, 22 : 362 - 372
  • [9] Association of Sustained Blood Pressure Control with Lower Risk for High-Cost Multimorbidities Among Medicare Beneficiaries in ALLHAT
    Bowling, C. Barrett
    Sloane, Richard
    Pieper, Carl
    Luciano, Alison
    Davis, Barry R.
    Simpson, Lara M.
    Einhorn, Paula T.
    Oparil, Suzanne
    Muntner, Paul
    [J]. JOURNAL OF GENERAL INTERNAL MEDICINE, 2021, 36 (08) : 2221 - 2229
  • [10] Exploring the risk factors of impaired fasting glucose in middle-aged population living in South Korean communities by using categorical boosting machine
    Byeon, Haewon
    [J]. FRONTIERS IN ENDOCRINOLOGY, 2022, 13