Machine Learning for Predicting the 3-Year Risk of Incident Diabetes in Chinese Adults

被引:19
作者
Wu, Yang [1 ,2 ,3 ]
Hu, Haofei [3 ,4 ,5 ]
Cai, Jinlin [1 ,2 ,6 ]
Chen, Runtian [1 ,2 ,3 ]
Zuo, Xin [7 ]
Cheng, Heng [7 ]
Yan, Dewen [1 ,2 ,3 ]
机构
[1] Shenzhen Univ, Affiliated Hosp 1, Dept Endocrinol, Shenzhen, Peoples R China
[2] Shenzhen Second Peoples Hosp, Dept Endocrinol, Shenzhen, Peoples R China
[3] Shenzhen Univ, Hlth Sci Ctr, Shenzhen, Peoples R China
[4] Shenzhen Univ, Affiliated Hosp 1, Dept Nephrol, Shenzhen, Peoples R China
[5] Shenzhen Second Peoples Hosp, Dept Nephrol, Shenzhen, Peoples R China
[6] Shantou Univ, Med Coll, Shantou, Peoples R China
[7] Third Peoples Hosp Shenzhen, Dept Endocrinol, Shenzhen, Peoples R China
关键词
machine learning; extreme gradient boosting; simple stepwise model; Incident diabetes; risk; TYPE-2; MELLITUS; MODELS; COMPLICATIONS; NOMOGRAM; TRENDS; IMPACT; BMI;
D O I
10.3389/fpubh.2021.626331
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Purpose: We aimed to establish and validate a risk assessment system that combines demographic and clinical variables to predict the 3-year risk of incident diabetes in Chinese adults. Methods: A 3-year cohort study was performed on 15,928 Chinese adults without diabetes at baseline. All participants were randomly divided into a training set (n = 7,940) and a validation set (n = 7,988). XGBoost method is an effective machine learning technique used to select the most important variables from candidate variables. And we further established a stepwise model based on the predictors chosen by the XGBoost model. The area under the receiver operating characteristic curve (AUC), decision curve and calibration analysis were used to assess discrimination, clinical use and calibration of the model, respectively. The external validation was performed on a cohort of 11,113 Japanese participants. Result: In the training and validation sets, 148 and 145 incident diabetes cases occurred. XGBoost methods selected the 10 most important variables from 15 candidate variables. Fasting plasma glucose (FPG), body mass index (BMI) and age were the top 3 important variables. And we further established a stepwise model and a prediction nomogram. The AUCs of the stepwise model were 0.933 and 0.910 in the training and validation sets, respectively. The Hosmer-Lemeshow test showed a perfect fit between the predicted diabetes risk and the observed diabetes risk (p = 0.068 for the training set, p = 0.165 for the validation set). Decision curve analysis presented the clinical use of the stepwise model and there was a wide range of alternative threshold probability spectrum. And there were almost no the interactions between these predictors (most P-values for interaction >0.05). Furthermore, the AUC for the external validation set was 0.830, and the Hosmer-Lemeshow test for the external validation set showed no statistically significant difference between the predicted diabetes risk and observed diabetes risk (P = 0.824). Conclusion: We established and validated a risk assessment system for characterizing the 3-year risk of incident diabetes.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Traditional Methods Hold Their Ground Against Machine Learning in Predicting Potentially Inappropriate Medication Use in Older Adults
    Chiu, Yohann Moanahere
    Sirois, Caroline
    Simard, Marc
    Gagnon, Marie-Eve
    Talbot, Denis
    VALUE IN HEALTH, 2024, 27 (10) : 1393 - 1399
  • [42] Machine learning classifiers for predicting 3-year progression-free survival and overall survival in patients with gliomas after surgery
    Zhang, Bin
    Yan, Jing
    Chen, Weiqi
    Dong, Yuhao
    Zhang, Lu
    Mo, Xiaokai
    Chen, Qiuying
    Cheng, Jingliang
    Liu, Xianzhi
    Wang, Weiwei
    Zhang, Zhenyu
    Zhang, Shuixing
    JOURNAL OF CANCER, 2021, 12 (06): : 1604 - 1615
  • [43] Degree of Blood Pressure Control and Incident Diabetes Mellitus in Chinese Adults With Hypertension
    Zhang, Yuanyuan
    Nie, Jing
    Zhang, Yan
    Li, Jianping
    Liang, Min
    Wang, Guobao
    Tian, Jianwei
    Liu, Chengzhang
    Wang, Binyan
    Cui, Yimin
    Wang, Xiaobin
    Huo, Yong
    Xu, Xiping
    Hou, Fan Fan
    Qin, Xianhui
    JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2020, 9 (16):
  • [44] Primary Care Diabetes Bundle Management: 3-Year Outcomes for Microvascular and Macrovascular Events
    Bloom, Frederick J., Jr.
    Yan, Xiaowei
    Stewart, Walter F.
    Graf, Thomas R.
    Anderer, Tammy
    Davis, Duane E.
    Pierdon, Steven B.
    Pitcavage, James
    Steele, Glenn D., Jr.
    AMERICAN JOURNAL OF MANAGED CARE, 2014, 20 (06) : E175 - E182
  • [45] Associations Between Serum Free Fatty Acid Levels and Incident Diabetes in a 3-Year Cohort Study
    Li, Qihang
    Zhao, Meng
    Wang, Yupeng
    Zhong, Fang
    Liu, Jing
    Gao, Ling
    Zhao, Jiajun
    DIABETES METABOLIC SYNDROME AND OBESITY-TARGETS AND THERAPY, 2021, 14 : 2743 - 2751
  • [46] Serum uric acid and risk of incident diabetes in middle-aged and elderly Chinese adults: prospective cohort study
    Cheng, Di
    Hu, Chunyan
    Du, Rui
    Qi, Hongyan
    Lin, Lin
    Wu, Xueyan
    Ma, Lina
    Peng, Kui
    Li, Mian
    Xu, Min
    Xu, Yu
    Bi, Yufang
    Wang, Weiqing
    Chen, Yuhong
    Lu, Jieli
    FRONTIERS OF MEDICINE, 2020, 14 (06) : 802 - 810
  • [47] Predicting the risk of cancer in adults using supervised machine learning: a scoping review
    Alfayez, Asma Abdullah
    Kunz, Holger
    Lai, Alvina Grace
    BMJ OPEN, 2021, 11 (09):
  • [48] Predicting postoperative pulmonary infection risk in patients with diabetes using machine learning
    Zhao, Chunxiu
    Xiang, Bingbing
    Zhang, Jie
    Yang, Pingliang
    Liu, Qiaoli
    Wang, Shun
    FRONTIERS IN PHYSIOLOGY, 2024, 15
  • [49] Predicting dyslipidemia in Chinese elderly adults using dietary behaviours and machine learning algorithms
    Wang, Biying
    Lin, Luotao
    Wang, Wenjun
    Song, Hualing
    Xu, Xianglong
    PUBLIC HEALTH, 2025, 238 : 274 - 279
  • [50] Predicting systemic risk of banks: a machine learning approach
    Kumar, Gaurav
    Rahman, Molla Ramizur
    Rajverma, Abhinav
    Misra, Arun Kumar
    JOURNAL OF MODELLING IN MANAGEMENT, 2024, 19 (02) : 441 - 469