A SuperLearner approach for predicting diabetic kidney disease upon the initial diagnosis of T2DM in hospital

被引:0
作者
Lin, Xiaomeng [1 ]
Liu, Chao [2 ,3 ]
Wang, Huaiyu [4 ]
Fan, Xiaohui [5 ]
Li, Linfeng [2 ]
Xu, Jiming [2 ]
Li, Changlin [6 ]
Wang, Yao [2 ]
Cai, Xudong [6 ]
Peng, Xin [1 ]
机构
[1] Zhejiang Chinese Med Univ, Ningbo Municipal Hosp Tradit Chinese Med TCM, Affiliated Hosp, Ningbo Inst Chinese Med Res, 819 Liyuan North Rd, Ningbo 315010, Peoples R China
[2] Yidu Cloud Technol Inc, Beijing 100083, Peoples R China
[3] Nanjing YiGenCloud Inst, Nanjing 211899, Peoples R China
[4] Beijing Univ Chinese Med, Natl Inst Tradit Chinese Med Constitut & Prevent T, Beijing 100029, Peoples R China
[5] Zhejiang Univ, Pharmaceut Informat Inst, Coll Pharmaceut Sci, Hangzhou 310058, Peoples R China
[6] Zhejiang Chinese Med Univ, Ningbo Municipal Hosp Tradit Chinese Med TCM, Affiliated Hosp, Dept Nephrol, Ningbo 315010, Peoples R China
关键词
Type; 2; diabetes; diabetic kidney disease; real-world data; machine learning; model interpretability; risk estimation; RISK; MANAGEMENT; DYSLIPIDEMIA; VALIDATION; DECLINE; MODEL;
D O I
10.1186/s12911-025-02977-x
中图分类号
R-058 [];
学科分类号
摘要
BackgroundDiabetic kidney disease (DKD) is a serious complication of diabetes mellitus (DM), with patients typically remaining asymptomatic until reaching an advanced stage. We aimed to develop and validate a predictive model for DKD in patients with an initial diagnosis of type 2 diabetes mellitus (T2DM) using real-world data.MethodsWe retrospectively examined data from 3,291 patients (1740 men, 1551 women) newly diagnosed with T2DM at Ningbo Municipal Hospital of Traditional Chinese Medicine (2011-2023). The dataset was randomly divided into training and validation cohorts. Forty-six readily available medical characteristics at initial diagnosis of T2DM from the electronic medical records were used to develop prediction models based on linear, non-linear, and SuperLearner approaches. Model performance was evaluated using the area under the curve (AUC). SHapley Additive exPlanation (SHAP) was used to interpret the best-performing models.ResultsAmong 3291 participants, 563 (17.1%) were diagnosed with DKD during median follow-up of 2.53 years. The SuperLearner model exhibited the highest AUC (0.7138, 95% confidence interval: [0.673, 0.7546]) for the holdout internal validation set in predicting any DKD stage. Top-ranked features were WBC_Cnt*, Neut_Cnt, Hct, and Hb. High WBC_Cnt, low Neut_Cnt, high Hct, and low Hb levels were associated with an increased risk of DKD.ConclusionsWe developed and validated a DKD risk prediction model for patients with newly diagnosed T2DM. Using routinely available clinical measurements, the SuperLearner model could predict DKD during hospital visits. Prediction accuracy and SHAP-based model interpretability may help improve early detection, targeted interventions, and prognosis of patients with DM.
引用
收藏
页数:11
相关论文
共 53 条
[1]   Prediction of diabetic kidney disease with machine learning algorithms, upon the initial diagnosis of type 2 diabetes mellitus [J].
Allen, Angier ;
Iqbal, Zohora ;
Green-Saxena, Abigail ;
Hurtado, Myrna ;
Hoffman, Jana ;
Mao, Qingqing ;
Das, Ritankar .
BMJ OPEN DIABETES RESEARCH & CARE, 2022, 10 (01)
[2]  
Bouhairie Victoria E, 2016, Mo Med, V113, P390
[3]  
Braun Michael M, 2021, FP Essent, V509, P26
[4]  
Chinese Diabetes Society, 2002, Zhongguo Yi Xue Ke Xue Yuan Xue Bao, V24, P447
[5]   Three machine learning algorithms and their utility in exploring risk factors associated with primary cesarean section in low-risk women: A methods paper [J].
Clark, Rebecca R. S. ;
Hou, Jintong .
RESEARCH IN NURSING & HEALTH, 2021, 44 (03) :559-570
[6]   A systematic review finds prediction models for chronic kidney disease were poorly reported and often developed using inappropriate methods [J].
Collins, Gary S. ;
Omar, Omar ;
Shanyinde, Milensu ;
Yu, Ly-Mee .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2013, 66 (03) :268-277
[7]   Diabetic Dyslipidemia Review: An Update on Current Concepts and Management Guidelines of Diabetic Dyslipidemia [J].
Dake, Andrew W. ;
Sora, Nicoleta D. .
AMERICAN JOURNAL OF THE MEDICAL SCIENCES, 2016, 351 (04) :361-365
[8]   Prediction of 3-year risk of diabetic kidney disease using machine learning based on electronic medical records [J].
Dong, Zheyi ;
Wang, Qian ;
Ke, Yujing ;
Zhang, Weiguang ;
Hong, Quan ;
Liu, Chao ;
Liu, Xiaomin ;
Yang, Jian ;
Xi, Yue ;
Shi, Jinlong ;
Zhang, Li ;
Zheng, Ying ;
Lv, Qiang ;
Wang, Yong ;
Wu, Jie ;
Sun, Xuefeng ;
Cai, Guangyan ;
Qiao, Shen ;
Yin, Chengliang ;
Su, Shibin ;
Chen, Xiangmei .
JOURNAL OF TRANSLATIONAL MEDICINE, 2022, 20 (01)
[9]  
ElSayed NA, 2023, DIABETES CARE, V46, pS191, DOI 10.2337/dc23-S011
[10]  
Gheith Osama, 2016, J Nephropharmacol, V5, P49