A 10-year retrospective cohort of diabetic patients in a large medical institution: Utilizing multiple machine learning models for diabetic kidney disease prediction

被引:0
作者
Li, Guangpu [1 ,2 ,3 ,4 ]
Li, Jia [2 ,3 ,4 ]
Tian, Fei [1 ]
Ren, Jingjing [5 ,6 ]
Guo, Zuishuang [1 ]
Pan, Shaokang [1 ]
Liu, Dongwei [1 ]
Duan, Jiayu [1 ,2 ,3 ,4 ]
Liu, Zhangsuo [1 ,2 ,3 ,4 ]
机构
[1] Zhengzhou Univ, Affiliated Hosp 1, Dept Integrated Tradit & Western Nephrol, Zhengzhou, Peoples R China
[2] Zhengzhou Univ, Res Inst Nephrol, Zhengzhou, Peoples R China
[3] Henan Prov Res Ctr Kidney Dis, Zhengzhou, Peoples R China
[4] Key Lab Precis Diag & Treatment Chron Kidney Dis H, Zhengzhou, Peoples R China
[5] Brigham & Womens Hosp, Dept Med, Boston, MA USA
[6] Harvard Med Sch, Boston, MA USA
来源
DIGITAL HEALTH | 2024年 / 10卷
基金
中国国家自然科学基金;
关键词
Diabetic kidney disease; machine learning; diabetes mellitus; LIFE-STYLE; RETINOPATHY; COMPLICATIONS; PREVALENCE;
D O I
10.1177/20552076241265220
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective As the prevalence of diabetes steadily increases, the burden of diabetic kidney disease (DKD) is also intensifying. In response, we have utilized a 10-year diabetes cohort from our medical center to train machine learning-based models for predicting DKD and interpreting relevant factors.Methods Employing a large dataset from 73,101 hospitalized type 2 diabetes patients at The First Affiliated Hospital of Zhengzhou University, we analyzed demographic and medication data. Machine learning models, including XGBoost, CatBoost, LightGBM, Random Forest, AdaBoost, GBDT (gradient boosting decision tree), and SGD (stochastic gradient descent), were trained on these data, focusing on interpretability by SHAP. SHAP explains the output of the models by assigning an importance value to each feature for a particular prediction, enabling a clear understanding of how individual features influence the prediction outcomes.Results The XGBoost model achieved an area under the curve (AUC) of 0.95 and an area under the precision-recall curve (AUPR) of 0.76, while CatBoost recorded an AUC of 0.97 and an AUPR of 0.84. These results underscore the effectiveness of these models in predicting DKD in patients with type 2 diabetes.Conclusions This study provides a comprehensive approach for predicting DKD in patients with type 2 diabetes, employing machine learning techniques. The findings are crucial for the early detection and intervention of DKD, offering a roadmap for future research and healthcare strategies in diabetes management. Additionally, the presence of non-diabetic kidney diseases and diabetes with complications was identified as significant factors in the development of DKD.
引用
收藏
页数:11
相关论文
共 41 条
[1]   Optuna: A Next-generation Hyperparameter Optimization Framework [J].
Akiba, Takuya ;
Sano, Shotaro ;
Yanase, Toshihiko ;
Ohta, Takeru ;
Koyama, Masanori .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2623-2631
[2]  
[Anonymous], 2016, International Statistical Classification of Diseases and Related Health Problems 10th Revision
[3]  
Association A.D., 2021, DIABETES CARE, V44, pS7 S14
[4]   Albuminuric diabetic kidney disease predicts foot ulcers in type 2 diabetes [J].
Borderie, Gauthier ;
Foussard, Ninon ;
Larroumet, Alice ;
Blanco, Laurence ;
Mohammedi, Kamel ;
Ducasse, Eric ;
Caradu, Caroline ;
Rigalleau, Vincent .
JOURNAL OF DIABETES AND ITS COMPLICATIONS, 2023, 37 (02)
[5]  
[曹莹 Cao Ying], 2013, [自动化学报, Acta Automatica Sinica], V39, P745
[6]   Does a strict glycemic control during acute coronary syndrome play a cardioprotective effect? Pathophysiology and clinical evidence [J].
Caturano, Alfredo ;
Galiero, Raffaele ;
Pafundi, Pia Clara ;
Cesaro, Arturo ;
Vetrano, Erica ;
Palmiero, Giuseppe ;
Rinaldi, Luca ;
Salvatore, Teresa ;
Marfella, Raffaele ;
Sardu, Celestino ;
Moscarella, Elisabetta ;
Gragnano, Felice ;
Calabro, Paolo ;
Sasso, Ferdinando Carlo .
DIABETES RESEARCH AND CLINICAL PRACTICE, 2021, 178
[7]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[8]   Screening for chronic kidney disease: moving toward more sustainable health care [J].
Curtis, Sarah ;
Komenda, Paul .
CURRENT OPINION IN NEPHROLOGY AND HYPERTENSION, 2020, 29 (03) :333-338
[9]   Temporal Trends in the Prevalence of Diabetic Kidney Disease in the United States [J].
de Boer, Ian H. ;
Rue, Tessa C. ;
Hall, Yoshio N. ;
Heagerty, Patrick J. ;
Weiss, Noel S. ;
Himmelfarb, Jonathan .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2011, 305 (24) :2532-2539
[10]   Prevalence and risk factors of chronic kidney disease and diabetic kidney disease in a central Chinese urban population: a cross-sectional survey [J].
Duan, Jia-Yu ;
Duan, Guang-Cai ;
Wang, Chong-Jian ;
Liu, Dong-Wei ;
Qiao, Ying-Jin ;
Pan, Shao-Kang ;
Jiang, Deng-Ke ;
Liu, Yong ;
Zhao, Zi-Hao ;
Liang, Lu-Lu ;
Tian, Fei ;
Liu, Zhang-Suo .
BMC NEPHROLOGY, 2020, 21 (01)