Machine learning algorithms for predicting the risk of fracture in patients with diabetes in China

被引:4
作者
Chu, Sijia [1 ,2 ]
Jiang, Aijun [1 ]
Chen, Lyuzhou [3 ]
Zhang, Xi [4 ]
Shen, Xiurong [5 ]
Zhou, Wan [1 ]
Ye, Shandong [1 ]
Chen, Chao [1 ]
Zhang, Shilu [1 ,2 ]
Zhang, Li [1 ,2 ]
Chen, Yang [1 ,6 ]
Miao, Ya [7 ]
Wang, Wei [1 ]
机构
[1] Univ Sci & Technol China, Affiliated Hosp USTC 1, Dept Endocrinol, Div Life Sci & Med, Hefei 230001, Anhui, Peoples R China
[2] Wannan Med Coll, Grad Sch, Wuhu, Peoples R China
[3] Univ Sci & Technol China, Sch Data Sci, Hefei, Peoples R China
[4] Peoples Hosp Chizhou, Dept Endocrinol, Chizhou, Peoples R China
[5] Fuyang Peoples Hosp, Fuyang, Peoples R China
[6] Anhui Med Univ, Grad Sch, Hefei, Peoples R China
[7] Univ Sci & Technol China, Inst Adv Technol, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
Diabetes; Feature selection; Machine learning; Fracture; Risk prediction; BONE-MINERAL DENSITY; CLASSIFICATION; MELLITUS; TURNOVER; WOMEN; FRAX;
D O I
10.1016/j.heliyon.2023.e18186
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Patients with diabetes are more likely to be predisposed to fractures compared to those without diabetes. In clinical practice, predicting fracture risk in diabetics is still difficult because of the limited availability and accessibility of existing fracture prediction tools in the diabetic population. The purpose of this study was to develop and validate models using machine learning (ML) algorithms to achieve high predictive power for fracture in patients with diabetes in China. Methods: In this study, the clinical data of 775 hospitalized patients with diabetes was analyzed by using Decision Tree (DT), Gradient Boosting Decision Tree (GBDT), Logistic Regression (LR), Random Forest (RF), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGBoost) and Probabilistic Classification Vector Machines (PCVM) algorithms to construct risk prediction models for fractures. Moreover, the risk factors for diabetes-related fracture were identified by the feature selection algorithms. Results: The ML algorithms extracted 17 most relevant factors from raw clinical data to maximize the accuracy of the prediction results, including bone mineral density, age, sex, weight, highdensity lipoprotein cholesterol, height, duration of diabetes, total cholesterol, osteocalcin, Nterminal propeptide of type I, diastolic blood pressure, and body mass index. The 7 ML models including LR, SVM, RF, DT, GBDT, XGBoost, and PCVM had f1 scores of 0.75, 0.83, 0.84, 0.85, 0.87, 0.88, and 0.97, respectively. Conclusions: This study identified 17 most relevant risk factors for diabetes-related fracture using ML algorithms. And the PCVM model proved to perform best in predicting the fracture risk in the diabetic population. This work proposes a cheap, safe, and extensible ML algorithm for the precise assessment of risk factors for diabetes-related fracture.
引用
收藏
页数:13
相关论文
共 70 条
[1]   Performance of the Garvan Fracture Risk Calculator in Individuals with Diabetes: A Registry-Based Cohort Study [J].
Agarwal, Arnav ;
Leslie, William D. ;
Nguyen, Tuan, V ;
Morin, Suzanne N. ;
Lix, Lisa M. ;
Eisman, John A. .
CALCIFIED TISSUE INTERNATIONAL, 2022, 110 (06) :658-665
[2]   Applying hybrid support vector regression and genetic algorithm to water alternating CO2 gas EOR [J].
Amar, Menad Nait ;
Zeraibi, Noureddine ;
Jahanbani Ghahfarokhi, Ashkan .
GREENHOUSE GASES-SCIENCE AND TECHNOLOGY, 2020, 10 (03) :613-630
[3]   A combined support vector regression with firefly algorithm for prediction of bottom hole pressure [J].
Amar, Menad Nait ;
Zeraibi, Noureddine .
SN APPLIED SCIENCES, 2020, 2 (01)
[4]  
[Anonymous], 2018, CHIN J OSTEOPOROS BO, V38, P127, DOI [10.19538/j.nk2018020109, DOI 10.19538/J.NK2018020109]
[5]   Quality Evaluation of Triples in Knowledge Graph by Incorporating Internal With External Consistency [J].
Ban, Taiyu ;
Wang, Xiangyu ;
Chen, Lyuzhou ;
Wu, Xingyu ;
Chen, Qiuju ;
Chen, Huanhuan .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) :1980-1992
[6]   Random forest in remote sensing: A review of applications and future directions [J].
Belgiu, Mariana ;
Dragut, Lucian .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 114 :24-31
[7]   Semisupervised Negative Correlation Learning [J].
Chen, Huanhuan ;
Jiang, Bingbing ;
Yao, Xin .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) :5366-5379
[8]   Efficient Probabilistic Classification Vector Machine With Incremental Basis Function Selection [J].
Chen, Huanhuan ;
Tino, Peter ;
Yao, Xin .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (02) :356-369
[9]   Multiobjective Neural Network Ensembles Based on Regularized Negative Correlation Learning [J].
Chen, Huanhuan ;
Yao, Xin .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (12) :1738-1751
[10]   Regularized Negative Correlation Learning for Neural Network Ensembles [J].
Chen, Huanhuan ;
Yao, Xin .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (12) :1962-1979