Using machine learning techniques to predict the risk of osteoporosis based on nationwide chronic disease data

被引:7
|
作者
Tu, Jun-Bo [1 ]
Liao, Wei-Jie [2 ]
Liu, Wen-Cai [3 ]
Gao, Xing-Hua [4 ]
机构
[1] Xinfeng Cty Peoples Hosp, Dept Orthopaed, Xinfeng 341600, Jiangxi, Peoples R China
[2] GanZhou Peoples Hosp, Dept ICU, Ganzhou 341000, Jiangxi, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai Peoples Hosp 6, Sch Med, Dept Orthopaed, 600 Yishan Rd, Shanghai 200233, Peoples R China
[4] South China Univ Technol, Guangzhou Peoples Hosp 1, Dept Orthopaed, Guangzhou 510180, Peoples R China
关键词
Osteoporosis; Machine learning; Predict; Stacker; Chronic disease; BONE-MINERAL DENSITY; MANAGEMENT; HEALTH; CHOLESTEROL; FRACTURE; WOMEN;
D O I
10.1038/s41598-024-56114-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Osteoporosis is a major public health concern that significantly increases the risk of fractures. The aim of this study was to develop a Machine Learning based predictive model to screen individuals at high risk of osteoporosis based on chronic disease data, thus facilitating early detection and personalized management. A total of 10,000 complete patient records of primary healthcare data in the German Disease Analyzer database (IMS HEALTH) were included, of which 1293 diagnosed with osteoporosis and 8707 without the condition. The demographic characteristics and chronic disease data, including age, gender, lipid disorder, cancer, COPD, hypertension, heart failure, CHD, diabetes, chronic kidney disease, and stroke were collected from electronic health records. Ten different machine learning algorithms were employed to construct the predictive mode. The performance of the model was further validated and the relative importance of features in the model was analyzed. Out of the ten machine learning algorithms, the Stacker model based on Logistic Regression, AdaBoost Classifier, and Gradient Boosting Classifier demonstrated superior performance. The Stacker model demonstrated excellent performance through ten-fold cross-validation on the training set and ROC curve analysis on the test set. The confusion matrix, lift curve and calibration curves indicated that the Stacker model had optimal clinical utility. Further analysis on feature importance highlighted age, gender, lipid metabolism disorders, cancer, and COPD as the top five influential variables. In this study, a predictive model for osteoporosis based on chronic disease data was developed using machine learning. The model shows great potential in early detection and risk stratification of osteoporosis, ultimately facilitating personalized prevention and management strategies.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease
    Chan, Lili
    Nadkarni, Girish N.
    Fleming, Fergus
    McCullough, James R.
    Connolly, Patricia
    Mosoyan, Gohar
    El Salem, Fadi
    Kattan, Michael W.
    Vassalotti, Joseph A.
    Murphy, Barbara
    Donovan, Michael J.
    Coca, Steven G.
    Damrauer, Scott M.
    DIABETOLOGIA, 2021, 64 (07) : 1504 - 1515
  • [22] Comparison of Machine Learning Models to Predict Risk of Falling in Osteoporosis Elderly
    Cuaya-Simbro, German
    Perez-Sanpablo, Alberto-Isaac
    Munoz-Melendez, Angelica
    Quinones Uriostegui, Ivett
    Morales-Manzanares, Eduardo-F
    Nunez-Carrera, Lidia
    FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2020, 45 (02) : 65 - 77
  • [23] Risk assessment and prediction of nosocomial infections based on surveillance data using machine learning methods
    Chen, Ying
    Zhang, Yonghong
    Nie, Shuping
    Ning, Jie
    Wang, Qinjin
    Yuan, Hanmei
    Wu, Hui
    Li, Bin
    Hu, Wenbiao
    Wu, Chao
    BMC PUBLIC HEALTH, 2024, 24 (01)
  • [24] Identification of combined biomarkers for predicting the risk of osteoporosis using machine learning
    Zheng, Zhenlong
    Zhang, Xianglan
    Oh, Bong-Kyeong
    Kim, Ki-Yeol
    AGING-US, 2022, 14 (10): : 4270 - 4280
  • [25] Development and validation of common data model-based fracture prediction model using machine learning algorithm
    Kong, Sung Hye
    Kim, Sihyeon
    Kim, Yisak
    Kim, Jung Hee
    Kim, Kwangsoo
    Shin, Chan Soo
    OSTEOPOROSIS INTERNATIONAL, 2023, 34 (08) : 1437 - 1451
  • [26] Integrating diverse data sources to predict disease risk in dairy cattle-a machine learning approach
    Lasser, Jana
    Matzhold, Caspar
    Egger-Danner, Christa
    Fuerst-Waltl, Birgit
    Steininger, Franz
    Wittek, Thomas
    Klimek, Peter
    JOURNAL OF ANIMAL SCIENCE, 2021, 99 (11)
  • [27] Predict the Price of Gold Based on Machine Learning Techniques
    Zhu, Han-chao
    Wang, Dong
    INTERNATIONAL CONFERENCE ON MATHEMATICS, MODELLING AND SIMULATION TECHNOLOGIES AND APPLICATIONS (MMSTA 2017), 2017, 215 : 615 - 622
  • [28] Using data mining techniques to predict chronic kidney disease: A review study
    Sattari, Mohammad
    Mohammadi, Maryam
    INTERNATIONAL JOURNAL OF PREVENTIVE MEDICINE, 2023, 14 (01)
  • [29] Iron Deficiency Anemia as a Risk Factor for Osteoporosis in Taiwan: A Nationwide Population-Based Study
    Pan, Mei-Lien
    Chen, Li-Ru
    Tsao, Hsiao-Mei
    Chen, Kuo-Hu
    NUTRIENTS, 2017, 9 (06):
  • [30] Data dissemination approach using machine learning techniques for WBANs
    Punj, Roopali
    Kumar, Rakesh
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (05)