PrOsteoporosis: predicting osteoporosis risk using NHANES data and machine learning approach

Authors
Si, Zebing [1, 2]
Zhang, Di [3]
Wang, Huajun [1]
Zheng, Xiaofei [1]
Affiliations
[1] Jinan Univ, Affiliated Hosp 1, Guangdong Prov Key Lab Speed Capabil, Dept Sports Med,Guangzhou Key Lab Precis Orthoped, Guangzhou 510630, Peoples R China
[2] Yuebei Peoples Hosp, Dept Orthoped, 133 Shaoguan Huimin South Ave, Shaoguan 512026, Peoples R China
[3] Shaoguan Univ, Country Coll Informat Sci & Engn, Shaoguan, Guangdong, Peoples R China
Keywords
Osteoporosis; Machine learning; Risk; Model
DOI
10.1186/s13104-025-07089-3
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
Objectives: Osteoporosis, prevalent among the elderly population, is primarily diagnosed through bone mineral density (BMD) testing, which has limitations in early detection. This study aims to develop and validate a machine learning approach for osteoporosis identification by integrating demographic, laboratory, and questionnaire data, offering a more practical and effective screening alternative.
Methods: Data from the National Health and Nutrition Examination Survey (NHANES) were analyzed to explore factors linked to osteoporosis. After cleaning, 8766 participants with 223 variables were studied. Minimum Redundancy Maximum Relevance and SelectKBest were employed to select the important features. Four machine learning algorithms (random forest (RF), neural network (NN), LightGBM, and XGBoost) were applied to identify osteoporosis, and their performance was compared. Classes were balanced using SMOTE, and metrics such as F1 score and AUC were evaluated for each algorithm.
Results: The LightGBM model outperformed the others, with an F1 score of 0.914, an MCC of 0.831, and an AUC of 0.970 on the training set. On the test set, it achieved an F1 score of 0.912, an MCC of 0.826, and an AUC of 0.972. The top predictors for osteoporosis were height, age, and sex.
Conclusions: This study demonstrates the potential of machine learning models in assessing an individual's risk of developing osteoporosis, a condition that significantly impacts quality of life and imposes substantial healthcare costs. The superior performance of the LightGBM model suggests a promising tool for early detection and personalized prevention strategies. Importantly, identifying height, age, and sex as top predictors offers insight into the demographic and physiological factors that clinicians should consider when evaluating patients' risk profiles.
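The abstract reports three evaluation metrics: F1 score, Matthews correlation coefficient (MCC), and area under the ROC curve (AUC). As a rough illustration of how these are defined for a binary classifier, the following sketch computes them in plain Python. The labels and scores are made-up toy values, not data from the study, and the function names are hypothetical; this is not the authors' evaluation code.

```python
import math

def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary labels (1 = positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, fp, tn, fn

def f1_score(tp, fp, fn):
    """F1 = 2*TP / (2*TP + FP + FN), the harmonic mean of precision and recall."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def mcc(tp, fp, tn, fn):
    """Matthews correlation coefficient; returns 0.0 when the denominator is zero."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0

def roc_auc(y_true, scores):
    """AUC as the probability that a random positive outscores a random negative.

    Ties count as half a win. Quadratic in sample size, so only for small inputs.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

# Toy data (assumed for illustration only): 1 = osteoporosis, 0 = no osteoporosis.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.1, 0.7, 0.2]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # 0.5 threshold is an assumption

tp, fp, tn, fn = confusion_counts(y_true, y_pred)
f1_value = f1_score(tp, fp, fn)
mcc_value = mcc(tp, fp, tn, fn)
auc_value = roc_auc(y_true, scores)
print(f"F1={f1_value:.3f}  MCC={mcc_value:.3f}  AUC={auc_value:.4f}")
```

F1 and MCC depend on the chosen decision threshold, whereas AUC is computed from the raw scores, which is one reason an abstract like this one reports them together.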
Pages: 10