Machine learning-based prediction of vitamin D deficiency: NHANES 2001-2018

Cited by: 4
Authors
Guo, Jiale [1]
He, Qionghan [2]
Li, Yehai [1]
Affiliations
[1] Anhui Med Univ, Chaohu Hosp, Dept Orthoped, Hefei, Peoples R China
[2] Anhui Med Univ, Dept Infect, Chaohu Hosp, Hefei, Peoples R China
Source
FRONTIERS IN ENDOCRINOLOGY | 2024 / Vol. 15
Keywords
machine learning; vitamin D deficiency; clinical decision rules; nutrition surveys; public health; INSUFFICIENCY; HOMEOSTASIS; PREVALENCE;
DOI
10.3389/fendo.2024.1327058
Chinese Library Classification
R5 [Internal Medicine];
Discipline classification code
1002 ; 100201 ;
Abstract
Background: Vitamin D deficiency is strongly associated with the development of several diseases. Given the current global pandemic of vitamin D deficiency, it is critical to identify people at high risk. No tools exist for predicting the risk of vitamin D deficiency in the general community population. This study aims to use machine learning to predict that risk from data that can be obtained through simple interviews in the community.

Methods: The National Health and Nutrition Examination Survey (NHANES) 2001-2018 dataset was used for the analysis and was randomly divided into training and validation sets in a 70:30 ratio. GBM, LR, NNet, RF, SVM, and XGBoost methods were used to construct the models, and their performance was evaluated. The best-performing model was interpreted using SHAP values and further developed into an online web calculator.

Results: A total of 62,919 participants, all aged 2 years or older, were enrolled in the study, of whom 20,204 (32.1%) had vitamin D deficiency. The models constructed by each method were evaluated using AUC as the primary evaluation statistic and ACC, PPV, NPV, SEN, SPE, F1 score, MCC, Kappa, and Brier score as secondary evaluation statistics. The XGBoost-based model performed best, with near-perfect performance. The summary plot of SHAP values shows that the three most important features for this model are race, age, and BMI. An online web calculator based on this model can easily and quickly predict the risk of vitamin D deficiency.

Conclusion: In this study, the XGBoost-based prediction tool performed flawlessly and was highly accurate in predicting the risk of vitamin D deficiency in community populations.
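The evaluation statistics named in the abstract (AUC, ACC, PPV, NPV, SEN, SPE, F1 score, MCC, Kappa, and Brier score) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the 0.5 decision threshold, and the toy labels and probabilities are assumptions made purely for illustration, and no NHANES data is used.

```python
import math


def confusion_counts(y_true, y_prob, threshold=0.5):
    """Count TP, TN, FP, FN after thresholding predicted probabilities.

    The 0.5 threshold is an illustrative assumption, not taken from the paper.
    """
    y_pred = [1 if p >= threshold else 0 for p in y_prob]
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def binary_metrics(tp, tn, fp, fn):
    """Threshold-based metrics from confusion-matrix counts.

    No guard against zero denominators: every row and column of the
    confusion matrix is assumed to be non-empty.
    """
    n = tp + tn + fp + fn
    acc = (tp + tn) / n
    ppv = tp / (tp + fp)  # positive predictive value (precision)
    npv = tn / (tn + fn)  # negative predictive value
    sen = tp / (tp + fn)  # sensitivity (recall)
    spe = tn / (tn + fp)  # specificity
    f1 = 2 * ppv * sen / (ppv + sen)
    mcc = (tp * tn - fp * fn) / math.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    )
    # Cohen's kappa: observed agreement corrected for chance agreement.
    p_e = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / n**2
    kappa = (acc - p_e) / (1 - p_e)
    return {"ACC": acc, "PPV": ppv, "NPV": npv, "SEN": sen,
            "SPE": spe, "F1": f1, "MCC": mcc, "Kappa": kappa}


def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and outcome."""
    return sum((p - t) ** 2 for t, p in zip(y_true, y_prob)) / len(y_true)


def auc(y_true, y_prob):
    """AUC as the share of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [p for t, p in zip(y_true, y_prob) if t == 1]
    neg = [p for t, p in zip(y_true, y_prob) if t == 0]
    wins = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in pos for b in neg)
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical): 1 = deficient, 0 = not deficient.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_prob = [0.9, 0.8, 0.4, 0.2, 0.1, 0.6, 0.3, 0.7]

metrics = binary_metrics(*confusion_counts(y_true, y_prob))
metrics["AUC"] = auc(y_true, y_prob)
metrics["Brier"] = brier_score(y_true, y_prob)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

AUC and the Brier score are computed from the predicted probabilities, whereas the other statistics depend on the chosen threshold, which is one reason a threshold-free measure such as AUC is commonly used as the primary statistic.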
Pages: 9