Machine learning-based prediction of vitamin D deficiency: NHANES 2001-2018

Cited by: 4
Authors
Guo, Jiale [1]
He, Qionghan [2]
Li, Yehai [1]
Affiliations
[1] Anhui Med Univ, Chaohu Hosp, Dept Orthoped, Hefei, Peoples R China
[2] Anhui Med Univ, Dept Infect, Chaohu Hosp, Hefei, Peoples R China
Source
FRONTIERS IN ENDOCRINOLOGY | 2024 / Vol. 15
Keywords
machine learning; vitamin D deficiency; clinical decision rules; nutrition surveys; public health; INSUFFICIENCY; HOMEOSTASIS; PREVALENCE
DOI
10.3389/fendo.2024.1327058
Chinese Library Classification number
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract
Background: Vitamin D deficiency is strongly associated with the development of several diseases. Given the current global pandemic of vitamin D deficiency, it is critical to identify people at high risk. No tools currently exist for predicting the risk of vitamin D deficiency in the general community population. This study aims to use machine learning to predict that risk from data that can be obtained through simple interviews in the community.
Methods: The National Health and Nutrition Examination Survey (NHANES) 2001-2018 dataset was used for the analysis and was randomly divided into training and validation sets in a 70:30 ratio. Models were constructed using gradient boosting machine (GBM), logistic regression (LR), neural network (NNet), random forest (RF), support vector machine (SVM), and XGBoost methods, and their performance was evaluated. The best-performing model was interpreted using SHAP values and further developed into an online web calculator.
Results: A total of 62,919 participants aged 2 years and above were enrolled, of whom 20,204 (32.1%) had vitamin D deficiency. The models constructed by each method were evaluated using the area under the ROC curve (AUC) as the primary evaluation statistic, with accuracy (ACC), positive predictive value (PPV), negative predictive value (NPV), sensitivity (SEN), specificity (SPE), F1 score, Matthews correlation coefficient (MCC), Kappa, and Brier score as secondary evaluation statistics. The XGBoost-based model performed best, with near-perfect performance. The summary plot of SHAP values shows that the three most important features for this model are race, age, and BMI. An online web calculator based on this model can easily and quickly predict the risk of vitamin D deficiency.
Conclusion: In this study, the XGBoost-based prediction tool performed flawlessly and was highly accurate in predicting the risk of vitamin D deficiency in community populations.
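The evaluation workflow named in the abstract (a random 70:30 split, then AUC plus the secondary statistics) can be illustrated with a minimal standard-library sketch. This is not the authors' code: the function names `split_70_30`, `auc`, and `evaluate`, the fixed random seed, and the 0.5 decision threshold are all assumptions made for illustration, and the sketch assumes both classes are present and both are predicted at least once (otherwise some ratios divide by zero).

```python
import math
import random


def split_70_30(rows, seed=0):
    """Randomly split rows into 70% training and 30% validation (hypothetical helper)."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(round(0.7 * len(shuffled)))
    return shuffled[:cut], shuffled[cut:]


def auc(y_true, y_prob):
    """AUC as the chance a random positive scores above a random negative (ties count 0.5)."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def evaluate(y_true, y_prob, threshold=0.5):
    """Return the statistics listed in the abstract for binary labels (1 = deficient)."""
    # Assumed threshold: a probability >= threshold is classified as deficient.
    y_pred = [1 if p >= threshold else 0 for p in y_prob]
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for y, yh in pairs if y == 1 and yh == 1)
    tn = sum(1 for y, yh in pairs if y == 0 and yh == 0)
    fp = sum(1 for y, yh in pairs if y == 0 and yh == 1)
    fn = sum(1 for y, yh in pairs if y == 1 and yh == 0)
    n = tp + tn + fp + fn

    acc = (tp + tn) / n
    ppv = tp / (tp + fp)  # positive predictive value (precision)
    npv = tn / (tn + fn)  # negative predictive value
    sen = tp / (tp + fn)  # sensitivity (recall)
    spe = tn / (tn + fp)  # specificity
    f1 = 2 * ppv * sen / (ppv + sen)
    mcc = (tp * tn - fp * fn) / math.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    )
    # Cohen's kappa: observed agreement corrected for chance agreement.
    p_chance = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / (n * n)
    kappa = (acc - p_chance) / (1 - p_chance)
    # Brier score: mean squared error of the predicted probabilities.
    brier = sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / n

    return {
        "AUC": auc(y_true, y_prob), "ACC": acc, "PPV": ppv, "NPV": npv,
        "SEN": sen, "SPE": spe, "F1": f1, "MCC": mcc,
        "Kappa": kappa, "Brier": brier,
    }
```

In practice `y_prob` would be the validation-set probabilities produced by a fitted classifier; AUC and the Brier score use those probabilities directly, while the remaining statistics depend on the chosen threshold.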
Pages: 9