Development and validation of a prediction model for ED using machine learning: according to NHANES 2001-2004

被引:1
|
作者
Chen, Xing-Yu [1 ,2 ]
Lu, Wen-Ting [3 ]
Zhang, Di [4 ]
Tan, Mo-Yao [5 ]
Qin, Xin [1 ,2 ]
机构
[1] Chengdu Integrated TCM, Chengdu, Sichuan, Peoples R China
[2] Western Med Hosp, Chengdu, Sichuan, Peoples R China
[3] XinDu Hosp Tradit Chinese Med, Chengdu, Sichuan, Peoples R China
[4] Sichuan Univ, West China Sch Pharm, Chengdu, Sichuan, Peoples R China
[5] Chengdu Univ Tradit Chinese Med, Chengdu, Sichuan, Peoples R China
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Erectile Dysfunction; Machine learning; XGBoost; National Health and Nutrition Examination Survey; Prediction model; URINARY-TRACT SYMPTOMS; ERECTILE DYSFUNCTION; CARDIOVASCULAR-DISEASE; OXIDATIVE STRESS; NEURAL-NETWORKS; MEN; PREVALENCE; DIAGNOSIS; TESTOSTERONE; CLASSIFICATION;
D O I
10.1038/s41598-024-78797-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Erectile Dysfunction (ED) is a form of sexual dysfunction in males that imposes significant health and financial burdens globally. Despite its high prevalence, diagnosing ED remains challenging due to the limitations of current diagnostic methods and patients' reluctance to seek medical help. Currently, some studies have used machine learning techniques for developing ED prediction models, but the performance and interpretability of existing models need to be further improved. This study utilized data from the National Health and Nutrition Examination Survey (NHANES) for the years 2001 to 2004, adhering to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) statement. After excluding male respondents who did not meet the study criteria, a total of 3,869 participants were included. Gradient boosting decision tree (GBDT) algorithms (XGBoost, CatBoost, LightGBM) were used to develop the ED prediction model. Data preprocessing, feature selection, model evaluation, and interpretability analysis were performed to ensure the reliability and effectiveness of the model. The model evaluation results revealed that the AUC values are XGBoost: 0.887 +/- 0.016; LightGBM: 0.879 +/- 0.016; CatBoost: 0.871 +/- 0.019. The F1-Scores are XGBoost: 0.695 +/- 0.023; LightGBM: 0.681 +/- 0.025; CatBoost: 0.681 +/- 0.025. The Recall values are XGBoost: 0.789 +/- 0.026; LightGBM: 0.739 +/- 0.030; CatBoost: 0.711 +/- 0.030. These results confirmed that the XGBoost model is the best-performing ED prediction model in this study. Interpretability analysis results of the XGBoost model showed that age, obesity, cardiovascular risk factors, prostate-related diseases, and socioeconomic status are key features for predicting ED, playing a significant role in the ED mechanism. Therefore, we believe the ED prediction model trained in this study has strong predictive performance and high interpretability. This model can help to expand the diagnostic options for ED, improve the diagnosis rate of ED, and assist doctors in early intervention for patients with ED, ultimately improving patient prognosis.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] A comprehensive analysis of erectile dysfunction prevalence and the impact of prostate conditions on ED among US adults: evidence from NHANES 2001-2004
    Zhang, Yuhao
    Zang, Nan
    Xiang, Yingyue
    Lin, Fanlu
    Liu, Xue
    Zhang, Jing
    FRONTIERS IN ENDOCRINOLOGY, 2025, 15
  • [2] Development and validation of a clinical prediction model for glioma grade using machine learning
    Wu, Mingzhen
    Luan, Jixin
    Zhang, Di
    Fan, Hua
    Qiao, Lishan
    Zhang, Chuanchen
    TECHNOLOGY AND HEALTH CARE, 2024, 32 (03) : 1977 - 1990
  • [3] Development and Validation of an ICU-Venous Thromboembolism Prediction Model Using Machine Learning Approaches: A Multicenter Study
    Jin, Jie
    Lu, Jie
    Su, Xinyang
    Xiong, Yinhuan
    Ma, Shasha
    Kong, Yang
    Xu, Hongmei
    INTERNATIONAL JOURNAL OF GENERAL MEDICINE, 2024, 17 : 3279 - 3292
  • [4] Development and experimental validation of a machine learning model for the prediction of new antimalarials
    Kore, Mukul
    Acharya, Dimple
    Sharma, Lakshya
    Vembar, Shruthi Sridhar
    Sundriyal, Sandeep
    BMC CHEMISTRY, 2025, 19 (01)
  • [5] Development and Validation of a Predictive Model for Coronary Artery Disease Using Machine Learning
    Wang, Chen
    Zhao, Yue
    Jin, Bingyu
    Gan, Xuedong
    Liang, Bin
    Xiang, Yang
    Zhang, Xiaokang
    Lu, Zhibing
    Zheng, Fang
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2021, 8
  • [6] Development and validation of common data model-based fracture prediction model using machine learning algorithm
    Kong, Sung Hye
    Kim, Sihyeon
    Kim, Yisak
    Kim, Jung Hee
    Kim, Kwangsoo
    Shin, Chan Soo
    OSTEOPOROSIS INTERNATIONAL, 2023, 34 (08) : 1437 - 1451
  • [7] Development and Validation of a Prediction Model for Elevated Arterial Stiffness in Chinese Patients With Diabetes Using Machine Learning
    Li, Qingqing
    Xie, Wenhui
    Li, Liping
    Wang, Lijing
    You, Qinyi
    Chen, Lu
    Li, Jing
    Ke, Yilang
    Fang, Jun
    Liu, Libin
    Hong, Huashan
    FRONTIERS IN PHYSIOLOGY, 2021, 12
  • [8] Development and validation of a new diagnostic prediction model for NAFLD based on machine learning algorithms in NHANES 2017-2020.3
    Wang, Yazhi
    Wang, Peng
    HORMONES-INTERNATIONAL JOURNAL OF ENDOCRINOLOGY AND METABOLISM, 2025,
  • [9] Machine learning-based prediction of vitamin D deficiency: NHANES 2001-2018
    Guo, Jiale
    He, Qionghan
    Li, Yehai
    FRONTIERS IN ENDOCRINOLOGY, 2024, 15
  • [10] Development and Validation of a Machine Learning-Based Prediction Model for Detection of Biliary Atresia
    Choi, Ho Jung
    Kim, Yeong Eun
    Namgoong, Jung-Man
    Kim, Inki
    Park, Jun Sung
    Baek, Woo Im
    Lee, Byong Sop
    Yoon, Hee Mang
    Cho, Young Ah
    Lee, Jin Seong
    Shim, Jung Ok
    Oh, Seak Hee
    Moon, Jin Soo
    Ko, Jae Sung
    Kim, Dae Yeon
    Kim, Kyung Mo
    GASTRO HEP ADVANCES, 2023, 2 (06): : 778 - 787