Development and validation of a prediction model for ED using machine learning: according to NHANES 2001-2004

被引:1
作者
Chen, Xing-Yu [1 ,2 ]
Lu, Wen-Ting [3 ]
Zhang, Di [4 ]
Tan, Mo-Yao [5 ]
Qin, Xin [1 ,2 ]
机构
[1] Chengdu Integrated TCM, Chengdu, Sichuan, Peoples R China
[2] Western Med Hosp, Chengdu, Sichuan, Peoples R China
[3] XinDu Hosp Tradit Chinese Med, Chengdu, Sichuan, Peoples R China
[4] Sichuan Univ, West China Sch Pharm, Chengdu, Sichuan, Peoples R China
[5] Chengdu Univ Tradit Chinese Med, Chengdu, Sichuan, Peoples R China
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Erectile Dysfunction; Machine learning; XGBoost; National Health and Nutrition Examination Survey; Prediction model; URINARY-TRACT SYMPTOMS; ERECTILE DYSFUNCTION; CARDIOVASCULAR-DISEASE; OXIDATIVE STRESS; NEURAL-NETWORKS; MEN; PREVALENCE; DIAGNOSIS; TESTOSTERONE; CLASSIFICATION;
D O I
10.1038/s41598-024-78797-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Erectile Dysfunction (ED) is a form of sexual dysfunction in males that imposes significant health and financial burdens globally. Despite its high prevalence, diagnosing ED remains challenging due to the limitations of current diagnostic methods and patients' reluctance to seek medical help. Currently, some studies have used machine learning techniques for developing ED prediction models, but the performance and interpretability of existing models need to be further improved. This study utilized data from the National Health and Nutrition Examination Survey (NHANES) for the years 2001 to 2004, adhering to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) statement. After excluding male respondents who did not meet the study criteria, a total of 3,869 participants were included. Gradient boosting decision tree (GBDT) algorithms (XGBoost, CatBoost, LightGBM) were used to develop the ED prediction model. Data preprocessing, feature selection, model evaluation, and interpretability analysis were performed to ensure the reliability and effectiveness of the model. The model evaluation results revealed that the AUC values are XGBoost: 0.887 +/- 0.016; LightGBM: 0.879 +/- 0.016; CatBoost: 0.871 +/- 0.019. The F1-Scores are XGBoost: 0.695 +/- 0.023; LightGBM: 0.681 +/- 0.025; CatBoost: 0.681 +/- 0.025. The Recall values are XGBoost: 0.789 +/- 0.026; LightGBM: 0.739 +/- 0.030; CatBoost: 0.711 +/- 0.030. These results confirmed that the XGBoost model is the best-performing ED prediction model in this study. Interpretability analysis results of the XGBoost model showed that age, obesity, cardiovascular risk factors, prostate-related diseases, and socioeconomic status are key features for predicting ED, playing a significant role in the ED mechanism. Therefore, we believe the ED prediction model trained in this study has strong predictive performance and high interpretability. This model can help to expand the diagnostic options for ED, improve the diagnosis rate of ED, and assist doctors in early intervention for patients with ED, ultimately improving patient prognosis.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Blood Uric Acid Prediction With Machine Learning: Model Development and Performance Comparison
    Sampa, Masuda Begum
    Hossain, Nazmul
    Hoque, Rakibul
    Islam, Rafiqul
    Yokota, Fumihiko
    Nishikitani, Mariko
    Ahmed, Ashir
    JMIR MEDICAL INFORMATICS, 2020, 8 (10)
  • [32] Development and application of a machine learning-based antenatal depression prediction model
    Hu, Chunfei
    Lin, Hongmei
    Xu, Yupin
    Fu, Xukun
    Qiu, Xiaojing
    Hu, Siqian
    Jin, Tong
    Xu, Hualin
    Luo, Qiong
    JOURNAL OF AFFECTIVE DISORDERS, 2025, 375 : 137 - 147
  • [33] Using Machine Learning to Predict the Duration of AtrialFibrillation:Model Development and Validation
    Shimoo, Satoshi
    Senoo, Keitaro
    Okawa, Taku
    Kawai, Kohei
    Makino, Masahiro
    Munakata, Jun
    Tomura, Nobunari
    Iwakoshi, Hibiki
    Nishimura, Tetsuro
    Shiraishi, Hirokazu
    Inoue, Keiji
    Matoba, Satoaki
    JMIR MEDICAL INFORMATICS, 2024, 12
  • [34] Development and evaluation of a machine learning model for osteoporosis risk prediction in Korean women
    Je, Minkyung
    Hwang, Seunghyeon
    Lee, Suwon
    Kim, Yoona
    BMC WOMENS HEALTH, 2025, 25 (01)
  • [35] Machine Learning-Based Reliability Evaluation for Software Defect Prediction and Model Validation Assessment
    Kovur, Krishna Mohan
    Shaik, Harun-Ul-Rasheed
    Verma, Ajit Kumar
    Srividya, A.
    INTERNATIONAL JOURNAL OF RELIABILITY QUALITY AND SAFETY ENGINEERING, 2025,
  • [36] Development and validation of prediction models for papillary thyroid cancer structural recurrence using machine learning approaches
    Wang, Hongxi
    Zhang, Chao
    Li, Qianrui
    Tian, Tian
    Huang, Rui
    Qiu, Jiajun
    Tian, Rong
    BMC CANCER, 2024, 24 (01)
  • [37] Development and internal validation of diagnostic prediction models using machine-learning algorithms in dogs with hypothyroidism
    Corsini, Andrea
    Lunetta, Francesco
    Alboni, Fabrizio
    Drudi, Ignazio
    Faroni, Eugenio
    Fracassi, Federico
    FRONTIERS IN VETERINARY SCIENCE, 2023, 10
  • [38] A study on frost prediction model using machine learning
    Kim, Hyojeoung
    Kim, Sahm
    KOREAN JOURNAL OF APPLIED STATISTICS, 2022, 35 (04) : 543 - 552
  • [39] Drought Prediction and Validation for Desert Region using Machine Learning Methods
    Raja, Azmat
    Gopikrishnan, T.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 47 - 53
  • [40] Development and external validation of a machine learning model for prediction of survival in undifferentiated pleomorphic sarcoma
    Lee L.
    Yi T.
    Fice M.
    Achar R.K.
    Jones C.
    Klein E.
    Buac N.
    Lopez-Hisijos N.
    Colman M.W.
    Gitelis S.
    Blank A.T.
    MUSCULOSKELETAL SURGERY, 2024, 108 (1) : 77 - 86