Development and validation of a prediction model for ED using machine learning: according to NHANES 2001-2004

被引：1

作者：

Chen, Xing-Yu ^{[1
,2
]}

Lu, Wen-Ting ^{[3
]}

Zhang, Di ^{[4
]}

Tan, Mo-Yao ^{[5
]}

Qin, Xin ^{[1
,2
]}

机构：

[1] Chengdu Integrated TCM, Chengdu, Sichuan, Peoples R China

[2] Western Med Hosp, Chengdu, Sichuan, Peoples R China

[3] XinDu Hosp Tradit Chinese Med, Chengdu, Sichuan, Peoples R China

[4] Sichuan Univ, West China Sch Pharm, Chengdu, Sichuan, Peoples R China

[5] Chengdu Univ Tradit Chinese Med, Chengdu, Sichuan, Peoples R China

来源：

SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期

关键词：

Erectile Dysfunction; Machine learning; XGBoost; National Health and Nutrition Examination Survey; Prediction model; URINARY-TRACT SYMPTOMS; ERECTILE DYSFUNCTION; CARDIOVASCULAR-DISEASE; OXIDATIVE STRESS; NEURAL-NETWORKS; MEN; PREVALENCE; DIAGNOSIS; TESTOSTERONE; CLASSIFICATION;

D O I：

10.1038/s41598-024-78797-2

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Erectile Dysfunction (ED) is a form of sexual dysfunction in males that imposes significant health and financial burdens globally. Despite its high prevalence, diagnosing ED remains challenging due to the limitations of current diagnostic methods and patients' reluctance to seek medical help. Currently, some studies have used machine learning techniques for developing ED prediction models, but the performance and interpretability of existing models need to be further improved. This study utilized data from the National Health and Nutrition Examination Survey (NHANES) for the years 2001 to 2004, adhering to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) statement. After excluding male respondents who did not meet the study criteria, a total of 3,869 participants were included. Gradient boosting decision tree (GBDT) algorithms (XGBoost, CatBoost, LightGBM) were used to develop the ED prediction model. Data preprocessing, feature selection, model evaluation, and interpretability analysis were performed to ensure the reliability and effectiveness of the model. The model evaluation results revealed that the AUC values are XGBoost: 0.887 +/- 0.016; LightGBM: 0.879 +/- 0.016; CatBoost: 0.871 +/- 0.019. The F1-Scores are XGBoost: 0.695 +/- 0.023; LightGBM: 0.681 +/- 0.025; CatBoost: 0.681 +/- 0.025. The Recall values are XGBoost: 0.789 +/- 0.026; LightGBM: 0.739 +/- 0.030; CatBoost: 0.711 +/- 0.030. These results confirmed that the XGBoost model is the best-performing ED prediction model in this study. Interpretability analysis results of the XGBoost model showed that age, obesity, cardiovascular risk factors, prostate-related diseases, and socioeconomic status are key features for predicting ED, playing a significant role in the ED mechanism. Therefore, we believe the ED prediction model trained in this study has strong predictive performance and high interpretability. This model can help to expand the diagnostic options for ED, improve the diagnosis rate of ED, and assist doctors in early intervention for patients with ED, ultimately improving patient prognosis.

引用

页数：18

共 50 条

[31] Blood Uric Acid Prediction With Machine Learning: Model Development and Performance Comparison
Sampa, Masuda Begum
Hossain, Nazmul
Hoque, Rakibul
Islam, Rafiqul
Yokota, Fumihiko
Nishikitani, Mariko
Ahmed, Ashir
JMIR MEDICAL INFORMATICS, 2020, 8 (10)
[32] Development and application of a machine learning-based antenatal depression prediction model
Hu, Chunfei
Lin, Hongmei
Xu, Yupin
Fu, Xukun
Qiu, Xiaojing
Hu, Siqian
Jin, Tong
Xu, Hualin
Luo, Qiong
JOURNAL OF AFFECTIVE DISORDERS, 2025, 375 : 137 - 147
[33] Using Machine Learning to Predict the Duration of AtrialFibrillation:Model Development and Validation
Shimoo, Satoshi
Senoo, Keitaro
Okawa, Taku
Kawai, Kohei
Makino, Masahiro
Munakata, Jun
Tomura, Nobunari
Iwakoshi, Hibiki
Nishimura, Tetsuro
Shiraishi, Hirokazu
Inoue, Keiji
Matoba, Satoaki
JMIR MEDICAL INFORMATICS, 2024, 12
[34] Development and evaluation of a machine learning model for osteoporosis risk prediction in Korean women
Je, Minkyung
Hwang, Seunghyeon
Lee, Suwon
Kim, Yoona
BMC WOMENS HEALTH, 2025, 25 (01)
[35] Machine Learning-Based Reliability Evaluation for Software Defect Prediction and Model Validation Assessment
Kovur, Krishna Mohan
Shaik, Harun-Ul-Rasheed
Verma, Ajit Kumar
Srividya, A.
INTERNATIONAL JOURNAL OF RELIABILITY QUALITY AND SAFETY ENGINEERING, 2025,
[36] Development and validation of prediction models for papillary thyroid cancer structural recurrence using machine learning approaches
Wang, Hongxi
Zhang, Chao
Li, Qianrui
Tian, Tian
Huang, Rui
Qiu, Jiajun
Tian, Rong
BMC CANCER, 2024, 24 (01)
[37] Development and internal validation of diagnostic prediction models using machine-learning algorithms in dogs with hypothyroidism
Corsini, Andrea
Lunetta, Francesco
Alboni, Fabrizio
Drudi, Ignazio
Faroni, Eugenio
Fracassi, Federico
FRONTIERS IN VETERINARY SCIENCE, 2023, 10
[38] A study on frost prediction model using machine learning
Kim, Hyojeoung
Kim, Sahm
KOREAN JOURNAL OF APPLIED STATISTICS, 2022, 35 (04) : 543 - 552
[39] Drought Prediction and Validation for Desert Region using Machine Learning Methods
Raja, Azmat
Gopikrishnan, T.
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 47 - 53
[40] Development and external validation of a machine learning model for prediction of survival in undifferentiated pleomorphic sarcoma
Lee L.
Yi T.
Fice M.
Achar R.K.
Jones C.
Klein E.
Buac N.
Lopez-Hisijos N.
Colman M.W.
Gitelis S.
Blank A.T.
MUSCULOSKELETAL SURGERY, 2024, 108 (1) : 77 - 86

← 1 2 3 4 5 →