Development and validation of a prediction model for ED using machine learning: according to NHANES 2001-2004

Cited by: 1

Authors:

Chen, Xing-Yu [1,2]

Lu, Wen-Ting [3]

Zhang, Di [4]

Tan, Mo-Yao [5]

Qin, Xin [1,2]

Affiliations:

[1] Chengdu Integrated TCM, Chengdu, Sichuan, Peoples R China

[2] Western Med Hosp, Chengdu, Sichuan, Peoples R China

[3] XinDu Hosp Tradit Chinese Med, Chengdu, Sichuan, Peoples R China

[4] Sichuan Univ, West China Sch Pharm, Chengdu, Sichuan, Peoples R China

[5] Chengdu Univ Tradit Chinese Med, Chengdu, Sichuan, Peoples R China

Source:

SCIENTIFIC REPORTS | 2024 / Vol. 14 / Issue 01

Keywords:

Erectile Dysfunction; Machine learning; XGBoost; National Health and Nutrition Examination Survey; Prediction model; URINARY-TRACT SYMPTOMS; ERECTILE DYSFUNCTION; CARDIOVASCULAR-DISEASE; OXIDATIVE STRESS; NEURAL-NETWORKS; MEN; PREVALENCE; DIAGNOSIS; TESTOSTERONE; CLASSIFICATION;

DOI:

10.1038/s41598-024-78797-2

Chinese Library Classification (CLC):

O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

Erectile Dysfunction (ED) is a form of sexual dysfunction in males that imposes significant health and financial burdens globally. Despite its high prevalence, diagnosing ED remains challenging due to the limitations of current diagnostic methods and patients' reluctance to seek medical help. Currently, some studies have used machine learning techniques for developing ED prediction models, but the performance and interpretability of existing models need to be further improved. This study utilized data from the National Health and Nutrition Examination Survey (NHANES) for the years 2001 to 2004, adhering to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) statement. After excluding male respondents who did not meet the study criteria, a total of 3,869 participants were included. Gradient boosting decision tree (GBDT) algorithms (XGBoost, CatBoost, LightGBM) were used to develop the ED prediction model. Data preprocessing, feature selection, model evaluation, and interpretability analysis were performed to ensure the reliability and effectiveness of the model. The model evaluation results revealed that the AUC values are XGBoost: 0.887 ± 0.016; LightGBM: 0.879 ± 0.016; CatBoost: 0.871 ± 0.019. The F1-Scores are XGBoost: 0.695 ± 0.023; LightGBM: 0.681 ± 0.025; CatBoost: 0.681 ± 0.025. The Recall values are XGBoost: 0.789 ± 0.026; LightGBM: 0.739 ± 0.030; CatBoost: 0.711 ± 0.030. These results confirmed that the XGBoost model is the best-performing ED prediction model in this study. Interpretability analysis results of the XGBoost model showed that age, obesity, cardiovascular risk factors, prostate-related diseases, and socioeconomic status are key features for predicting ED, playing a significant role in the ED mechanism. Therefore, we believe the ED prediction model trained in this study has strong predictive performance and high interpretability. This model can help to expand the diagnostic options for ED, improve the diagnosis rate of ED, and assist doctors in early intervention for patients with ED, ultimately improving patient prognosis.

Pages: 18
