Development and validation of a prediction model for coronary heart disease risk in depressed patients aged 20 years and older using machine learning algorithms

被引:0
作者
Wang, Yicheng [1 ,2 ,3 ]
Wu, Chuan-Yang [4 ]
Fu, Hui-Xian [5 ]
Zhang, Jian-Cheng [1 ,2 ,3 ]
机构
[1] Fujian Med Univ, Shengli Clin Med Coll, Fuzhou, Fujian, Peoples R China
[2] Fuzhou Univ, Dept Cardiovasc Med, Affiliated Prov Hosp, Fuzhou, Fujian, Peoples R China
[3] Fujian Prov Hosp, Dept Cardiol, Fuzhou, Fujian, Peoples R China
[4] Youxi Cty Gen Hop, Dept Cardiol, Sanming, Fujian, Peoples R China
[5] Changji Prefecture Peoples Hosp Xinjiang Uygur Aut, Dept Cardiol, Changji, Xinjiang, Peoples R China
来源
FRONTIERS IN CARDIOVASCULAR MEDICINE | 2025年 / 11卷
基金
中国国家自然科学基金;
关键词
depression; machine learning; prediction model; coronary heart disease; National Health and Nutrition Examination Survey (NHANES);
D O I
10.3389/fcvm.2024.1504957
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Depression is being increasingly acknowledged as an important risk factor contributing to coronary heart disease (CHD). Currently, there is no predictive model specifically designed to evaluate the risk of coronary heart disease among individuals with depression. We aim to develop a machine learning (ML) model that will analyze risk factors and forecast the probability of coronary heart disease in individuals suffering from depression.Methods This research employed data from the National Health and Nutrition Examination Survey (NHANES) from 2007-2018, which included 2,085 individuals who had previously been diagnosed with depression. The population was randomly divided into a training set and a validation set, with an 8:2 ratio. Univariate and multivariate logistic regression analyses were employed to identify independent risk factors for coronary heart disease in individuals with depression. Eight machine learning algorithms were applied to the training set to construct the model, including logistic regression (LR), random forest (RF), gradient boosting machine (GBM), support vector machine (SVM), extreme gradient boosting (XGBoost), classification and regression tree (CART), k-nearest neighbors (KNN), and neural network (NNET). The validation set are used to evaluate the various performances of eight machine learning models. Several evaluation metrics were employed to assess and compare the performance of eight different machine learning models, aiming to identify the most effective algorithm for predicting coronary heart disease risk in individuals with depression. The evaluation metrics applied in this study included the area under the receiver operating characteristic (ROC) curve, calibration curve, Brier scores, decision curve analysis (DCA), and the precision-recall (PR) curve. And internally validated by the bootstrap method.Results Univariate and multivariate logistic regression analyses identified age, chest pain status, history of myocardial infarction, serum triglyceride levels, and education level as independent predictors of coronary heart disease risk. Eight machine learning algorithms are applied to construct the models, among which the Random Forest model has the best performance, with an (Area Under Curve) AUC of 0.987 for the random forest model in the training set, and an AUC of 0.848 for the PR curve. In the validation set, the random forest model achieves an AUC of 0.996, and an AUC of 0.960 for the PR curve, which demonstrates an excellent discriminative ability. Calibration curves indicated high congruence between observed and predicted odds, with minimal Brier scores of 0.026 and 0.021 for the training, respectively, reinforcing the model's ability to discriminate. Set and validation set, respectively, reinforcing the model's predictive accuracy. DCA curves confirmed net benefits of the random forest model across. Furthermore, the AUC of the random forest model was 0.928 after internal validation by bootstrap method, indicating that its discriminative ability is good, and the model is useful for clinical assessment of the risk of coronary heart disease in depressed people.Conclusion The random forest algorithm exhibited the best predictive performance, potentially aiding clinicians in assessing the risk probabilities of coronary heart disease within this population.
引用
收藏
页数:15
相关论文
共 32 条
[1]   Plasma Metabolites Alert Patients With Chest Pain to Occurrence of Myocardial Infarction [J].
Aa, Nan ;
Lu, Ying ;
Yu, Mengjie ;
Tang, Heng ;
Lu, Zhenyao ;
Sun, Runbing ;
Wang, Liansheng ;
Li, Chunjian ;
Yang, Zhijian ;
Aa, Jiye ;
Kong, Xiangqing ;
Wang, Guangji .
FRONTIERS IN CARDIOVASCULAR MEDICINE, 2021, 8
[2]  
Aoki J, 2023, KIDNEY MED, V5, DOI [10.1016/j.xkme.2023.100692, 10.1016/license]
[3]   Comparison of machine learning and conventional logistic regression-based prediction models for gestational diabetes in an ethnically diverse population; the Monash GDM Machine learning model [J].
Belsti, Yitayeh ;
Moran, Lisa ;
Du, Lan ;
Mousa, Aya ;
De Silva, Kushan ;
Enticott, Joanne ;
Teede, Helena .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 179
[4]   Treating Depression to Improve Survival in Coronary Heart Disease What Have We Learned? [J].
Carney, Robert M. ;
Freedland, Kenneth E. ;
Rich, Michael W. .
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2024, 84 (05) :482-489
[5]   Machine Learning for Prediction and Risk Stratification of Lupus Nephritis Renal Flare [J].
Chen, Yinghua ;
Huang, Siwan ;
Chen, Tiange ;
Liang, Dandan ;
Yang, Jing ;
Zeng, Caihong ;
Li, Xiang ;
Xie, Guotong ;
Liu, ZhiHong .
AMERICAN JOURNAL OF NEPHROLOGY, 2021, 52 (02) :152-160
[6]   Development of an unsupervised machine learning algorithm for the prognostication of walking ability in spinal cord injury patients [J].
DeVries, Zachary ;
Hoda, Mohamad ;
Rivers, Carly S. ;
Maher, Audrey ;
Wai, Eugene ;
Moravek, Dita ;
Stratton, Alexandra ;
Kingwell, Stephen ;
Fallah, Nader ;
Paquet, Jerome ;
Phan, Philippe .
SPINE JOURNAL, 2020, 20 (02) :213-224
[7]   Relationship between hepatocellular carcinoma and depression via online database analysis [J].
Han, Tiantian ;
Zhou, Yingchun ;
Li, Danhua .
BIOENGINEERED, 2021, 12 (01) :1689-1697
[8]   Development and external validation of a risk prediction model for depression in patients with coronary heart disease [J].
Hou, Xin-Zheng ;
Wu, Qian ;
Lv, Qian-Yu ;
Yang, Ying-Tian ;
Li, Lan-Lan ;
Ye, Xue-Jiao ;
Yang, Chen-Yan ;
Lv, Yan-Fei ;
Wang, Shi-Han .
JOURNAL OF AFFECTIVE DISORDERS, 2024, 367 :137-147
[9]   Prediction model for gestational diabetes mellitus using the XG Boost machine learning algorithm [J].
Hu, Xiaoqi ;
Hu, Xiaolin ;
Yu, Ya ;
Wang, Jia .
FRONTIERS IN ENDOCRINOLOGY, 2023, 14
[10]   The relationship between osteoporosis and depression [J].
Kashfi, Seyyed Sadra ;
Abdollahi, Gholamreza ;
Hassanzadeh, Jafar ;
Mokarami, Hamidreza ;
Jeihooni, Ali Khani .
SCIENTIFIC REPORTS, 2022, 12 (01)