Prediction of cardiovascular disease based on multiple feature selection and improved PSO-XGBoost model

Authors
Kerang Cao [1]
Chang Liu [2]
Siqi Yang [1]
Yuxin Zhang [1]
Lili Li [1]
Hoekyung Jung [3]
Shuo Zhang [4]
Affiliations
[1] College of Computer Science and Technology, Shenyang University of Chemical Technology, Shenyang
[2] Key Laboratory of Intelligent Technology of Chemical Process Industry in Liaoning Province, Shenyang
[3] Shenyang Maternity and Child Health Hospital, Shenyang
[4] Computer Engineering Dept, Paichai University, Daejeon
Keywords
Cardiovascular disease; Machine learning; Model prediction; Multi feature selection; Particle swarm optimization algorithm; XGBoost algorithm
DOI
10.1038/s41598-025-96520-7
Abstract
Cardiovascular disease is a common disease that threatens human health. To predict it more accurately, this paper proposes a cardiovascular disease prediction model that combines multiple feature selection, an improved particle swarm optimization algorithm, and extreme gradient boosting (XGBoost). First, the dataset is preprocessed, and an XGBoost cardiovascular disease prediction model is constructed, trained, and compared with other algorithms. Then, multiple feature selection is performed by combining two-factor Pearson correlation analysis with feature importance ranking, and the optimal feature subset is used as the feature input. Finally, the improved particle swarm optimization algorithm is used to tune the hyperparameters of the XGBoost algorithm, and the optimal hyperparameter combination is selected to construct the MFS-DLPSO-XGBoost model. The recall, precision, accuracy, F1 score, and area under the ROC curve (AUC) of the MFS-DLPSO-XGBoost model reached 71.4%, 76.3%, 74.7%, 73.6%, and 80.8%, respectively, increases of 3.6%, 3.2%, 2.7%, 3.2%, and 2.3% over XGBoost. The results indicate that the proposed model has good classification performance and can assist doctors and patients in predicting and preventing heart disease. © The Author(s) 2025.
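The hyperparameter-tuning step described in the abstract can be illustrated with a minimal sketch of standard particle swarm optimization. This is not the paper's improved DLPSO variant, whose details are not given here. The function names (`pso_minimize`, `toy_loss`), the inertia and acceleration coefficients, the search bounds, and the two tuned parameters are all assumptions chosen for illustration, and `toy_loss` is a hypothetical stand-in for a real validation loss obtained by training XGBoost with the candidate hyperparameters.

```python
import random


def pso_minimize(objective, bounds, n_particles=20, n_iters=50,
                 w=0.7, c1=1.5, c2=1.5, seed=0):
    """Standard (global-best) PSO; NOT the paper's improved variant.

    objective: maps a list of floats to a score to be minimized.
    bounds:    list of (low, high) pairs, one per hyperparameter.
    """
    rng = random.Random(seed)
    dim = len(bounds)
    # Random initial positions inside the box, zero initial velocities.
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds]
           for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d, (lo, hi) in enumerate(bounds):
                r1, r2 = rng.random(), rng.random()
                # Inertia + pull toward personal best + pull toward global best.
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                # Move, then clip the position back into the search box.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


def toy_loss(params):
    """Hypothetical stand-in for a cross-validated loss (assumption).

    In a real pipeline this would train a model with the given
    hyperparameters and return, for example, 1 - AUC on held-out data.
    """
    learning_rate, max_depth = params
    return (learning_rate - 0.1) ** 2 + ((max_depth - 6.0) / 10.0) ** 2


# Assumed search ranges for two commonly tuned boosting hyperparameters.
search_bounds = [(0.01, 0.5), (2.0, 12.0)]
best_params, best_loss = pso_minimize(toy_loss, search_bounds)
print(best_params, best_loss)
```

In practice, integer-valued hyperparameters such as tree depth would be rounded before the model is trained, and the objective would be far more expensive than this toy function, so the swarm size and iteration count are a trade-off against training time.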
Related papers
50 in total
  • [21] Prediction model of mechanical properties of hot-rolled strip based on improved feature selection method
    Gao, Zhi-wei
    Cao, Guang-ming
    Wu, Si-wei
    Luo, Deng
    Wang, Hou-xin
    Liu, Zhen-yu
    JOURNAL OF IRON AND STEEL RESEARCH INTERNATIONAL, 2024
  • [22] Improved Prediction of Knee Osteoarthritis by the Machine Learning Model XGBoost
    Su, Kui
    Yuan, Xin
    Huang, Yukai
    Yuan, Qian
    Yang, Minghui
    Sun, Jianwu
    Li, Shuyi
    Long, Xinyi
    Liu, Lang
    Li, Tianwang
    Yuan, Zhengqiang
    INDIAN JOURNAL OF ORTHOPAEDICS, 2023, 57 (10) : 1667 - 1677
  • [24] Classification and prediction of spinal disease based on the SMOTE-RFE-XGBoost model
    Zhang, Biao
    Dong, Xinyan
    Hu, Yuwei
    Jiang, Xuchu
    Li, Gongchi
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [25] Network traffic prediction model based on improved VMD and PSO-ELM
    Shi, Jinmei
    Zhou, Jinghe
    Feng, Junying
    Chen, Huandong
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2023, 36 (07)
  • [26] Feature Selection for Hypertension Risk Prediction Using XGBoost on Single Nucleotide Polymorphism Data
    Muflikhah, Lailil
    Fatyanosa, Tirana Noor
    Widodo, Nashi
    Perdana, Rizal Setya
    Solimun
    Ratnawati, Hana
    HEALTHCARE INFORMATICS RESEARCH, 2025, 31 (01) : 16 - 22
  • [27] Age Prediction Based on Feature Selection
    Wang, Yanhong
    Song, Wei
    Liu, Lizhen
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA), 2017 : 359 - 363
  • [28] A diabetes prediction model based on Boruta feature selection and ensemble learning
    Hongfang Zhou
    Yinbo Xin
    Suli Li
    BMC Bioinformatics, 24
  • [29] Prediction of Total Organic Carbon Content in Shale Based on PCA-PSO-XGBoost
    Meng, Yingjie
    Xu, Chengwu
    Li, Tingting
    Liu, Tianyong
    Tang, Lu
    Zhang, Jinyou
    APPLIED SCIENCES-BASEL, 2025, 15 (07)
  • [30] Photovoltaic power prediction of LSTM model based on Pearson feature selection
    Chen, Hailang
    Chang, Xianfa
    ENERGY REPORTS, 2021, 7 : 1047 - 1054