Improving the accuracy of diagnosing and predicting coronary heart disease using ensemble method and feature selection techniques

Cited by: 5
Authors
Asif, Sohaib [1, 2, 3]
Wenhui, Yi [1, 2]
ul Ain, Qurrat [4]
Yueyang, Yi [5]
Jinhai, Si [1, 2]
Affiliations
[1] Xi An Jiao Tong Univ, Key Lab Informat Photon Technol Shaanxi Prov, Xian 710049, Shaanxi, Peoples R China
[2] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Sch Elect Sci & Engn, Key Lab Phys Elect,Minist Educ, Xian 710049, Shaanxi, Peoples R China
[3] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[4] Cent South Univ, Sch Publ Hlth, Changsha, Peoples R China
[5] Xi An Jiao Tong Univ, Hlth Sci Ctr, Xian, Shaanxi, Peoples R China
Source
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2024 / Vol. 27 / Issue 02
Keywords
Heart disease classification; Machine learning; Feature selection; Ensemble methods; Intelligent system; FAILURE; SYSTEM
DOI
10.1007/s10586-023-04062-2
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Heart disease is a complex condition that affects many people around the world and, driven in part by unhealthy lifestyles, is the most common cause of death worldwide. Machine learning plays an important role in medical treatment. The goal of this research is to develop a machine learning model that helps diagnose heart disease quickly and accurately. We propose a robust ensemble model that combines the three top-performing classifiers, namely Random Forest, XGBoost, and Gradient Boosting Machine, and merges their results with an ensemble voting method to improve the prediction of heart disease. We used a combined heart disease dataset built from five different datasets (Hungary, Statlog, Switzerland, VA Long Beach, and Cleveland). Six feature selection algorithms (Pearson Correlation, Univariate Feature Selection, Recursive Feature Elimination, Boruta Feature Selection, Random Forest, and LightGBM) are used to rank the features and select the highly relevant ones in order to improve classification accuracy. The proposed ensemble model is built on seven highly relevant features, and individual machine learning algorithms and ensemble learning techniques are compared on the selected features. The model is evaluated with accuracy, sensitivity, precision, F1-score, MCC, NPV, and AUC. The ensemble model achieves a classification accuracy, sensitivity, and precision of 96.17%, 98.37%, and 94.53%, respectively, and performs better than existing models and the individual classifiers. The results show that the proposed ensemble method can effectively predict the risk of heart disease.
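The abstract names an ensemble voting step and a set of evaluation metrics but does not spell out either. The sketch below illustrates both ideas under stated assumptions: it uses hard (majority) voting, which may or may not be the voting scheme the authors used, and the prediction lists `rf_pred`, `xgb_pred`, and `gbm_pred` are invented toy values that only stand in for the outputs of the three classifiers. The function names `majority_vote` and `binary_metrics` are hypothetical. AUC is left out because it needs predicted scores rather than labels.

```python
from collections import Counter
from math import sqrt


def majority_vote(*model_predictions):
    """Hard voting: for each sample, return the label most models predicted."""
    return [Counter(votes).most_common(1)[0][0] for votes in zip(*model_predictions)]


def binary_metrics(y_true, y_pred):
    """Compute label-based metrics from the confusion matrix (1 = disease)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    tn = sum(t == 0 and p == 0 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)

    def ratio(num, den):
        # Return 0.0 instead of raising when a denominator is empty.
        return num / den if den else 0.0

    precision = ratio(tp, tp + fp)
    sensitivity = ratio(tp, tp + fn)
    return {
        "accuracy": ratio(tp + tn, tp + tn + fp + fn),
        "sensitivity": sensitivity,
        "precision": precision,
        "f1": ratio(2 * precision * sensitivity, precision + sensitivity),
        "npv": ratio(tn, tn + fn),
        "mcc": ratio(
            tp * tn - fp * fn,
            sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)),
        ),
    }


# Toy labels and predictions (assumed values, not data from the paper).
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
rf_pred = [1, 1, 1, 0, 0, 0, 1, 0]
xgb_pred = [1, 1, 0, 1, 0, 0, 0, 0]
gbm_pred = [1, 0, 1, 1, 0, 1, 1, 0]

ensemble_pred = majority_vote(rf_pred, xgb_pred, gbm_pred)
for name, value in binary_metrics(y_true, ensemble_pred).items():
    print(f"{name}: {value:.4f}")
```

With three voters and two classes a tie cannot occur, so no tie-breaking rule is needed here. A soft-voting variant would average predicted probabilities instead of counting labels.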
Pages: 1927-1946
Number of pages: 20