Improving the accuracy of diagnosing and predicting coronary heart disease using ensemble method and feature selection techniques

被引:5
|
作者
Asif, Sohaib [1 ,2 ,3 ]
Wenhui, Yi [1 ,2 ]
ul Ain, Qurrat [4 ]
Yueyang, Yi [5 ]
Jinhai, Si [1 ,2 ]
机构
[1] Xi An Jiao Tong Univ, Key Lab Informat Photon Technol Shaanxi Prov, Xian 710049, Shaanxi, Peoples R China
[2] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Sch Elect Sci & Engn, Key Lab Phys Elect,Minist Educ, Xian 710049, Shaanxi, Peoples R China
[3] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[4] Cent South Univ, Sch Publ Hlth, Changsha, Peoples R China
[5] Xi An Jiao Tong Univ, Hlth Sci Ctr, Xian, Shaanxi, Peoples R China
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2024年 / 27卷 / 02期
关键词
Heart disease classification; Machine learning; Feature selection; Ensemble methods; Intelligent system; FAILURE; SYSTEM;
D O I
10.1007/s10586-023-04062-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Heart disease is a complex disease, and many people around the world suffer from this disease. Due to the lack of a healthy lifestyle, it is the most common cause of death worldwide. Machine learning plays an important role in medical treatment. The goal of this research is to develop a machine learning model to help diagnose heart disease quickly and accurately. In this article, an effective and improved machine learning method is proposed to diagnose heart disease. We designed a novel and robust ensemble model that combines the top three classifiers, namely Random Forest, XGBoost and Gradient Boosting Machine, to effectively diagnose heart disease. We used an ensemble voting method to combine the results of the top three classifiers to improve the prediction of heart disease. We used a combined heart disease dataset containing five different datasets (Hungary, Statlog, Switzerland, VA Long Beach and Cleveland). Feature selection algorithms (Pearson Correlation, Univariate Feature Selection, Recursive Feature Elimination, Boruta Feature Selection, Random forest, and LightGBM) are used to select highly relevant features based on rankings to improve classification accuracy. The proposed ensemble model is designed using seven highly relevant features, and a comparison of machine learning algorithms and ensemble learning techniques is applied to the selected features. Different performance evaluation methods are used to evaluate the proposed model: accuracy, sensitivity, precision, F1-score, MCC, NPV and AUC. Results analysis shows that the ensemble model achieves excellent classification accuracy, sensitivity, and precision of 96.17%, 98.37%, and 94.53%. Our proposed model performs better than existing models and individual classifiers. The results show that the proposed ensemble method can effectively predict the risk of heart disease.
引用
收藏
页码:1927 / 1946
页数:20
相关论文
共 50 条
  • [1] Improving the accuracy of diagnosing and predicting coronary heart disease using ensemble method and feature selection techniques
    Sohaib Asif
    Yi Wenhui
    Qurrat ul Ain
    Yi Yueyang
    Si Jinhai
    Cluster Computing, 2024, 27 : 1927 - 1946
  • [2] Coronary heart disease classification using deep learning approach with feature selection for improved accuracy
    Muniasamy, Anandhavalli
    Begum, Arshiya
    Sabahath, Asfia
    Yaqub, Humara
    Karunakaran, Gauthaman
    TECHNOLOGY AND HEALTH CARE, 2024, 32 (03) : 1991 - 2007
  • [3] Diagnosing Coronary Heart Disease using Ensemble Machine Learning
    Miao, Kathleen H.
    Miao, Julia H.
    Miao, George J.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (10) : 30 - 39
  • [4] Exploring Important Factors in Predicting Heart Disease Based on Ensemble-Extra Feature Selection Approach
    Abubaker, Howida
    Muchtar, Farkhana
    Khairuddin, Alif Ridzuan
    Nuar, Ahmad Najmi Amerhaider
    Yunos, Zuriahati Mohd
    Salimun, Carolyn
    BAGHDAD SCIENCE JOURNAL, 2024, 21 (02) : 812 - 831
  • [5] Improving the Accuracy of Predicting Disulfide Connectivity by Feature Selection
    Zhu, Lin
    Yang, Jie
    Song, Jiang-Ning
    Chou, Kuo-Chen
    Shen, Hong-Bin
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2010, 31 (07) : 1478 - 1485
  • [6] Enhanced Evolutionary Feature Selection and Ensemble Method for Cardiovascular Disease Prediction
    Jothi Prakash, V.
    Karthikeyan, N. K.
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2021, 13 (03) : 389 - 412
  • [7] Ensemble Feature Selection for Heart Disease Classification
    Benhar, Houda
    Idri, Ali
    Hosni, Mohamed
    HEALTHINF: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES - VOL 5: HEALTHINF, 2021, : 369 - 376
  • [8] Efficient prediction of evaporation using ensemble feature selection techniques
    Sharma, Rakhee
    Singh, Archana
    Mittal, Mamta
    MAUSAM, 2023, 74 (04): : 951 - 962
  • [9] The Role of Data Pre-processing Techniques in Improving Machine Learning Accuracy for Predicting Coronary Heart Disease
    Sami, Osamah
    Elsheikh, Yousef
    Almasalha, Fadi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (06) : 812 - 820
  • [10] Effective Feature Selection Using Ensemble Techniques and Genetic Algorithm
    Ghorpade-Aher, Jayshree
    Sonkamble, Balwant
    PROCEEDINGS OF SIXTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2021), VOL 2, 2022, 236 : 367 - 375