Performance evaluation of optimal ensemble learning approaches with PCA and LDA-based feature extraction for heart disease prediction

被引:3
作者
Rabbi, Md. Sakhawat Hossain [1 ]
Bari, Md. Masbahul [1 ]
Debnath, Tanoy [2 ]
Rahman, Anichur [3 ,4 ]
Das, Avik Kumar [1 ]
Hossain, Md. Parvez [1 ]
Muhammad, Ghulam [5 ]
机构
[1] Green Univ Bangladesh, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA
[3] Univ Dhaka, Natl Inst Text Engn & Res NITER, Dept Comp Sci & Engn, Constituent Inst, Dhaka 1350, Bangladesh
[4] Mawlana Bhashani Sci & Technol Univ, Dept Comp Sci & Engn, Tangail, Bangladesh
[5] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Riyadh, Saudi Arabia
关键词
Machine learning; Prediction method; Heart disease detection; Ensemble methods; Feature selection; Data balancing and data analysis; SYSTEM;
D O I
10.1016/j.bspc.2024.107138
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Heart disease is a global health concern with a high mortality rate, necessitating early, accurate, and reliable prediction methods for effective prevention and control. In this research, we combine principal component analysis and linear discriminant analysis to reduce dataset complexity and enhance the performance of heart disease classification models by selecting the most relevant features. We address the class imbalance by employing two balancing techniques: oversampling and the synthetic minority oversampling technique, which ensures amore representative dataset, leading to more accurate predictions. Our study develops a novel ensemble approach, utilizing a combination of random forest, support vector machine, K-nearest neighbors, logistic regression, decision tree, and Gaussian naive Bayes to significantly improve heart disease prediction accuracy. Furthermore, we implement advanced ensemble learning techniques, such as Stacking, Bagging, Voting, and Boosting, to achieve early and precise prediction of heart disease. The performance evaluation is conducted on three datasets: Cleveland Heart Disease, Framingham Heart Disease, and Indicators of Heart Disease Dataset (2020), ensuring a robust validation of our methods. The results demonstrate that the voting ensemble machine learning algorithm (VEMLA) achieved 92% accuracy on the Cleveland Heart Disease dataset, while the bagging ensemble machine learning algorithm (BEMLA) achieved 97% accuracy on both the Framingham Heart Disease and Indicators of Heart Disease (2020) datasets. Notably, the proposed BEMLA consistently outperformed other methods, showcasing its superiority in heart disease prediction. This study contributes a comprehensive and effective approach to heart disease diagnosis, outperforming individual classifiers and providing valuable insights for practical clinical applications.
引用
收藏
页数:21
相关论文
共 66 条
[1]  
Aasheesh, (N.D.) Framingham heart study dataset
[2]   A quantum classification algorithm for classification incomplete patterns based on entanglement measure [J].
Abdel-Aty, Abdel-Haleem ;
Kadry, Heba ;
Zidan, Mohammed ;
Al-Sbou, Yazeed ;
Zanaty, E. A. ;
Abdel-Aty, Mahmoud .
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (03) :2809-2816
[3]   Explainable Heart Disease Prediction Using Ensemble-Quantum Machine Learning Approach [J].
Abdulsalam, Ghada ;
Meshoul, Souham ;
Shaiba, Hadil .
INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (01) :761-779
[4]  
Al Maruf A., 2023, INT C MACH LEARN IOT, P481
[5]   Comprehensive evaluation and performance analysis of machine learning in heart disease prediction [J].
Al-Alshaikh, Halah A. ;
Prabu, P. ;
Poonia, Ramesh Chandra ;
Saudagar, Abdul Khader Jilani ;
Yadav, Manoj ;
AlSagri, Hatoon S. ;
AlSanad, Abeer A. .
SCIENTIFIC REPORTS, 2024, 14 (01)
[6]   Heart disease prediction using supervised machine learning algorithms: Performance analysis and comparison [J].
Ali, Md Mamun ;
Paul, Bikash Kumar ;
Ahmed, Kawsar ;
Bui, Francis M. ;
Quinn, Julian M. W. ;
Moni, Mohammad Ali .
COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 136
[7]   Hyperparameter Search for Machine Learning Algorithms for Optimizing the Computational Complexity [J].
Ali, Yasser A. ;
Awwad, Emad Mahrous ;
Al-Razgan, Muna ;
Maarouf, Ali .
PROCESSES, 2023, 11 (02)
[8]   Prediction of User's Intention to Use Metaverse System in Medical Education: A Hybrid SEM-ML Learning Approach [J].
Almarzouqi, Amina ;
Aburayya, Ahmad ;
Salloum, Said A. .
IEEE ACCESS, 2022, 10 :43421-43434
[9]  
Alotaibi N, 2022, INT J ADV COMPUT SC, V13, P810
[10]   Cardiovascular Disease Detection using Ensemble Learning [J].
Alqahtani, Abdullah ;
Alsubai, Shtwai ;
Sha, Mohemmed ;
Vilcekova, Lucia ;
Javed, Talha .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022