Hybrid Feature Selection Algorithm and Ensemble Stacking for Heart Disease Prediction

Cited by: 0
Authors
Zaini, Nureen Afiqah Mohd [1]
Awang, Mohd Khalid [1]
Affiliations
[1] Univ Sultan Zainal Abidin, Fac Informat & Comp, Tembila 22000, Terengganu, Malaysia
Keywords
Heart disease prediction; feature selection; stacking; accuracy; classification
DOI
10.14569/IJACSA.2023.0140220
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
In cardiology, as in other medical specialties, early and accurate diagnosis is crucial. Heart disease has been the leading cause of death over the past few decades, so early prediction is now more important than ever. However, state-of-the-art heart disease prediction strategies put more emphasis on classifier selection to enhance accuracy and performance, and seldom consider feature reduction techniques. Furthermore, several factors lead to heart disease, and it is critical to identify the most significant characteristics in order to achieve the best prediction accuracy and increase prediction performance. Feature reduction lowers the dimensionality of the data, which may allow learning algorithms to run faster and more efficiently and to produce more accurate predictive models. In this study, we explore and propose a hybrid of two distinct feature reduction techniques: chi-squared and analysis of variance (ANOVA). Classification is then performed on the selected features using the ensemble stacking method. Using the optimal features from the hybrid feature combination, a stacking ensemble based on logistic regression yields the best result, 93.44%. In summary, feature selection can be considered an effective method for the prediction of heart disease.
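The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions the abstract does not confirm: the data is a synthetic stand-in generated with scikit-learn, the "hybrid" is taken to be the union of the top-K features under each test (K = 5 is arbitrary), the base learners are a hypothetical choice, and "based on logistic regression" is read as logistic regression serving as the stacking meta-learner.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import StackingClassifier
from sklearn.feature_selection import SelectKBest, chi2, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import MinMaxScaler
from sklearn.tree import DecisionTreeClassifier

# Assumption: synthetic stand-in for a heart disease table (13 features, binary label).
X, y = make_classification(
    n_samples=300, n_features=13, n_informative=6, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# chi2 requires non-negative inputs, so rescale using statistics from the training split only.
scaler = MinMaxScaler().fit(X_train)
X_train_s = scaler.transform(X_train)
X_test_s = scaler.transform(X_test)

# Assumption: the hybrid keeps any feature ranked in the top K by either test.
K = 5
chi_mask = SelectKBest(chi2, k=K).fit(X_train_s, y_train).get_support()
anova_mask = SelectKBest(f_classif, k=K).fit(X_train_s, y_train).get_support()
hybrid_mask = chi_mask | anova_mask
selected = np.flatnonzero(hybrid_mask)

# Assumption: hypothetical base learners; logistic regression combines their predictions.
stack = StackingClassifier(
    estimators=[
        ("tree", DecisionTreeClassifier(random_state=0)),
        ("knn", KNeighborsClassifier()),
        ("nb", GaussianNB()),
    ],
    final_estimator=LogisticRegression(max_iter=1000),
    cv=5,
)
stack.fit(X_train_s[:, selected], y_train)
accuracy = stack.score(X_test_s[:, selected], y_test)

print(f"selected feature indices: {selected.tolist()}")
print(f"held-out accuracy: {accuracy:.4f}")
```

The accuracy printed here reflects synthetic data and has no bearing on the 93.44% reported in the abstract. Taking the union of the two rankings is only one way to combine them; an intersection or a rank average would be equally plausible readings of "hybrid".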
Pages: 158-165
Number of pages: 8