Exploring Important Factors in Predicting Heart Disease Based on Ensemble-Extra Feature Selection Approach

被引:2
作者
Abubaker, Howida [1 ]
Muchtar, Farkhana [1 ]
Khairuddin, Alif Ridzuan [1 ]
Nuar, Ahmad Najmi Amerhaider [1 ]
Yunos, Zuriahati Mohd [1 ]
Salimun, Carolyn [2 ]
机构
[1] Univ Teknol Malaysia, Fac Comp, Johor Baharu 81310, Johor, Malaysia
[2] Univ Malaysia Sabah, Fac Comp & Informat, Jalan UMS, Kota Kinabalu 88400, Sabah, Malaysia
关键词
Extra Tree; Feature selection; Feature subsets; Heart Disease Dataset; Machine learning; SYSTEM;
D O I
10.21123/bsj.2024.9711
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Heart disease is a significant and impactful health condition that ranks as the leading cause of death in many countries. In order to aid physicians in diagnosing cardiovascular diseases, clinical datasets are available for reference. However, with the rise of big data and medical datasets, it has become increasingly challenging for medical practitioners to accurately predict heart disease due to the abundance of unrelated and redundant features that hinder computational complexity and accuracy. As such, this study aims to identify the most discriminative features within high-dimensional datasets while minimizing complexity and improving accuracy through an Extra Tree feature selection based technique. The work study assesses the efficacy of several classification algorithms on four reputable datasets, using both the full features set and the reduced features subset selected through the proposed method. The results show that the feature selection technique achieves outstanding classification accuracy, precision, and recall, with an impressive 97% accuracy when used with the Extra Tree classifier algorithm. The research reveals the promising potential of the feature selection method for improving classifier accuracy by focusing on the most informative features and simultaneously decreasing computational burden.
引用
收藏
页码:812 / 831
页数:20
相关论文
共 48 条
[1]   Feature Subset Selection for Malware Detection in Smart IoT Platforms [J].
Abawajy, Jemal ;
Darem, Abdulbasit ;
Alhashmi, Asma A. .
SENSORS, 2021, 21 (04) :1-19
[2]  
Abdollahi J., 2022, Iran. J. Comput. Sci., V5, P229, DOI DOI 10.1007/S42044-022-00104-X
[3]   Mixed Machine Learning Approach for Efficient Prediction of Human Heart Disease by Identifying the Numerical and Categorical Features [J].
Ahmad, Ghulab Nabi ;
Shafiullah ;
Fatima, Hira ;
Abbas, Mohamed ;
Rahman, Obaidur ;
Imdadullah ;
Alqahtani, Mohammed S. .
APPLIED SCIENCES-BASEL, 2022, 12 (15)
[4]   Stable bagging feature selection on medical data [J].
Alelyani, Salem .
JOURNAL OF BIG DATA, 2021, 8 (01)
[5]   Predicting Breast Cancer from Risk Factors Using SVM and Extra-Trees-Based Feature Selection Method [J].
Alfian, Ganjar ;
Syafrudin, Muhammad ;
Fahrurrozi, Imam ;
Fitriyani, Norma Latif ;
Atmaji, Fransiskus Tatas Dwi ;
Widodo, Tri ;
Bahiyah, Nurul ;
Benes, Filip ;
Rhee, Jongtae .
COMPUTERS, 2022, 11 (09)
[6]   A smart healthcare monitoring system for heart disease prediction based on ensemble deep learning and feature fusion [J].
Ali, Farman ;
El-Sappagh, Shaker ;
Islam, S. M. Riazul ;
Kwak, Daehan ;
Ali, Amjad ;
Imran, Muhammad ;
Kwak, Kyung-Sup .
INFORMATION FUSION, 2020, 63 :208-222
[7]   Coronary Artery Heart Disease Prediction: A Comparative Study of Computational Intelligence Techniques [J].
Ayon, Safial Islam ;
Islam, Md. Milon ;
Hossain, Md. Rahat .
IETE JOURNAL OF RESEARCH, 2022, 68 (04) :2488-2507
[8]   A systematic review on machine learning approaches for cardiovascular disease prediction using medical big data [J].
Azmi, Javed ;
Arif, Muhammad ;
Nafis, Md Tabrez ;
Alam, M. Afshar ;
Tanweer, Safdar ;
Wang, Guojun .
MEDICAL ENGINEERING & PHYSICS, 2022, 105
[9]  
Bashir S, 2019, INT BHURBAN C APPL S, P619, DOI 10.1109/IBCAST.2019.8667106
[10]   Prediction of Heart Disease Using a Combination of Machine Learning and Deep Learning [J].
Bharti, Rohit ;
Khamparia, Aditya ;
Shabaz, Mohammad ;
Dhiman, Gaurav ;
Pande, Sagar ;
Singh, Parneet .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021