Comparative Study on Heart Disease Prediction Using Feature Selection Techniques on Classification Algorithms

被引:44
作者
Dissanayake, Kaushalya [1 ,2 ]
Johar, Md Gapar Md [3 ]
机构
[1] Management & Sci Univ, Sch Grad Studies, Shah Alam, Malaysia
[2] Pioneer Inst Business & Technol, Sch Comp, Colombo 07, Sri Lanka
[3] Management & Sci Univ, Informat Technol Innovat Ctr, Shah Alam, Malaysia
关键词
FILTER;
D O I
10.1155/2021/5581806
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Heart disease is recognized as one of the leading factors of death rate worldwide. Biomedical instruments and various systems in hospitals have massive quantities of clinical data. Therefore, understanding the data related to heart disease is very important to improve prediction accuracy. This article has conducted an experimental evaluation of the performance of models created using classification algorithms and relevant features selected using various feature selection approaches. For results of the exploratory analysis, ten feature selection techniques, i.e., ANOVA, Chi-square, mutual information, ReliefF, forward feature selection, backward feature selection, exhaustive feature selection, recursive feature elimination, Lasso regression, and Ridge regression, and six classification approaches, i.e., decision tree, random forest, support vector machine, K-nearest neighbor, logistic regression, and Gaussian naive Bayes, have been applied to Cleveland heart disease dataset. The feature subset selected by the backward feature selection technique has achieved the highest classification accuracy of 88.52%, precision of 91.30%, sensitivity of 80.76%, and f-measure of 85.71% with the decision tree classifier.
引用
收藏
页数:17
相关论文
共 22 条
[1]  
Aggrawal Ritu., 2020, SN Computer Science, V1, P344, DOI [10.1007/s42979-020-00370-1, DOI 10.1007/S42979-020-00370-1]
[2]   Neural network and support vector machine for the prediction of chronic kidney disease: A comparative study [J].
Almansour, Njoud Abdullah ;
Syed, Hajra Fahim ;
Khayat, Nuha Radwan ;
Altheeb, Rawan Kanaan ;
Juri, Renad Emad ;
Alhiyafi, Jamal ;
Alrashed, Saleh ;
Olatunji, Sunday O. .
COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 109 :101-111
[3]  
Angayarkanni G, 2020, J XIAN U ARCHITECTUR, P686
[4]   Coronary Artery Heart Disease Prediction: A Comparative Study of Computational Intelligence Techniques [J].
Ayon, Safial Islam ;
Islam, Md. Milon ;
Hossain, Md. Rahat .
IETE JOURNAL OF RESEARCH, 2022, 68 (04) :2488-2507
[5]  
Chin A.J., 2016, IEEE T COMPUTATIONAL
[6]  
Choubey DK, 2020, INT J RECENT TECHNOL, V9, P212
[7]   An efficient convolutional neural network for coronary heart disease prediction [J].
Dutta, Aniruddha ;
Batabyal, Tamal ;
Basu, Meheli ;
Acton, Scott T. .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 159
[8]   A Hybrid CFS Filter and RF-RFE Wrapper-Based Feature Extraction for Enhanced Agricultural Crop Yield Prediction Modeling [J].
Elavarasan, Dhivya ;
Vincent, Durai Raj P. M. ;
Srinivasan, Kathiravan ;
Chang, Chuan-Yu .
AGRICULTURE-BASEL, 2020, 10 (09) :1-27
[9]   Improving the Accuracy for Analyzing Heart Diseases Prediction Based on the Ensemble Method [J].
Gao, Xiao-Yan ;
Amin Ali, Abdelmegeid ;
Shaban Hassan, Hassan ;
Anwar, Eman M. .
COMPLEXITY, 2021, 2021
[10]  
Garate-Escamila Anna Karen, 2020, Informatics in Medicine Unlocked, V19, P193, DOI 10.1016/j.imu.2020.100330