Score and Correlation Coefficient-Based Feature Selection for Predicting Heart Failure Diagnosis by Using Machine Learning Algorithms

被引:56
作者
Senan, Ebrahim Mohammed [1 ]
Abunadi, Ibrahim [2 ]
Jadhav, Mukti E. [3 ]
Fati, Suliman Mohamed [2 ]
机构
[1] Dr Babasaheb Ambedkar Marathwada Univ, Dept Comp Sci & Informat Technol, Aurangabad, Maharashtra, India
[2] Prince Sultan Univ, Informat Syst Dept, Riyadh, Saudi Arabia
[3] Shri Shivaji Sci & Arts Coll, Buldana, India
关键词
CLASSIFICATION; IDENTIFICATION; NETWORK;
D O I
10.1155/2021/8500314
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cardiovascular disease (CVD) is one of the most common causes of death that kills approximately 17 million people annually. The main reasons behind CVD are myocardial infarction and the failure of the heart to pump blood normally. Doctors could diagnose heart failure (HF) through electronic medical records on the basis of patient's symptoms and clinical laboratory investigations. However, accurate diagnosis of HF requires medical resources and expert practitioners that are not always available, thus making the diagnosing challengeable. Therefore, predicting the patients' condition by using machine learning algorithms is a necessity to save time and efforts. This paper proposed a machine-learning-based approach that distinguishes the most important correlated features amongst patients' electronic clinical records. The SelectKBest function was applied with chi-squared statistical method to determine the most important features, and then feature engineering method has been applied to create new features correlated strongly in order to train machine learning models and obtain promising results. Optimised hyperparameter classification algorithms SVM, KNN, Decision Tree, Random Forest, and Logistic Regression were used to train two different datasets. The first dataset, called Cleveland, consisted of 303 records. The second dataset, which was used for predicting HF, consisted of 299 records. Experimental results showed that the Random Forest algorithm achieved accuracy, precision, recall, and F1 scores of 95%, 97.62%, 95.35%, and 96.47%, respectively, during the test phase for the second dataset. The same algorithm achieved accuracy scores of 100% for the first dataset and 97.68% for the second dataset, while 100% precision, recall, and F1 scores were reached for both datasets.
引用
收藏
页数:16
相关论文
共 39 条
[1]   Improving risk prediction in heart failure using machine learning [J].
Adler, Eric D. ;
Voors, Adriaan A. ;
Klein, Liviu ;
Macheret, Fima ;
Braun, Oscar O. ;
Urey, Marcus A. ;
Zhu, Wenhong ;
Sama, Iziah ;
Tadel, Matevz ;
Campagnari, Claudio ;
Greenberg, Barry ;
Yagil, Avi .
EUROPEAN JOURNAL OF HEART FAILURE, 2020, 22 (01) :139-147
[2]   Modelling the Psychological Impact of COVID-19 in Saudi Arabia Using Machine Learning [J].
Aleid, Mohammed A. ;
Alyamani, Khaled A. Z. ;
Rahmouni, Mohieddine ;
Aldhyani, Theyazn H. H. ;
Alsharif, Nizar ;
Alzahrani, Mohammed Y. .
CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (02) :2029-2047
[3]  
American Heart Association, 2017, What is Cardiovascular Disease?
[4]   Identification of significant features and data mining techniques in predicting heart disease [J].
Amin, Mohammad Shafenoor ;
Chiam, Yin Kia ;
Varathan, Kasturi Dewi .
TELEMATICS AND INFORMATICS, 2019, 36 :82-93
[5]  
[Anonymous], 2016, WHO | Cardiovascular diseases (CVDs)
[6]   Computer aided decision making for heart disease detection using hybrid neural network-Genetic algorithm [J].
Arabasadi, Zeinab ;
Alizadehsani, Roohallah ;
Roshanzamir, Mohamad ;
Moosaei, Hossein ;
Yarifard, Ali Asghar .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2017, 141 :19-26
[7]   Effects of principle component analysis on assessment of coronary artery diseases using support vector machine [J].
Babaoglu, Ismail ;
Findik, Oguz ;
Bayrak, Mehmet .
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (03) :2182-2185
[8]  
Babu S, 2017, 2017 INTERNATIONAL CONFERENCE OF ELECTRONICS, COMMUNICATION AND AEROSPACE TECHNOLOGY (ICECA), VOL 1, P750, DOI 10.1109/ICECA.2017.8203643
[9]   Physician Prediction versus Model Predicted Prognosis in Ambulatory Patients with Heart Failure [J].
Buchan, T. A. ;
Ross, H. J. ;
McDonald, M. ;
Billia, F. ;
Delgado, D. ;
Posada, J. G. Duero ;
Luk, A. ;
Guyatt, G. H. ;
Alba, A. C. .
JOURNAL OF HEART AND LUNG TRANSPLANTATION, 2019, 38 (04) :S381-S381
[10]   Clinical profiles in acute heart failure: an urgent need for a new approach [J].
Chapman, Brittany ;
DeVore, Adam D. ;
Mentz, Robert J. ;
Metra, Marco .
ESC HEART FAILURE, 2019, 6 (03) :464-474