Influence of Optimal Hyperparameters on the Performance of Machine Learning Algorithms for Predicting Heart Disease

被引:20
作者
Ahamad, Ghulab Nabi [1 ]
Shafiullah [2 ]
Fatima, Hira [1 ]
Imdadullah [3 ]
Zakariya, S. M. [3 ]
Abbas, Mohamed [4 ]
Alqahtani, Mohammed S. [5 ,6 ]
Usman, Mohammed [4 ]
机构
[1] Mangalayatan Univ, Inst Appl Sci, Aligarh 202145, Uttar Pradesh, India
[2] BRA Bihar Univ, KCTC Coll, Dept Math, Muzaffarpur 842001, India
[3] Aligarh Muslim Univ, Univ Polytech, Elect Engn Sect, Aligarh 202002, Uttar Pradesh, India
[4] King Khalid Univ, Coll Engn, Elect Engn Dept, Abha 61421, Saudi Arabia
[5] King Khalid Univ, Coll Appl Med Sci, Radiol Sci Dept, Abha 61421, Saudi Arabia
[6] Univ Leicester, Space Res Ctr, BioImaging Unit, Michael Atiyah Bldg, Leicester LE1 7RH, Leics, England
关键词
heart disease prediction; UCI Kaggle dataset; machine learning algorithms; GridSearchCV; hyperparameters; FEATURE-SELECTION; DIAGNOSIS;
D O I
10.3390/pr11030734
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
One of the most difficult challenges in medicine is predicting heart disease at an early stage. In this study, six machine learning (ML) algorithms, viz., logistic regression, K-nearest neighbor, support vector machine, decision tree, random forest classifier, and extreme gradient boosting, were used to analyze two heart disease datasets. One dataset was UCI Kaggle Cleveland and the other was the comprehensive UCI Kaggle Cleveland, Hungary, Switzerland, and Long Beach V. The performance results of the machine learning techniques were obtained. The support vector machine with tuned hyperparameters achieved the highest testing accuracy of 87.91% for dataset-I and the extreme gradient boosting classifier with tuned hyperparameters achieved the highest testing accuracy of 99.03% for the comprehensive dataset-II. The novelty of this work was the use of grid search cross-validation to enhance the performance in the form of training and testing. The ideal parameters for predicting heart disease were identified through experimental results. Comparative studies were also carried out with the existing studies focusing on the prediction of heart disease, where the approach used in this work significantly outperformed their results.
引用
收藏
页数:28
相关论文
共 43 条
[31]   Intelligent heart disease prediction system using data mining techniques [J].
Palaniappan, Sellappan ;
Awang, Raflah .
2008 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2008, :108-115
[32]   How to optimize the adherence to a guideline-directed medical therapy in the secondary prevention of cardiovascular diseases: a clinical consensus statement from the European Association of Preventive Cardiology [J].
Pedretti, Roberto F. E. ;
Hansen, Dominique ;
Ambrosetti, Marco ;
Back, Maria ;
Berger, Thomas ;
Ferreira, Mariana Cordeiro ;
Cornelissen, Veronique ;
Davos, Constantinos H. ;
Doehner, Wolfram ;
Zarzosa, Carmen de Pablo Y. ;
Frederix, Ines ;
Greco, Andrea ;
Kurpas, Donata ;
Michal, Matthias ;
Osto, Elena ;
Pedersen, Susanne ;
Salvador, Rita Esmeralda ;
Simonenko, Maria ;
Steca, Patrizia ;
Thompson, David R. ;
Wilhelm, Matthias ;
Abreu, Ana .
EUROPEAN JOURNAL OF PREVENTIVE CARDIOLOGY, 2023, 30 (02) :149-166
[33]   A hybrid approach to medical decision support systems:: Combining feature selection, fuzzy weighted pre-processing and AIRS [J].
Polat, Kemal ;
Gunes, Salih .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2007, 88 (02) :164-174
[34]   Hybrid genetic algorithm and a fuzzy logic classifier for heart disease diagnosis [J].
Reddy, G. Thippa ;
Reddy, M. Praveen Kumar ;
Lakshmanna, Kuruva ;
Rajput, Dharmendra Singh ;
Kaluri, Rajesh ;
Srivastava, Gautam .
EVOLUTIONARY INTELLIGENCE, 2020, 13 (02) :185-196
[35]   An Efficient Prediction System for Coronary Heart Disease Risk Using Selected Principal Components and Hyperparameter Optimization [J].
Reddy, Karna Vishnu Vardhana ;
Elamvazuthi, Irraivan ;
Abd Aziz, Azrina ;
Paramasivam, Sivajothi ;
Chua, Hui Na ;
Pranavanand, Satyamurthy .
APPLIED SCIENCES-BASEL, 2023, 13 (01)
[36]   Heart Disease Prediction Using Decision Tree and SVM [J].
Saraswathi, R. Vijaya ;
Gajavelly, Kovid ;
Nikath, A. Kousar ;
Vasavi, R. ;
Anumasula, Rakshith Reddy .
PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND COMMUNICATION SYSTEMS, ICACECS 2021, 2022, :69-78
[37]   Score and Correlation Coefficient-Based Feature Selection for Predicting Heart Failure Diagnosis by Using Machine Learning Algorithms [J].
Senan, Ebrahim Mohammed ;
Abunadi, Ibrahim ;
Jadhav, Mukti E. ;
Fati, Suliman Mohamed .
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
[38]  
Singh R, 2019, INT J COMPUT SCI ENG, V7, P861
[39]  
UCI Machine Learning Repository, HEART DIS DAT
[40]   Medical Knowledge Acquisition through Data Mining [J].
Wang, Hai ;
Wang, Shouhong .
2008 IEEE INTERNATIONAL SYMPOSIUM ON IT IN MEDICINE AND EDUCATION, VOLS 1 AND 2, PROCEEDINGS, 2008, :777-+