Influence of Optimal Hyperparameters on the Performance of Machine Learning Algorithms for Predicting Heart Disease

被引:13
作者
Ahamad, Ghulab Nabi [1 ]
Shafiullah [2 ]
Fatima, Hira [1 ]
Imdadullah [3 ]
Zakariya, S. M. [3 ]
Abbas, Mohamed [4 ]
Alqahtani, Mohammed S. [5 ,6 ]
Usman, Mohammed [4 ]
机构
[1] Mangalayatan Univ, Inst Appl Sci, Aligarh 202145, Uttar Pradesh, India
[2] BRA Bihar Univ, KCTC Coll, Dept Math, Muzaffarpur 842001, India
[3] Aligarh Muslim Univ, Univ Polytech, Elect Engn Sect, Aligarh 202002, Uttar Pradesh, India
[4] King Khalid Univ, Coll Engn, Elect Engn Dept, Abha 61421, Saudi Arabia
[5] King Khalid Univ, Coll Appl Med Sci, Radiol Sci Dept, Abha 61421, Saudi Arabia
[6] Univ Leicester, Space Res Ctr, BioImaging Unit, Michael Atiyah Bldg, Leicester LE1 7RH, Leics, England
关键词
heart disease prediction; UCI Kaggle dataset; machine learning algorithms; GridSearchCV; hyperparameters; FEATURE-SELECTION; DIAGNOSIS;
D O I
10.3390/pr11030734
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
One of the most difficult challenges in medicine is predicting heart disease at an early stage. In this study, six machine learning (ML) algorithms, viz., logistic regression, K-nearest neighbor, support vector machine, decision tree, random forest classifier, and extreme gradient boosting, were used to analyze two heart disease datasets. One dataset was UCI Kaggle Cleveland and the other was the comprehensive UCI Kaggle Cleveland, Hungary, Switzerland, and Long Beach V. The performance results of the machine learning techniques were obtained. The support vector machine with tuned hyperparameters achieved the highest testing accuracy of 87.91% for dataset-I and the extreme gradient boosting classifier with tuned hyperparameters achieved the highest testing accuracy of 99.03% for the comprehensive dataset-II. The novelty of this work was the use of grid search cross-validation to enhance the performance in the form of training and testing. The ideal parameters for predicting heart disease were identified through experimental results. Comparative studies were also carried out with the existing studies focusing on the prediction of heart disease, where the approach used in this work significantly outperformed their results.
引用
收藏
页数:28
相关论文
共 43 条
  • [31] Intelligent heart disease prediction system using data mining techniques
    Palaniappan, Sellappan
    Awang, Raflah
    [J]. 2008 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2008, : 108 - 115
  • [32] How to optimize the adherence to a guideline-directed medical therapy in the secondary prevention of cardiovascular diseases: a clinical consensus statement from the European Association of Preventive Cardiology
    Pedretti, Roberto F. E.
    Hansen, Dominique
    Ambrosetti, Marco
    Back, Maria
    Berger, Thomas
    Ferreira, Mariana Cordeiro
    Cornelissen, Veronique
    Davos, Constantinos H.
    Doehner, Wolfram
    Zarzosa, Carmen de Pablo Y.
    Frederix, Ines
    Greco, Andrea
    Kurpas, Donata
    Michal, Matthias
    Osto, Elena
    Pedersen, Susanne
    Salvador, Rita Esmeralda
    Simonenko, Maria
    Steca, Patrizia
    Thompson, David R.
    Wilhelm, Matthias
    Abreu, Ana
    [J]. EUROPEAN JOURNAL OF PREVENTIVE CARDIOLOGY, 2023, 30 (02) : 149 - 166
  • [33] A hybrid approach to medical decision support systems:: Combining feature selection, fuzzy weighted pre-processing and AIRS
    Polat, Kemal
    Gunes, Salih
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2007, 88 (02) : 164 - 174
  • [34] Hybrid genetic algorithm and a fuzzy logic classifier for heart disease diagnosis
    Reddy, G. Thippa
    Reddy, M. Praveen Kumar
    Lakshmanna, Kuruva
    Rajput, Dharmendra Singh
    Kaluri, Rajesh
    Srivastava, Gautam
    [J]. EVOLUTIONARY INTELLIGENCE, 2020, 13 (02) : 185 - 196
  • [35] An Efficient Prediction System for Coronary Heart Disease Risk Using Selected Principal Components and Hyperparameter Optimization
    Reddy, Karna Vishnu Vardhana
    Elamvazuthi, Irraivan
    Abd Aziz, Azrina
    Paramasivam, Sivajothi
    Chua, Hui Na
    Pranavanand, Satyamurthy
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [36] Heart Disease Prediction Using Decision Tree and SVM
    Saraswathi, R. Vijaya
    Gajavelly, Kovid
    Nikath, A. Kousar
    Vasavi, R.
    Anumasula, Rakshith Reddy
    [J]. PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND COMMUNICATION SYSTEMS, ICACECS 2021, 2022, : 69 - 78
  • [37] Score and Correlation Coefficient-Based Feature Selection for Predicting Heart Failure Diagnosis by Using Machine Learning Algorithms
    Senan, Ebrahim Mohammed
    Abunadi, Ibrahim
    Jadhav, Mukti E.
    Fati, Suliman Mohamed
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
  • [38] Singh R, 2019, INT J COMPUT SCI ENG, V7, P861
  • [39] UCI Machine Learning Repository, HEART DIS DAT
  • [40] Medical Knowledge Acquisition through Data Mining
    Wang, Hai
    Wang, Shouhong
    [J]. 2008 IEEE INTERNATIONAL SYMPOSIUM ON IT IN MEDICINE AND EDUCATION, VOLS 1 AND 2, PROCEEDINGS, 2008, : 777 - +