Hyperparameter optimization: a comparative machine learning model analysis for enhanced heart disease prediction accuracy

被引:0
作者
Yagyanath Rimal
Navneet Sharma
机构
[1] IIS Deemed to be University,
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Bayesian optimization; Genetic optimization; GAsearchCV optimization; Optuna optimization; Gaussian; Random forest; Support vector machine; Principal component analysis;
D O I
暂无
中图分类号
学科分类号
摘要
An optimizer is the process of hyperparameter tuning that updates the machine learning model after each step of weight loss adjustment of input features. The permutation and combination of high and low learning rates with various step sizes ultimately leads to an optimal tuning model. The step size and learning rate sometimes take much smaller steps, allowing the derivatives of tangent to gradually reach global minima. The primary goal of this study is to compare the prediction accuracy of enhanced heart disease using various optimization algorithms. Heart disease treatment requires ensemble hyperparameter tuning for accurate prediction and classification due to multiple feature dependencies. The study analyzed model tuning techniques using the AUC and confusion matrix, revealing improvements in precision, recall, and f1 score from default to optimized models. The Hyper-opt in Bayesian optimizer and T-pot classifiers were used in genetic populations and offspring with 5 and 10 generations, while using Optuna optimization frozen trails was combined with a random forest algorithm. The default random forest (86.6%), Bayesian optimization with random forest (89%), and Bayesian optimization with support vector machines (90%) scored the highest accuracy among all. The generic algorithm with five generations (86.8%) and GAsearchCV with 10 generations (88.5%) scored the second highest accuracy, while Optuna's support vector machine model (84%) scored the least accuracy, respectively. This research further compares the machine learning accuracy, precision, recall, F1 score, macro average, and confusion matrix of each optimized model with their model's actual performance execution time. The predictive accuracy from exploratory data analysis and data pre-processing was further tested after the pipeline design of one-hot encoding and standard scaling of enhanced (31-featured) data sets and heart disease data (13 features). The gaussian algorithm (84%), logistic regression (83%), and classification models predict with higher accuracy than dummy classifiers (54%), when compared with standalone default machine learning models.
引用
收藏
页码:55091 / 55107
页数:16
相关论文
共 50 条
  • [21] Hyperparameter tuning of supervised bagging ensemble machine learning model using Bayesian optimization for estimating stormwater quality
    Moeini, Mohammadreza
    SUSTAINABLE WATER RESOURCES MANAGEMENT, 2024, 10 (02)
  • [22] Heart Disease Prediction using Machine Learning Techniques
    Shah D.
    Patel S.
    Bharti S.K.
    SN Computer Science, 2020, 1 (6)
  • [23] Enhanced PSA Density Prediction Accuracy When Based on Machine Learning
    Stojadinovic, Miroslav
    Milicevic, Bogdan
    Jankovic, Slobodan
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2023, 43 (03) : 249 - 257
  • [24] Enhanced PSA Density Prediction Accuracy When Based on Machine Learning
    Miroslav Stojadinovic
    Bogdan Milicevic
    Slobodan Jankovic
    Journal of Medical and Biological Engineering, 2023, 43 : 249 - 257
  • [25] An Outcome Based Analysis on Heart Disease Prediction using Machine Learning Algorithms and Data Mining Approaches
    Deb, Aushtmi
    Koli, Mst Sadia Akter
    Akter, Sheikh Beauty
    Chowdhury, Adil Ahmed
    2022 IEEE WORLD AI IOT CONGRESS (AIIOT), 2022, : 418 - 424
  • [26] Exploratory Data Analysis of Heart Disease Prediction using Machine Learning Techniques-RS Algorithm
    Vibha, M. B.
    Sneha, S. R.
    Kiran, U.
    Kiran, Y.
    2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024, 2024, : 209 - 216
  • [27] Multi-Objective Hyperparameter Optimization in Machine Learning—An Overview
    Karl F.
    Pielok T.
    Moosbauer J.
    Pfisterer F.
    Coors S.
    Binder M.
    Schneider L.
    Thomas J.
    Richter J.
    Lang M.
    Garrido-Merchán E.C.
    Branke J.
    Bischl B.
    ACM. Trans. Evol. Learn. Optim., 2023, 4
  • [28] Jucazinho Dam Streamflow Prediction: A Comparative Analysis of Machine Learning Techniques
    da Silva, Erickson Johny Galindo
    Coutinho, Artur Paiva
    Cardoso, Jean Firmino
    Bezerra, Saulo de Tarso Marques
    HYDROLOGY, 2024, 11 (07)
  • [29] Early prediction model for coronary heart disease using genetic algorithms, hyper-parameter optimization and machine learning techniques
    Priya R. L
    S. Vinila Jinny
    Yash Vijay Mate
    Health and Technology, 2021, 11 : 63 - 73
  • [30] Early prediction model for coronary heart disease using genetic algorithms, hyper-parameter optimization and machine learning techniques
    Priya, R. L.
    Jinny, S. Vinila
    Mate, Yash Vijay
    HEALTH AND TECHNOLOGY, 2021, 11 (01) : 63 - 73