Hyperparameter optimization: a comparative machine learning model analysis for enhanced heart disease prediction accuracy

被引:0
作者
Yagyanath Rimal
Navneet Sharma
机构
[1] IIS Deemed to be University,
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Bayesian optimization; Genetic optimization; GAsearchCV optimization; Optuna optimization; Gaussian; Random forest; Support vector machine; Principal component analysis;
D O I
暂无
中图分类号
学科分类号
摘要
An optimizer is the process of hyperparameter tuning that updates the machine learning model after each step of weight loss adjustment of input features. The permutation and combination of high and low learning rates with various step sizes ultimately leads to an optimal tuning model. The step size and learning rate sometimes take much smaller steps, allowing the derivatives of tangent to gradually reach global minima. The primary goal of this study is to compare the prediction accuracy of enhanced heart disease using various optimization algorithms. Heart disease treatment requires ensemble hyperparameter tuning for accurate prediction and classification due to multiple feature dependencies. The study analyzed model tuning techniques using the AUC and confusion matrix, revealing improvements in precision, recall, and f1 score from default to optimized models. The Hyper-opt in Bayesian optimizer and T-pot classifiers were used in genetic populations and offspring with 5 and 10 generations, while using Optuna optimization frozen trails was combined with a random forest algorithm. The default random forest (86.6%), Bayesian optimization with random forest (89%), and Bayesian optimization with support vector machines (90%) scored the highest accuracy among all. The generic algorithm with five generations (86.8%) and GAsearchCV with 10 generations (88.5%) scored the second highest accuracy, while Optuna's support vector machine model (84%) scored the least accuracy, respectively. This research further compares the machine learning accuracy, precision, recall, F1 score, macro average, and confusion matrix of each optimized model with their model's actual performance execution time. The predictive accuracy from exploratory data analysis and data pre-processing was further tested after the pipeline design of one-hot encoding and standard scaling of enhanced (31-featured) data sets and heart disease data (13 features). The gaussian algorithm (84%), logistic regression (83%), and classification models predict with higher accuracy than dummy classifiers (54%), when compared with standalone default machine learning models.
引用
收藏
页码:55091 / 55107
页数:16
相关论文
共 50 条
[31]   Early prediction model for coronary heart disease using genetic algorithms, hyper-parameter optimization and machine learning techniques [J].
Priya R. L ;
S. Vinila Jinny ;
Yash Vijay Mate .
Health and Technology, 2021, 11 :63-73
[32]   Early prediction model for coronary heart disease using genetic algorithms, hyper-parameter optimization and machine learning techniques [J].
Priya, R. L. ;
Jinny, S. Vinila ;
Mate, Yash Vijay .
HEALTH AND TECHNOLOGY, 2021, 11 (01) :63-73
[33]   Air Quality Forecasting Using Machine Learning: Comparative Analysis and Ensemble Strategies for Enhanced Prediction [J].
Ozupak, Yildirim ;
Alpsalaz, Feyyaz ;
Aslan, Emrah .
WATER AIR AND SOIL POLLUTION, 2025, 236 (07)
[34]   Structure Learning and Hyperparameter Optimization Using an Automated Machine Learning (AutoML) Pipeline [J].
Filippou, Konstantinos ;
Aifantis, George ;
Papakostas, George A. ;
Tsekouras, George E. .
INFORMATION, 2023, 14 (04)
[35]   Heart disease prediction using supervised machine learning algorithms: Performance analysis and comparison [J].
Ali, Md Mamun ;
Paul, Bikash Kumar ;
Ahmed, Kawsar ;
Bui, Francis M. ;
Quinn, Julian M. W. ;
Moni, Mohammad Ali .
COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 136
[36]   Bio-inspired disease prediction: harnessing the power of electric eel foraging optimization algorithm with machine learning for heart disease prediction [J].
Narasimhan, Geetha ;
Victor, Akila .
ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (12)
[37]   Heart Disease Prediction Using Modified Machine Learning Algorithm [J].
Kaur, Bavneet ;
Kaur, Gaganpreet .
INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 :189-201
[38]   Heart Failure Disease Prediction Using Machine Learning Models [J].
Tiburcio, Paola ;
Guerrero, Victor ;
Ponce, Hiram .
ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2022, PT I, 2022, 13612 :183-191
[39]   Prediction of Heart Disease using Decision Tree over Logistic Regression using Machine Learning with Improved Accuracy [J].
Raj, K. N. S. Shanmukha ;
Thinakaran, K. .
CARDIOMETRY, 2022, (25) :1514-1519
[40]   A Comparative Analysis of Machine Learning Classifiers and Ensemble Techniques in Financial Distress Prediction [J].
Sreedharan, Meenu ;
Khedr, Ahmed M. ;
El Bannany, Magdi .
PROCEEDINGS OF THE 2020 17TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD 2020), 2020, :653-657