Analysis and Enhancement of Prediction of Cardiovascular Disease Diagnosis using Machine Learning Models SVM, SGD, and XGBoost

Times cited: 0
Authors
Tomar, Sandeep [1]
Dembla, Deepak [1]
Chaba, Yogesh [2]
Affiliations
[1] JECRC Univ, Dept Comp Sci & Engn, Jaipur, Rajasthan, India
[2] GJU Sci & Technol, Dept Comp Sci & Engn, Hisar, Haryana, India
Keywords
CVD; SVM; SGD; XGBoost; classifiers; machine learning; ROC; accuracy; confusion matrix
DOI
10.14569/IJACSA.2024.0150449
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Cardiovascular disease (CVD), claiming 17.9 million lives annually, is exacerbated by factors like high blood pressure and obesity, prompting extensive data collection for deeper insights. Machine learning aids in accurate diagnosis, with techniques like SVM, SGD, and XGBoost proposed for heart disease prediction, addressing challenges such as data imbalance and optimizing diagnostic accuracy. This study integrates these algorithms to improve cardiovascular disease diagnosis, aiming to reduce mortality rates through timely interventions. This research investigates the efficacy of Support Vector Machine (SVM), Stochastic Gradient Descent (SGD), and XGBoost machine learning techniques for heart disease prediction. Analysis of the models' performance metrics reveals distinct characteristics and capabilities. SVM demonstrates robust performance with a training accuracy of 88.28% and a model accuracy score of 87.5%, exhibiting high precision and recall values across both classes. SGD, while commendable with a training accuracy of 83.65% and a model accuracy score of 84.24%, falls slightly behind SVM in accuracy and precision. XGBoost Classifier showcases perfect training accuracy but potential overfitting, yet demonstrates comparable precision and recall values to SVM. Overall, SVM emerges as the most effective model for heart disease prediction, followed by SGD and XGBoost Classifier. Further optimization and investigation into generalization capabilities are recommended to enhance the performance of SGD and XGBoost Classifier in clinical settings.
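The abstract compares the models by accuracy, precision, and recall, and reads a large gap between training and test accuracy as a sign of overfitting. A minimal sketch of how such figures are typically derived from confusion-matrix counts is given here. The function names and the example counts are hypothetical illustrations and are not taken from the paper; only the SVM and SGD accuracy percentages in the usage lines come from the abstract.

```python
def binary_metrics(tp, fp, fn, tn):
    """Return accuracy, precision, and recall from binary confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall}


def generalization_gap(train_accuracy, test_accuracy):
    """Training accuracy minus test accuracy, in the same units as the inputs.

    A large positive gap suggests the model may be overfitting.
    """
    return train_accuracy - test_accuracy


# Hypothetical counts, chosen only to illustrate the arithmetic.
example = binary_metrics(tp=40, fp=10, fn=5, tn=45)

# Accuracy percentages reported in the abstract (training, then model accuracy).
svm_gap = generalization_gap(88.28, 87.5)
sgd_gap = generalization_gap(83.65, 84.24)

if __name__ == "__main__":
    print(example)
    print(round(svm_gap, 2), round(sgd_gap, 2))
```

Under this reading, the SVM and SGD gaps are both under one percentage point, which is consistent with the abstract describing them as generalizing reasonably well. The abstract does not state a test accuracy for XGBoost, so its gap is left uncomputed.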
Pages: 469-479
Page count: 11