A proposed technique for predicting heart disease using machine learning algorithms and an explainable AI method

Cited by: 16
Authors
El-Sofany, Hosam [1]
Bouallegue, Belgacem [1, 2]
Abd El-Latif, Yasser M. [3]
Affiliations
[1] King Khalid Univ, Coll Comp Sci, Abha, Saudi Arabia
[2] Univ Monastir, Fac Sci Monastir, Elect & Microelect Lab EEL, Monastir, Tunisia
[3] Ain Shams Univ, Fac Sci, Cairo, Egypt
Keywords
Machine learning; Heart diseases; ML algorithms; SMOTE; SHAP
DOI
10.1038/s41598-024-74656-2
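Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract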
Accurately predicting a patient's risk of heart disease is a critical problem in medical data analysis, because early detection enables timely treatment and reduces mortality. Physicians cannot supervise patients continuously, however, and heart disease detection is not always accurate. By offering a more solid foundation for prediction and decision-making based on data provided by healthcare sectors worldwide, machine learning (ML) could help physicians predict and detect heart disease (HD). This study uses different feature selection strategies to produce an accurate ML algorithm for early heart disease prediction. We selected features using the chi-square, ANOVA, and mutual information methods; the three resulting feature groups are denoted SF-1, SF-2, and SF-3. The study employed ten ML algorithms to determine the most accurate technique and the best-fitting feature subset. The classification algorithms include support vector machines (SVM), XGBoost, bagging, decision trees (DT), and random forests (RF). We evaluated the proposed heart disease prediction technique using a private dataset, a public dataset, and different cross-validation methods. We used the Synthetic Minority Oversampling Technique (SMOTE) to handle imbalanced data and to identify the ML algorithm that achieves the most accurate heart disease predictions. With the proposed method, healthcare providers could identify early-stage heart disease quickly and at low cost. We have used the most effective ML algorithm to create a mobile app that instantly predicts heart disease based on the input symptoms.
The experimental results showed that the XGBoost algorithm performed best when applied to the combined datasets and the SF-2 feature subset, achieving 97.57% accuracy, 96.61% sensitivity, 90.48% specificity, 95.00% precision, a 92.68% F1 score, and a 98% AUC. We have also developed an explainable AI method based on SHAP to understand how the system makes its final predictions.
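For readers unfamiliar with the reported metrics, the following minimal sketch shows how accuracy, sensitivity, specificity, precision, and the F1 score are conventionally derived from a binary confusion matrix. It is not the authors' code: the `ConfusionCounts` type, the `classification_metrics` function, and the example counts are hypothetical and chosen purely for illustration, so the output does not reproduce the figures in the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Binary confusion-matrix counts (hypothetical helper type)."""
    tp: int  # diseased patients predicted as diseased
    fp: int  # healthy patients predicted as diseased
    tn: int  # healthy patients predicted as healthy
    fn: int  # diseased patients predicted as healthy


def classification_metrics(c: ConfusionCounts) -> dict[str, float]:
    """Return the standard metrics; assumes every denominator is non-zero."""
    accuracy = (c.tp + c.tn) / (c.tp + c.fp + c.tn + c.fn)
    sensitivity = c.tp / (c.tp + c.fn)   # also called recall
    specificity = c.tn / (c.tn + c.fp)
    precision = c.tp / (c.tp + c.fp)
    # F1 is the harmonic mean of precision and sensitivity.
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
    }


# Illustrative counts only; NOT taken from the paper.
example = ConfusionCounts(tp=90, fp=5, tn=95, fn=10)
metrics = classification_metrics(example)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

The AUC is omitted because it depends on the ranking of predicted probabilities rather than on a single confusion matrix.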
Pages: 18