Survival Analysis and Machine Learning Models for Predicting Heart Failure Outcomes

被引:0
作者
ALQahtani, Naseem Mohammed [1 ]
Algarni, Abdulmohsen [2 ]
机构
[1] King Khalid Univ, Coll Comp Sci, Dept Informat & Comp Syst, Abha 61421, Saudi Arabia
[2] King Khalid Univ, Dept Comp Sci, Abha 61421, Saudi Arabia
关键词
Heart failure prediction; machine learning; cox proportional hazards model; random forest;
D O I
10.14569/IJACSA.2025.0160536
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
failure is still one of the prominent causes of morbidity and mortality globally, and thus, determining the principal factors influencing survival in patients becomes crucial. Being able to predict survival is critical for optimizing patient treatment and management. Heart failure, with its multifactorial and involvement of numerous clinical variables, complicates prediction of survival rates in patients. This study utilizes the "Heart Failure Clinical Records" dataset to analyze and predict patient survival based on two separate approaches: survival analysis and machine learning (ML) classification. Specifically, we employ the Cox Proportional Hazards Model to assess the influence of clinical variables like "age", "serum creatinine", and "ejection fraction" on survival durations. Additionally, machine learning classification models like K-Nearest Neighbors (KNN), Decision Trees (DT), and Random Forests (RF) are implemented to predict the binary response variable of survival (DEATH_EVENT). Data preprocessing is carried out using methods like feature scaling, imputation of missing values, and balancing the classes for the improvement of model performance. Among the evaluated models, the Random Forest classifier, when integrated with feature selection derived from the Cox model, reached the best performance with 96.2% accuracy and an AUC ROC of 0.987, outperforming all other approaches. The results indicate that integrating survival analysis with machine-learning techniques is effective in heart failure prediction outcomes, providing valuable support for patient management and clinical decision-making.
引用
收藏
页码:365 / 375
页数:11
相关论文
共 21 条
[1]   Survival analysis of heart failure patients: A case study [J].
Ahmad, Tanvir ;
Munir, Assia ;
Bhatti, Sajjad Haider ;
Aftab, Muhammad ;
Raza, Muhammad Ali .
PLOS ONE, 2017, 12 (07)
[2]  
ASUNCION A., 2007, UCI MACHINE LEARNING
[3]   Data preprocessing for heart disease classification: A systematic literature review [J].
Benhar, H. ;
Idri, A. ;
Fernandez-Aleman, J. L. .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 195
[4]   Improve Extremely Fast Decision Tree Performance through Training Dataset Size for Early Prediction of Heart Diseases [J].
Benllarch, Mariam ;
El Hadaj, Salah ;
Benhaddi, Meriem .
2019 4TH INTERNATIONAL CONFERENCE ON SYSTEMS OF COLLABORATION BIG DATA, INTERNET OF THINGS & SECURITY (SYSCOBIOTS 2019), 2019, :39-43
[5]   Joint use of over- and under-sampling techniques and cross-validation for the development and assessment of prediction models [J].
Blagus, Rok ;
Lusa, Lara .
BMC BIOINFORMATICS, 2015, 16
[6]  
Cunningham P., 2020, arXiv
[7]  
Curth A., 2024, ARXIV
[8]   Improving the Prediction of Heart Failure Patients' Survival Using SMOTE and Effective Data Mining Techniques [J].
Ishaq, Abid ;
Sadiq, Saima ;
Umer, Muhammad ;
Ullah, Saleem ;
Mirjalili, Seyedali ;
Rupapara, Vaibhav ;
Nappi, Michele .
IEEE ACCESS, 2021, 9 :39707-39716
[9]   Artificial Intelligence and Machine Learning in Cardiovascular Health Care [J].
Kilic, Arman .
ANNALS OF THORACIC SURGERY, 2020, 109 (05) :1323-1329
[10]   Machine Learning-Enhanced Survival Analysis: Identifying Significant Predictors of Mortality in Heart Failure [J].
Lee, Heejeong Jasmine ;
Yoo, Sang-Sun ;
Lee, Kang-Yoon .
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2024, 18 (09) :2495-2511