Using Machine Learning for Detection and Prediction of Chronic Diseases

Cited by: 0
Authors
Yanes, Nacim [1, 2]
Jamel, Leila [3]
Alabdullah, Bayan [3]
Ezz, Mohamed [4]
Mohamed Mostafa, Ayman [4]
Shabana, Hossameldeen [5]
Affiliations
[1] Manouba Univ, RIADI Lab, Manouba 2010, Tunisia
[2] Gabes Univ, Higher Inst Management Gabes, Gabes 6033, Tunisia
[3] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, POB 84428, Riyadh 11671, Saudi Arabia
[4] Jouf Univ, Coll Comp & Informat Sci, Sakaka 72388, Saudi Arabia
[5] Shaqra Univ, Coll Med, Shaqra 11961, Saudi Arabia
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Predictive models; Accuracy; Cardiac arrest; Diseases; Heart; Medical services; Data models; Prediction algorithms; Classification algorithms; Tuning; Heart attack prediction; ensemble model; chronic diseases; class imbalance; ML classifiers; model transparency;
DOI
10.1109/ACCESS.2024.3494839
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Heart attacks are a leading cause of mortality worldwide, which makes accurate predictive models essential for early detection and intervention. This study addresses class imbalance in medical datasets, focusing on heart attack prediction with the Behavioral Risk Factor Surveillance System (BRFSS) dataset. After rigorous data cleaning and preparation, the refined dataset comprises 399,875 instances and 47 significant features. Balanced accuracy and macro-recall were chosen as the primary metrics to ensure fair performance evaluation across classes in the imbalanced dataset. The proposed system evaluates in detail several algorithms known for their effectiveness in handling class imbalance. The LGBM Classifier, XGB Classifier, and Logistic Regression (LR) are optimized using recursive feature elimination and hyperparameter tuning with Optuna. The results are consolidated in an ensemble model that significantly enhances predictive accuracy. The final model achieved 80.75% balanced accuracy and 79.97% recall for critical heart attack cases (class 1), along with an AUC score of 88.9%, indicating a strong ability to distinguish between the classes. In addition, SHAP (SHapley Additive exPlanations) analysis showed how each feature contributes to heart attack likelihood, improving model transparency. By combining complex ML techniques with interpretability analyses such as SHAP, this study advances early detection and intervention strategies in healthcare. It demonstrates the potential of sophisticated ML approaches for early heart attack detection and prevention, and their value in improving outcomes for patients with chronic diseases.
These findings suggest promising pathways for employing advanced analytical tools in healthcare to enhance patient care.
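The abstract names balanced accuracy and macro-recall as its primary metrics for the imbalanced data. As a rough illustration of why plain accuracy can mislead here, the sketch below computes per-class recall and its unweighted mean in pure Python. It is not the authors' code: the function names and the toy labels are assumptions made up for this example. Under the usual definition (the one used, for instance, by scikit-learn's `balanced_accuracy_score`), balanced accuracy is the mean of per-class recall, so it coincides with macro-recall.

```python
from collections import Counter


def per_class_recall(y_true, y_pred):
    """Return {label: recall} for every label that occurs in y_true."""
    support = Counter(y_true)  # number of true samples per class
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)  # correct per class
    return {label: hits[label] / support[label] for label in support}


def balanced_accuracy(y_true, y_pred):
    """Unweighted mean of per-class recall (equal to macro-recall)."""
    recalls = per_class_recall(y_true, y_pred)
    return sum(recalls.values()) / len(recalls)


def plain_accuracy(y_true, y_pred):
    """Fraction of samples predicted correctly, ignoring class sizes."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Toy, made-up labels: 8 negatives (class 0) and 2 positives (class 1).
y_true = [0] * 8 + [1] * 2
# A degenerate classifier that always predicts the majority class.
y_pred = [0] * 10

print(plain_accuracy(y_true, y_pred))     # 0.8
print(balanced_accuracy(y_true, y_pred))  # 0.5
```

In this toy case the always-negative classifier looks acceptable by plain accuracy, yet it finds none of the class 1 cases; balanced accuracy exposes that by weighting both classes equally.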
Pages: 177674 - 177691
Number of pages: 18