Performance analysis of supervised classification models on heart disease prediction

Cited by: 8
Authors
Ogundepo, Ezekiel Adebayo [1]
Yahya, Waheed Babatunde [1]
Affiliations
[1] Univ Ilorin, Dept Stat, Ilorin, Nigeria
Keywords
Classifiers; Model selection; Feature selection; Exploratory data analysis; Evaluation metrics
DOI
10.1007/s11334-022-00524-9
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
This paper presents a predictive analysis of data on heart disease patients to determine the possible risk factors associated with their heart disease status. Two independent (but similar) published heart disease datasets were analysed: the Cleveland data, used to build the classification models, and the Statlog data, used to validate the results. A detailed exploratory analysis using the Chi-square test of independence was performed on the Cleveland data, after which ten standard classification models were trained for class prediction. The classification models were built by randomly partitioning the Cleveland data into 208 (70%) training samples and 89 (30%) test samples over 200 replications. Preliminary results showed that some of the bio-clinical categorical variables are strongly associated with the heart disease conditions of the patients (p < 0.001). The classification results from the test samples indicated that the support vector machine yielded the best predictive performance, with 85% accuracy, 82% sensitivity, 88% specificity, 87% precision, 91% area under the ROC curve, and a log loss of 0.38. These results were validated on the Statlog data using tenfold cross-validation, and the validation results were all consistent with those obtained from the Cleveland dataset.
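The evaluation protocol described in the abstract can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the authors' code: the function names `split_indices` and `binary_metrics`, the 0.5 decision threshold, and the toy labels and probabilities are all assumptions introduced here. It shows how a 70/30 random partition of 297 samples gives 208 training and 89 test cases, and how accuracy, sensitivity, specificity, precision, and log loss are computed from predicted probabilities.

```python
import math
import random


def split_indices(n, train_frac, rng):
    """Randomly partition range(n) into train and test index lists."""
    idx = list(range(n))
    rng.shuffle(idx)
    n_train = round(n * train_frac)
    return idx[:n_train], idx[n_train:]


def _ratio(num, den):
    """Return num / den, or NaN when the denominator is zero."""
    return num / den if den else float("nan")


def binary_metrics(y_true, y_prob, threshold=0.5):
    """Compute test-set metrics for 0/1 labels and predicted P(class = 1).

    The 0.5 threshold is an assumption; the abstract does not state one.
    """
    tp = fp = tn = fn = 0
    for y, p in zip(y_true, y_prob):
        pred = 1 if p >= threshold else 0
        if y == 1 and pred == 1:
            tp += 1
        elif y == 1 and pred == 0:
            fn += 1
        elif y == 0 and pred == 1:
            fp += 1
        else:
            tn += 1

    eps = 1e-15  # clip probabilities so log(0) never occurs
    log_loss = -sum(
        y * math.log(min(max(p, eps), 1 - eps))
        + (1 - y) * math.log(min(max(1 - p, eps), 1 - eps))
        for y, p in zip(y_true, y_prob)
    ) / len(y_true)

    return {
        "accuracy": _ratio(tp + tn, tp + tn + fp + fn),
        "sensitivity": _ratio(tp, tp + fn),  # true positive rate
        "specificity": _ratio(tn, tn + fp),  # true negative rate
        "precision": _ratio(tp, tp + fp),
        "log_loss": log_loss,
    }


# One replication of the 70/30 split on 297 samples (208 + 89 in the abstract).
train_idx, test_idx = split_indices(297, 0.7, random.Random(0))
print(len(train_idx), len(test_idx))

# Toy labels and probabilities, made up purely to exercise the function.
toy_metrics = binary_metrics([1, 1, 0, 0], [0.9, 0.4, 0.2, 0.6])
print(toy_metrics)
```

In a full replication study, the split and the metric computation would be repeated (200 times in the abstract) with a fitted classifier supplying the probabilities, and the metrics averaged across replications.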
Pages: 129-144
Page count: 16
Related papers
  • [1] Performance analysis of supervised classification models on heart disease prediction
    Ezekiel Adebayo Ogundepo
    Waheed Babatunde Yahya
    Innovations in Systems and Software Engineering, 2023, 19 : 129 - 144
  • [2] Efficient Heart Disease Prediction Using Hybrid Deep Learning Classification Models
    Baviskar, Vaishali
    Verma, Madhushi
    Chatterjee, Pradeep
    Singal, Gaurav
    IRBM, 2023, 44 (05)
  • [3] Comparative Analysis of Supervised Models for Diamond Price Prediction
    Sharma, Garima
    Tripathi, Vikas
    Mahajan, Manish
    Srivastava, Awadhesh Kumar
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 1019 - 1022
  • [4] Dimensionality Reduction in Supervised Models-based for Heart Failure Prediction
    Escamilla, Anna Karen Garate
    El Hassani, Amir Hajjam
    Andres, Emmanuel
    ICPRAM: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2019, : 388 - 395
  • [5] A novel method for prediction of skin disease through supervised classification techniques
    Meena, K.
    Veni, N. N. Krishna
    Deepapriya, B. S.
    Vardhini, P. A. Harsha
    Kalyani, B. J. D.
    Sharmila, L.
    SOFT COMPUTING, 2022, 26 (19) : 10527 - 10533
  • [6] A Comprehensive Performance Analysis of Various Classifier Models for Coronary Artery Disease Prediction
    Balakrishnan, Baranidharan
    Kumar, Vinoth C. N. S.
    INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2021, 15 (04)
  • [7] Heart Disease Prediction Using Ensemble Tree Algorithms: A Supervised Learning Perspective
    Sakyi-Yeboah, Enoch
    Agyemang, Edmund Fosu
    Agbenyeavu, Vincent
    Osei-Nkwantabisa, Akua
    Kissi-Appiah, Priscilla
    Moshood, Lateef
    Agbota, Lawrence
    Nortey, Ezekiel N. N.
    APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2025, 2025 (01)
  • [8] Prediction of heart disease and classifiers' sensitivity analysis
    Almustafa, Khaled Mohamad
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [9] Prediction of heart disease and classifiers’ sensitivity analysis
    Khaled Mohamad Almustafa
    BMC Bioinformatics, 21
  • [10] A comprehensive analysis and performance evaluation for osteoporosis prediction models
    Alden, Zahraa Noor Aldeen M. Shams
    Ata, Oguz
    PEERJ COMPUTER SCIENCE, 2024, 10 : 1 - 28