HDPM: An Effective Heart Disease Prediction Model for a Clinical Decision Support System

被引:112
作者
Fitriyani, Norma Latif [1 ]
Syafrudin, Muhammad [1 ]
Alfian, Ganjar [2 ]
Rhee, Jongtae [1 ]
机构
[1] Dongguk Univ, Dept Ind & Syst Engn, Seoul 04620, South Korea
[2] Dongguk Univ, Nano Informat Technol Acad, Ind AI Res Ctr, Seoul 04626, South Korea
关键词
Heart; Diseases; Predictive models; Support vector machines; Data models; Radio frequency; Machine learning; Heart disease; disease prediction model; clinical decision support system; outlier data; imbalanced data; machine learning; CARDIOVASCULAR RISK; ROC CURVE; FEATURES; SELECTION; AREA;
D O I
10.1109/ACCESS.2020.3010511
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Heart disease, one of the major causes of mortality worldwide, can be mitigated by early heart disease diagnosis. A clinical decision support system (CDSS) can be used to diagnose the subjects' heart disease status earlier. This study proposes an effective heart disease prediction model (HDPM) for a CDSS which consists of Density-Based Spatial Clustering of Applications with Noise (DBSCAN) to detect and eliminate the outliers, a hybrid Synthetic Minority Over-sampling Technique-Edited Nearest Neighbor (SMOTE-ENN) to balance the training data distribution and XGBoost to predict heart disease. Two publicly available datasets (Statlog and Cleveland) were used to build the model and compare the results with those of other models (naive bayes (NB), logistic regression (LR), multilayer perceptron (MLP), support vector machine (SVM), decision tree (DT), and random forest (RF)) and of previous study results. The results revealed that the proposed model outperformed other models and previous study results by achieving accuracies of 95.90% and 98.40% for Statlog and Cleveland datasets, respectively. In addition, we designed and developed the prototype of the Heart Disease CDSS (HDCDSS) to help doctors/clinicians diagnose the patients'/subjects' heart disease status based on their current condition. Therefore, early treatment could be conducted to prevent the deaths caused by late heart disease diagnosis.
引用
收藏
页码:133034 / 133050
页数:17
相关论文
共 49 条
  • [1] A Personalized Healthcare Monitoring System for Diabetic Patients by Utilizing BLE-Based Sensors and Real-Time Data Processing
    Alfian, Ganjar
    Syafrudin, Muhammad
    Ijaz, Muhammad Fazal
    Syaekhoni, M. Alex
    Fitriyani, Norma Latif
    Rhee, Jongtae
    [J]. SENSORS, 2018, 18 (07)
  • [2] Real-Time Monitoring System Using Smartphone-Based Sensors and NoSQL Database for Perishable Supply Chain
    Alfian, Ganjar
    Syafrudin, Muhammad
    Rhee, Jongtae
    [J]. SUSTAINABILITY, 2017, 9 (11)
  • [3] An Optimized Stacked Support Vector Machines Based Expert System for the Effective Prediction of Heart Failure
    Ali, Liaqat
    Niamat, Awais
    Khan, Javed Ali
    Golilarz, Noorbakhsh Amiri
    Xiong Xingzhong
    Noor, Adeeb
    Nour, Redhwan
    Bukhari, Syed Ahmad Chan
    [J]. IEEE ACCESS, 2019, 7 : 54007 - 54014
  • [4] Identification of significant features and data mining techniques in predicting heart disease
    Amin, Mohammad Shafenoor
    Chiam, Yin Kia
    Varathan, Kasturi Dewi
    [J]. TELEMATICS AND INFORMATICS, 2019, 36 : 82 - 93
  • [5] [Anonymous], CARD DIS CVDS
  • [6] Batista G.E., 2004, ACM SIGKDD Explor. Newsl, V6, P20, DOI [DOI 10.1145/1007730.1007735, 10.1145/1007730.1007735]
  • [7] Benjamin EJ, 2019, CIRCULATION, V139, pE56, DOI [10.1161/CIR.0000000000000659, 10.1161/CIR.0000000000000746]
  • [8] Selection of relevant features and examples in machine learning
    Blum, AL
    Langley, P
    [J]. ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) : 245 - 271
  • [9] Patient education and provider decision support to control blood pressure in primary care: A cluster randomized trial
    Bosworth, Hayden B.
    Olsen, Maren K.
    Dudley, Tara
    Orr, Melinda
    Goldstein, Mary K.
    Datta, Santanu K.
    McCant, Felicia
    Gentry, Pam
    Simel, David L.
    Oddone, Eugene Z.
    [J]. AMERICAN HEART JOURNAL, 2009, 157 (03) : 450 - 456
  • [10] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794