An Effective Heart Disease Detection and Severity Level Classification Model Using Machine Learning and Hyperparameter Optimization Methods

被引:31
作者
Abdellatif, Abdallah [1 ]
Abdellatef, Hamdan [2 ]
Kanesan, Jeevan [1 ]
Chow, Chee-Onn [1 ]
Chuah, Joon Huang [1 ]
Gheni, Hassan Muwafaq [3 ]
机构
[1] Univ Malaya, Fac Engn, Dept Elect Engn, Kuala Lumpur 50603, Malaysia
[2] Lebanese Amer Univ, Elect & Comp Engn Dept, Sch Engn, Byblos, Lebanon
[3] Al Mustaqbal Univ Coll, Comp Tech Engn Dept, Hillah 51001, Iraq
关键词
Heart; Predictive models; Support vector machines; Classification tree analysis; Feature extraction; Radio frequency; Prediction algorithms; CVD detection; severity classification; hyperparameter optimization; extra trees; imbalance; hyperband; PERFORMANCE EVALUATION; PREDICTION; ALGORITHMS; SMOTE; SELECTION;
D O I
10.1109/ACCESS.2022.3191669
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardiovascular disease (CVD) is the leading cause of death worldwide. A Machine Learning (ML) system can predict CVD in the early stages to mitigate mortality rates based on clinical data. Recently, many research works utilized different machine learning approaches to detect CVD or identify the patient's severity level. Although these works obtained promising results, none focused on employing optimization methods to improve the ML model performance for CVD detection and severity-level classification. This study provides an effective method based on the Synthetic Minority Oversampling Technique (SMOTE) to handle imbalance distribution issue, six different ML classifiers to detect the patient status, and Hyperparameter Optimization (HPO) to find the best hyperparameter for ML classifier together with SMOTE. Two public datasets were used to build and test the model using all features. The results show that SMOTE and Extra Trees (ET) optimized using hyperband achieved higher results than other models and outperformed the state-of-the-art works by achieving 99.2% and 98.52% in CVD detection, respectively. Also, the developed model converged to 95.73% severity classification using the Cleveland dataset. The proposed model can help doctors determine a patient's current heart disease status. As a result, it is possible to prevent heart disease-related mortality by implementing early therapy.
引用
收藏
页码:79974 / 79985
页数:12
相关论文
共 42 条
[1]   An Optimized Stacked Support Vector Machines Based Expert System for the Effective Prediction of Heart Failure [J].
Ali, Liaqat ;
Niamat, Awais ;
Khan, Javed Ali ;
Golilarz, Noorbakhsh Amiri ;
Xiong Xingzhong ;
Noor, Adeeb ;
Nour, Redhwan ;
Bukhari, Syed Ahmad Chan .
IEEE ACCESS, 2019, 7 :54007-54014
[2]   Identification of significant features and data mining techniques in predicting heart disease [J].
Amin, Mohammad Shafenoor ;
Chiam, Yin Kia ;
Varathan, Kasturi Dewi .
TELEMATICS AND INFORMATICS, 2019, 36 :82-93
[3]  
[Anonymous], 2016, PROC 22 ACM SIGKDD I, DOI DOI 10.1145/2939672.2939785
[4]   Amended fused TOPSIS-VIKOR for classification (ATOVIC) applied to some UCI data sets [J].
Baccour, Leila .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 99 :115-125
[5]   Machine learning versus conventional clinical methods in guiding management of heart failure patients-a systematic review [J].
Bazoukis, George ;
Stavrakis, Stavros ;
Zhou, Jiandong ;
Bollepalli, Sandeep Chandra ;
Tse, Gary ;
Zhang, Qingpeng ;
Singh, Jagmeet P. ;
Armoundas, Antonis A. .
HEART FAILURE REVIEWS, 2021, 26 (01) :23-34
[6]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[7]   A 2G-RFID-BASED E-HEALTHCARE SYSTEM [J].
Chen, Min ;
Gonzalez, Sergio ;
Leung, Victor ;
Zhang, Qian ;
Li, Ming .
IEEE WIRELESS COMMUNICATIONS, 2010, 17 (01) :37-43
[8]  
Claesen M, 2015, Arxiv, DOI [arXiv:1502.02127, DOI 10.48550/ARXIV.1502.02127, 10.48550/ARXIV.1502.02127]
[9]  
Detrano R., **DATA OBJECT**
[10]  
Dua C. G. D., MACHINE LEARNING REP