An Effective Heart Disease Detection and Severity Level Classification Model Using Machine Learning and Hyperparameter Optimization Methods

被引:33
作者
Abdellatif, Abdallah [1 ]
Abdellatef, Hamdan [2 ]
Kanesan, Jeevan [1 ]
Chow, Chee-Onn [1 ]
Chuah, Joon Huang [1 ]
Gheni, Hassan Muwafaq [3 ]
机构
[1] Univ Malaya, Fac Engn, Dept Elect Engn, Kuala Lumpur 50603, Malaysia
[2] Lebanese Amer Univ, Elect & Comp Engn Dept, Sch Engn, Byblos, Lebanon
[3] Al Mustaqbal Univ Coll, Comp Tech Engn Dept, Hillah 51001, Iraq
关键词
Heart; Predictive models; Support vector machines; Classification tree analysis; Feature extraction; Radio frequency; Prediction algorithms; CVD detection; severity classification; hyperparameter optimization; extra trees; imbalance; hyperband; PERFORMANCE EVALUATION; PREDICTION; ALGORITHMS; SMOTE; SELECTION;
D O I
10.1109/ACCESS.2022.3191669
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardiovascular disease (CVD) is the leading cause of death worldwide. A Machine Learning (ML) system can predict CVD in the early stages to mitigate mortality rates based on clinical data. Recently, many research works utilized different machine learning approaches to detect CVD or identify the patient's severity level. Although these works obtained promising results, none focused on employing optimization methods to improve the ML model performance for CVD detection and severity-level classification. This study provides an effective method based on the Synthetic Minority Oversampling Technique (SMOTE) to handle imbalance distribution issue, six different ML classifiers to detect the patient status, and Hyperparameter Optimization (HPO) to find the best hyperparameter for ML classifier together with SMOTE. Two public datasets were used to build and test the model using all features. The results show that SMOTE and Extra Trees (ET) optimized using hyperband achieved higher results than other models and outperformed the state-of-the-art works by achieving 99.2% and 98.52% in CVD detection, respectively. Also, the developed model converged to 95.73% severity classification using the Cleveland dataset. The proposed model can help doctors determine a patient's current heart disease status. As a result, it is possible to prevent heart disease-related mortality by implementing early therapy.
引用
收藏
页码:79974 / 79985
页数:12
相关论文
共 42 条
[12]   SMOTE for Learning from Imbalanced Data: Progress and Challenges, Marking the 15-year Anniversary [J].
Fernandez, Alberto ;
Garcia, Salvador ;
Herrera, Francisco ;
Chawla, Nitesh V. .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2018, 61 :863-905
[13]   HDPM: An Effective Heart Disease Prediction Model for a Clinical Decision Support System [J].
Fitriyani, Norma Latif ;
Syafrudin, Muhammad ;
Alfian, Ganjar ;
Rhee, Jongtae .
IEEE ACCESS, 2020, 8 :133034-133050
[15]   Extremely randomized trees [J].
Geurts, P ;
Ernst, D ;
Wehenkel, L .
MACHINE LEARNING, 2006, 63 (01) :3-42
[16]   MIFH: A Machine Intelligence Framework for Heart Disease Diagnosis [J].
Gupta, Ankur ;
Kumar, Rahul ;
Arora, Harkirat Singh ;
Raman, Balasubramanian .
IEEE ACCESS, 2020, 8 :14659-14674
[17]   SVM and SVM Ensembles in Breast Cancer Prediction [J].
Huang, Min-Wei ;
Chen, Chih-Wen ;
Lin, Wei-Chao ;
Ke, Shih-Wen ;
Tsai, Chih-Fong .
PLOS ONE, 2017, 12 (01)
[18]   Improving the Prediction of Heart Failure Patients' Survival Using SMOTE and Effective Data Mining Techniques [J].
Ishaq, Abid ;
Sadiq, Saima ;
Umer, Muhammad ;
Ullah, Saleem ;
Mirjalili, Seyedali ;
Rupapara, Vaibhav ;
Nappi, Michele .
IEEE ACCESS, 2021, 9 :39707-39716
[19]   Efficient Heart Disease Prediction System using K-Nearest Neighbor Classification Technique [J].
Khateeb, Nida ;
Usman, Muhammad .
INTERNATIONAL CONFERENCE ON BIG DATA AND INTERNET OF THINGS (BDIOT 2017), 2017, :21-26
[20]   The severity prediction of the binary and multi-class cardiovascular disease-A machine learning-based fusion approach [J].
Kibria, Hafsa Binte ;
Matin, Abdul .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2022, 98