Ensemble framework for cardiovascular disease prediction

被引:55
作者
Tiwari, Achyut [1 ]
Chugh, Aryan [1 ]
Sharma, Aman [1 ]
机构
[1] Jaypee Univ Informat Technol, Dept Comp Sci & Engn, Waknaghat 173234, Himachal Prades, India
关键词
Machine learning; Algorithm; Stacked ensemble method; Cardiovascular disease (CVD); Heart disease dataset (Comprehensive); Risk; MACHINE;
D O I
10.1016/j.compbiomed.2022.105624
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Heart disease is the major cause of non-communicable and silent death worldwide. Heart diseases or cardiovascular diseases are classified into four types: coronary heart disease, heart failure, congenital heart disease, and cardiomyopathy. It is vital to diagnose heart disease early and accurately in order to avoid further injury and save patients' lives. As a result, we need a system that can predict cardiovascular disease before it becomes a critical situation. Machine learning has piqued the interest of researchers in the field of medical sciences. For heart disease prediction, researchers implement a variety of machine learning methods and approaches. In this work, to the best of our knowledge, we have used the dataset from IEEE Data Port which is one of the online available largest datasets for cardiovascular diseases individuals. The dataset isa combination of Hungarian, Cleveland, Long Beach VA, Switzerland & Statlog datasets with important features such as Maximum Heart Rate Achieved, Serum Cholesterol, Chest Pain Type, Fasting blood sugar, and so on. To assess the efficacy and strength of the developed model, several performance measures are used, such as ROC, AUC curve, specificity, F1-score, sensitivity, MCC, and accuracy. In this study, we have proposed a framework with a stacked ensemble classifier using several machine learning algorithms including ExtraTrees Classifier, Random Forest, XGBoost, and so on. Our proposed framework attained an accuracy of 92.34% which is higher than the existing literature.
引用
收藏
页数:13
相关论文
共 31 条
[1]   A comparative study of K-Nearest Neighbour, Support Vector Machine and Multi-Layer Perceptron for Thalassemia screening [J].
Amendolia, SR ;
Cossu, G ;
Ganadu, ML ;
Golosio, B ;
Masala, GL ;
Mura, GM .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2003, 69 (1-2) :13-20
[2]  
[Anonymous], PEOPLE RES INITIATIV
[3]  
[Anonymous], SUPPORT VECTOR MACHI
[4]  
Atallah R, 2019, 2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), P335
[5]   Predictive and Descriptive Analysis for Heart Disease Diagnosis [J].
Babic, Frantisek ;
Olejar, Jaroslav ;
Vantova, Zuzana ;
Paralic, Jan .
PROCEEDINGS OF THE 2017 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2017, :155-163
[6]   MV5: A Clinical Decision Support Framework for Heart Disease Prediction Using Majority Vote Based Classifier Ensemble [J].
Bashir, Saba ;
Qamar, Usman ;
Khan, Farhan Hassan ;
Javed, M. Younus .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2014, 39 (11) :7771-7783
[7]  
Bengio Yoshua., 2003, CIRANO WORKING PAPER
[8]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[9]   Classification of drugs in absorption classes using the classification and regression trees (CART) methodology [J].
Deconinck, E ;
Hancock, T ;
Coomans, D ;
Massart, DL ;
Vander Heyden, Y .
JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2005, 39 (1-2) :91-103
[10]   An ensemble model for Heart disease data sets: a generalized model [J].
El Bialy, Randa ;
Salama, Mostafa A. ;
Karam, Omar .
INTERNATIONAL CONFERENCE ON INFORMATICS AND SYSTEMS (INFOS 2016), 2016, :191-196