Efficient Prediction of Cardiovascular Disease Using Machine Learning Algorithms With Relief and LASSO Feature Selection Techniques

被引:146
|
作者
Ghosh, Pronab [1 ]
Azam, Sami [2 ]
Jonkman, Mirjam [2 ]
Karim, Asif [2 ]
Shamrat, F. M. Javed Mehedi [3 ]
Ignatious, Eva [2 ]
Shultana, Shahana [1 ]
Beeravolu, Abhijith Reddy [2 ]
De Boer, Friso [2 ]
机构
[1] Daffodil Int Univ, Dept Comp Sci & Engn, Dhaka 1225, Bangladesh
[2] Charles Darwin Univ, Coll Engn IT & Environm, Casuarina, NT 0810, Australia
[3] Govt Bangladesh, Minist Posts Telecommun & Informat Technol, Informat & Commun Technol Div, Dhaka 1000, Bangladesh
来源
IEEE ACCESS | 2021年 / 9卷
关键词
Heart; Predictive models; Prediction algorithms; Boosting; Support vector machines; Feature extraction; Classification algorithms; Heart disease; machine learning; CVD; relief feature selection; LASSO feature selection; decision tree; random forest; K-nearest neighbors; AdaBoost; and gradient boosting; HEART-FAILURE; DIAGNOSIS;
D O I
10.1109/ACCESS.2021.3053759
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardiovascular diseases (CVD) are among the most common serious illnesses affecting human health. CVDs may be prevented or mitigated by early diagnosis, and this may reduce mortality rates. Identifying risk factors using machine learning models is a promising approach. We would like to propose a model that incorporates different methods to achieve effective prediction of heart disease. For our proposed model to be successful, we have used efficient Data Collection, Data Pre-processing and Data Transformation methods to create accurate information for the training model. We have used a combined dataset (Cleveland, Long Beach VA, Switzerland, Hungarian and Stat log). Suitable features are selected by using the Relief, and Least Absolute Shrinkage and Selection Operator (LASSO) techniques. New hybrid classifiers like Decision Tree Bagging Method (DTBM), Random Forest Bagging Method (RFBM), K-Nearest Neighbors Bagging Method (KNNBM), AdaBoost Boosting Method (ABBM), and Gradient Boosting Boosting Method (GBBM) are developed by integrating the traditional classifiers with bagging and boosting methods, which are used in the training process. We have also instrumented some machine learning algorithms to calculate the Accuracy (ACC), Sensitivity (SEN), Error Rate, Precision (PRE) and F1 Score (F1) of our model, along with the Negative Predictive Value (NPR), False Positive Rate (FPR), and False Negative Rate (FNR). The results are shown separately to provide comparisons. Based on the result analysis, we can conclude that our proposed model produced the highest accuracy while using RFBM and Relief feature selection methods (99.05%).
引用
收藏
页码:19304 / 19326
页数:23
相关论文
共 50 条
  • [21] Fracture risk prediction in diabetes patients based on Lasso feature selection and Machine Learning
    Shi, Yu
    Fang, Junhua
    Li, Jiayi
    Yu, Kaiwen
    Zhu, Jingbo
    Lu, Yan
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING, 2024,
  • [22] Comparative Study on Heart Disease Prediction Using Feature Selection Techniques on Classification Algorithms
    Dissanayake, Kaushalya
    Johar, Md Gapar Md
    APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2021, 2021
  • [23] Improving Alzheimer's Disease Prediction with Different Machine Learning Approaches and Feature Selection Techniques
    Alshamlan, Hala
    Alwassel, Arwa
    Banafa, Atheer
    Alsaleem, Layan
    DIAGNOSTICS, 2024, 14 (19)
  • [24] Feature selection to detect botnets using machine learning algorithms
    Villegas Alejandre, Francisco
    Cruz Cortes, Nareli
    Aguirre Anaya, Eleazar
    2017 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND COMPUTERS (CONIELECOMP), 2017,
  • [25] Obsolescence Prediction based on Joint Feature Selection and Machine Learning Techniques
    Trabelsi, Imen
    Zeddini, Besma
    Zolghadri, Marc
    Barkallah, Maher
    Haddar, Mohamed
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 787 - 794
  • [26] Machine Learning-Based Cardiovascular Disease Detection Using Optimal Feature Selection
    Ullah, Tahseen
    Ullah, Syed Irfan
    Ullah, Khalil
    Ishaq, Muhammad
    Khan, Ahmad
    Ghadi, Yazeed Yasin
    Algarni, Abdulmohsen
    IEEE ACCESS, 2024, 12 : 16431 - 16446
  • [27] LASSO: A Feature Selection Technique In Predictive Modeling For Machine Learning
    Muthukrishnan, R.
    Rohini, R.
    2016 IEEE INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER APPLICATIONS (ICACA), 2016, : 18 - 20
  • [28] Efficient prediction of evaporation using ensemble feature selection techniques
    Sharma, Rakhee
    Singh, Archana
    Mittal, Mamta
    MAUSAM, 2023, 74 (04): : 951 - 962
  • [29] Chronic kidney disease prediction using machine learning techniques: a comparative study of feature selection methods with SMOTE and SHAP
    Gogoi, Prokash
    Valan, J. Arul
    MULTISCALE AND MULTIDISCIPLINARY MODELING EXPERIMENTS AND DESIGN, 2025, 8 (04)
  • [30] A Supervised Machine Learning Approach using Different Feature Selection Techniques on Voice Datasets for Prediction of Parkinson's Disease
    Aich, Satyabrata
    Kim, Hee-Cheol
    Younga, Kim
    Hui, Kueh Lee
    Al-Absi, Ahmed Abdulhakim
    Sain, Mangal
    2019 21ST INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ICT FOR 4TH INDUSTRIAL REVOLUTION, 2019, : 1116 - 1121