Efficient Prediction of Cardiovascular Disease Using Machine Learning Algorithms With Relief and LASSO Feature Selection Techniques

被引:146
|
作者
Ghosh, Pronab [1 ]
Azam, Sami [2 ]
Jonkman, Mirjam [2 ]
Karim, Asif [2 ]
Shamrat, F. M. Javed Mehedi [3 ]
Ignatious, Eva [2 ]
Shultana, Shahana [1 ]
Beeravolu, Abhijith Reddy [2 ]
De Boer, Friso [2 ]
机构
[1] Daffodil Int Univ, Dept Comp Sci & Engn, Dhaka 1225, Bangladesh
[2] Charles Darwin Univ, Coll Engn IT & Environm, Casuarina, NT 0810, Australia
[3] Govt Bangladesh, Minist Posts Telecommun & Informat Technol, Informat & Commun Technol Div, Dhaka 1000, Bangladesh
来源
IEEE ACCESS | 2021年 / 9卷
关键词
Heart; Predictive models; Prediction algorithms; Boosting; Support vector machines; Feature extraction; Classification algorithms; Heart disease; machine learning; CVD; relief feature selection; LASSO feature selection; decision tree; random forest; K-nearest neighbors; AdaBoost; and gradient boosting; HEART-FAILURE; DIAGNOSIS;
D O I
10.1109/ACCESS.2021.3053759
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardiovascular diseases (CVD) are among the most common serious illnesses affecting human health. CVDs may be prevented or mitigated by early diagnosis, and this may reduce mortality rates. Identifying risk factors using machine learning models is a promising approach. We would like to propose a model that incorporates different methods to achieve effective prediction of heart disease. For our proposed model to be successful, we have used efficient Data Collection, Data Pre-processing and Data Transformation methods to create accurate information for the training model. We have used a combined dataset (Cleveland, Long Beach VA, Switzerland, Hungarian and Stat log). Suitable features are selected by using the Relief, and Least Absolute Shrinkage and Selection Operator (LASSO) techniques. New hybrid classifiers like Decision Tree Bagging Method (DTBM), Random Forest Bagging Method (RFBM), K-Nearest Neighbors Bagging Method (KNNBM), AdaBoost Boosting Method (ABBM), and Gradient Boosting Boosting Method (GBBM) are developed by integrating the traditional classifiers with bagging and boosting methods, which are used in the training process. We have also instrumented some machine learning algorithms to calculate the Accuracy (ACC), Sensitivity (SEN), Error Rate, Precision (PRE) and F1 Score (F1) of our model, along with the Negative Predictive Value (NPR), False Positive Rate (FPR), and False Negative Rate (FNR). The results are shown separately to provide comparisons. Based on the result analysis, we can conclude that our proposed model produced the highest accuracy while using RFBM and Relief feature selection methods (99.05%).
引用
收藏
页码:19304 / 19326
页数:23
相关论文
共 50 条
  • [41] Feature reduction for hepatocellular carcinoma prediction using machine learning algorithms
    Mostafa, Ghada
    Mahmoud, Hamdi
    Abd El-Hafeez, Tarek
    Elaraby, Mohamed E.
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [42] Analysis of Cardiovascular Diseases Prediction Using Machine Learning Classification Algorithms
    Srivastava, Srishti
    Upreti, Kamal
    Shanbhog, Manjula
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [43] Efficient Feature Selection and Machine Learning Based ADHD Detection Using EEG Signal
    Maniruzzaman, Md.
    Shin, Jungpil
    Hasan, Md. Al Mehedi
    Yasumura, Akira
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (03): : 5179 - 5195
  • [44] Effective Feature Engineering Technique for Heart Disease Prediction With Machine Learning
    Qadri, Azam Mehmood
    Raza, Ali
    Munir, Kashif
    Almutairi, Mubarak S.
    IEEE ACCESS, 2023, 11 : 56214 - 56224
  • [45] Solar Flare Prediction Using Advanced Feature Extraction, Machine Learning, and Feature Selection
    Ahmed, Omar W.
    Qahwaji, Rami
    Colak, Tufan
    Higgins, Paul A.
    Gallagher, Peter T.
    Bloomfield, D. Shaun
    SOLAR PHYSICS, 2013, 283 (01) : 157 - 175
  • [46] Comparative Study of Optimum Medical Diagnosis of Human Heart Disease Using Machine Learning Technique With and Without Sequential Feature Selection
    Ahmad, Ghulab Nabi
    Shafiullah
    Algethami, Abdullah
    Fatima, Hira
    Akhter, Syed Md Humayun
    IEEE ACCESS, 2022, 10 : 23808 - 23828
  • [47] Feature selection to detect botnets using machine learning algorithms
    Villegas Alejandre, Francisco
    Cruz Cortes, Nareli
    Aguirre Anaya, Eleazar
    2017 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND COMPUTERS (CONIELECOMP), 2017,
  • [48] Solar Flare Prediction Using Advanced Feature Extraction, Machine Learning, and Feature Selection
    Omar W. Ahmed
    Rami Qahwaji
    Tufan Colak
    Paul A. Higgins
    Peter T. Gallagher
    D. Shaun Bloomfield
    Solar Physics, 2013, 283 : 157 - 175
  • [49] Efficient early prediction and diagnosis of diseases using machine learning algorithms for IoMT data
    Elbasi, Ersin
    Zreikat, Aymen, I
    2021 IEEE WORLD AI IOT CONGRESS (AIIOT), 2021, : 155 - 159
  • [50] Prediction of fatty liver disease using machine learning algorithms
    Wu, Chieh-Chen
    Yeh, Wen-Chun
    Hsu, Wen-Ding
    Islam, Md. Mohaimenul
    Phung Anh Nguyen
    Poly, Tahmina Nasrin
    Wang, Yao-Chin
    Yang, Hsuan-Chia
    Li, Yu-Chuan
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2019, 170 : 23 - 29