Efficient Prediction of Cardiovascular Disease Using Machine Learning Algorithms With Relief and LASSO Feature Selection Techniques

被引:146
|
作者
Ghosh, Pronab [1 ]
Azam, Sami [2 ]
Jonkman, Mirjam [2 ]
Karim, Asif [2 ]
Shamrat, F. M. Javed Mehedi [3 ]
Ignatious, Eva [2 ]
Shultana, Shahana [1 ]
Beeravolu, Abhijith Reddy [2 ]
De Boer, Friso [2 ]
机构
[1] Daffodil Int Univ, Dept Comp Sci & Engn, Dhaka 1225, Bangladesh
[2] Charles Darwin Univ, Coll Engn IT & Environm, Casuarina, NT 0810, Australia
[3] Govt Bangladesh, Minist Posts Telecommun & Informat Technol, Informat & Commun Technol Div, Dhaka 1000, Bangladesh
来源
IEEE ACCESS | 2021年 / 9卷
关键词
Heart; Predictive models; Prediction algorithms; Boosting; Support vector machines; Feature extraction; Classification algorithms; Heart disease; machine learning; CVD; relief feature selection; LASSO feature selection; decision tree; random forest; K-nearest neighbors; AdaBoost; and gradient boosting; HEART-FAILURE; DIAGNOSIS;
D O I
10.1109/ACCESS.2021.3053759
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardiovascular diseases (CVD) are among the most common serious illnesses affecting human health. CVDs may be prevented or mitigated by early diagnosis, and this may reduce mortality rates. Identifying risk factors using machine learning models is a promising approach. We would like to propose a model that incorporates different methods to achieve effective prediction of heart disease. For our proposed model to be successful, we have used efficient Data Collection, Data Pre-processing and Data Transformation methods to create accurate information for the training model. We have used a combined dataset (Cleveland, Long Beach VA, Switzerland, Hungarian and Stat log). Suitable features are selected by using the Relief, and Least Absolute Shrinkage and Selection Operator (LASSO) techniques. New hybrid classifiers like Decision Tree Bagging Method (DTBM), Random Forest Bagging Method (RFBM), K-Nearest Neighbors Bagging Method (KNNBM), AdaBoost Boosting Method (ABBM), and Gradient Boosting Boosting Method (GBBM) are developed by integrating the traditional classifiers with bagging and boosting methods, which are used in the training process. We have also instrumented some machine learning algorithms to calculate the Accuracy (ACC), Sensitivity (SEN), Error Rate, Precision (PRE) and F1 Score (F1) of our model, along with the Negative Predictive Value (NPR), False Positive Rate (FPR), and False Negative Rate (FNR). The results are shown separately to provide comparisons. Based on the result analysis, we can conclude that our proposed model produced the highest accuracy while using RFBM and Relief feature selection methods (99.05%).
引用
收藏
页码:19304 / 19326
页数:23
相关论文
共 50 条
  • [41] Prediction of Heart Disease Using Machine Learning Algorithms
    Krishnan, Santhana J.
    Geetha, S.
    PROCEEDINGS OF 2019 1ST INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION AND COMMUNICATION TECHNOLOGY (ICIICT 2019), 2019,
  • [42] Alzheimer Disease Prediction using Machine Learning Algorithms
    Neelaveni, J.
    Devasana, M. S. Geetha
    2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, : 101 - 104
  • [43] Heart Disease Prediction Using Machine Learning Algorithms
    Malavika, G.
    Rajathi, N.
    Vanitha, V.
    Parameswari, P.
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (11): : 24 - 27
  • [44] Diabetes Disease Prediction Using Machine Learning Algorithms
    Lyngdoh, Arwatki Chen
    Choudhury, Nurul Amin
    Moulik, Soumen
    2020 IEEE-EMBS CONFERENCE ON BIOMEDICAL ENGINEERING AND SCIENCES (IECBES 2020): LEADING MODERN HEALTHCARE TECHNOLOGY ENHANCING WELLNESS, 2021, : 517 - 521
  • [45] An RHMIoT Framework for Cardiovascular Disease Prediction and Severity Level Using Machine Learning and Deep Learning Algorithms
    Patro S.P.
    Padhy N.
    International Journal of Ambient Computing and Intelligence, 2022, 13 (01)
  • [46] Heart Disease Prediction Using Machine Learning Algorithms
    Mammen, Rea
    Pawar, Arti
    SMART SENSORS MEASUREMENT AND INSTRUMENTATION, CISCON 2021, 2023, 957 : 239 - 253
  • [47] Heart Disease Prediction Using Machine Learning Algorithms
    Jrab, Dina
    Eleyan, Derar
    Eleyan, Amna
    Bejaoui, Tarek
    2024 INTERNATIONAL CONFERENCE ON SMART APPLICATIONS, COMMUNICATIONS AND NETWORKING, SMARTNETS-2024, 2024,
  • [48] Heart Disease Prediction by Using Machine Learning Algorithms
    Erdogan, Alperen
    Guney, Selda
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [49] Systematic Analysis of Machine Learning and Feature Selection Techniques for Prediction of the Kp Index
    Zhelayskaya, I. S.
    Vasile, R.
    Shprits, Y. Y.
    Stolle, C.
    Matzka, J.
    SPACE WEATHER-THE INTERNATIONAL JOURNAL OF RESEARCH AND APPLICATIONS, 2019, 17 (10): : 1461 - 1486
  • [50] The Impact of Ontology on the Prediction of Cardiovascular Disease Compared to Machine Learning Algorithms
    El Massari, Hakim
    Gherabi, Noreddine
    Mhammedi, Sajida
    Ghandi, Hamza
    Bahaj, Mohamed
    Naqvi, Muhammad Raza
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2022, 18 (11) : 143 - 157