Efficient Prediction of Cardiovascular Disease Using Machine Learning Algorithms With Relief and LASSO Feature Selection Techniques

被引:146
|
作者
Ghosh, Pronab [1 ]
Azam, Sami [2 ]
Jonkman, Mirjam [2 ]
Karim, Asif [2 ]
Shamrat, F. M. Javed Mehedi [3 ]
Ignatious, Eva [2 ]
Shultana, Shahana [1 ]
Beeravolu, Abhijith Reddy [2 ]
De Boer, Friso [2 ]
机构
[1] Daffodil Int Univ, Dept Comp Sci & Engn, Dhaka 1225, Bangladesh
[2] Charles Darwin Univ, Coll Engn IT & Environm, Casuarina, NT 0810, Australia
[3] Govt Bangladesh, Minist Posts Telecommun & Informat Technol, Informat & Commun Technol Div, Dhaka 1000, Bangladesh
来源
IEEE ACCESS | 2021年 / 9卷
关键词
Heart; Predictive models; Prediction algorithms; Boosting; Support vector machines; Feature extraction; Classification algorithms; Heart disease; machine learning; CVD; relief feature selection; LASSO feature selection; decision tree; random forest; K-nearest neighbors; AdaBoost; and gradient boosting; HEART-FAILURE; DIAGNOSIS;
D O I
10.1109/ACCESS.2021.3053759
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cardiovascular diseases (CVD) are among the most common serious illnesses affecting human health. CVDs may be prevented or mitigated by early diagnosis, and this may reduce mortality rates. Identifying risk factors using machine learning models is a promising approach. We would like to propose a model that incorporates different methods to achieve effective prediction of heart disease. For our proposed model to be successful, we have used efficient Data Collection, Data Pre-processing and Data Transformation methods to create accurate information for the training model. We have used a combined dataset (Cleveland, Long Beach VA, Switzerland, Hungarian and Stat log). Suitable features are selected by using the Relief, and Least Absolute Shrinkage and Selection Operator (LASSO) techniques. New hybrid classifiers like Decision Tree Bagging Method (DTBM), Random Forest Bagging Method (RFBM), K-Nearest Neighbors Bagging Method (KNNBM), AdaBoost Boosting Method (ABBM), and Gradient Boosting Boosting Method (GBBM) are developed by integrating the traditional classifiers with bagging and boosting methods, which are used in the training process. We have also instrumented some machine learning algorithms to calculate the Accuracy (ACC), Sensitivity (SEN), Error Rate, Precision (PRE) and F1 Score (F1) of our model, along with the Negative Predictive Value (NPR), False Positive Rate (FPR), and False Negative Rate (FNR). The results are shown separately to provide comparisons. Based on the result analysis, we can conclude that our proposed model produced the highest accuracy while using RFBM and Relief feature selection methods (99.05%).
引用
收藏
页码:19304 / 19326
页数:23
相关论文
共 50 条
  • [31] Osteoporosis Detection Using Machine Learning Techniques and Feature Selection
    Iliou, Theodoros
    Anagnostopoulos, Christos-Nikolaos
    Anastassopoulos, George
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2014, 23 (05)
  • [32] Enhancing Parkinson's Disease Prediction Using Machine Learning and Feature Selection Methods
    Saeed, Faisal
    Al-Sarem, Mohammad
    Al-Mohaimeed, Muhannad
    Emara, Abdelhamid
    Boulila, Wadii
    Alasli, Mohammed
    Ghabban, Fahad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (03): : 5639 - 5657
  • [33] Cardiovascular Disease Risk Prediction with Supervised Machine Learning Techniques
    Dritsas, Elias
    Alexiou, Sotiris
    Moustakas, Konstantinos
    ICT4AWE: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES FOR AGEING WELL AND E-HEALTH, 2022, : 315 - 321
  • [34] Congestive heart failure prediction based on feature selection and machine learning algorithms
    Morillo-Velepucha, Diego
    Reategui, Ruth
    Valdiviezo-Diaz, Priscila
    Barba-Guaman, Luis
    2022 17TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2022,
  • [35] Using Machine Learning and Feature Selection for Alfalfa Yield Prediction
    Whitmire, Christopher D. D.
    Vance, Jonathan M. M.
    Rasheed, Hend K. K.
    Missaoui, Ali
    Rasheed, Khaled M. M.
    Maier, Frederick W. W.
    AI, 2021, 2 (01) : 71 - 88
  • [36] Sarcopenia feature selection and risk prediction using machine learning
    Yoo, Jun-Il
    Park, Chan-Ho
    Kim, Hyeonmok
    JOURNAL OF BONE AND MINERAL RESEARCH, 2019, 34 : 145 - 145
  • [37] Prediction of Heart Failure by using Machine Learning and Feature Selection
    Aslam, Muhammad Haseeb
    Hussain, Syed Fawad
    2022 17TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET'22), 2022, : 160 - 165
  • [38] Enhanced Cardiovascular Disease Prediction Modelling using Machine Learning Techniques: A Focus on CardioVitalnet
    Ejiyi, Chukwuebuka Joseph
    Qin, Zhen
    Nneji, Grace Ugochi
    Monday, Happy Nkanta
    Agbesi, Victor K.
    Ejiyi, Makuachukwu Bennedith
    Ejiyi, Thomas Ugochukwu
    Bamisile, Olusola O.
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2024,
  • [39] PREDICTION OF TYPE 2 DIABETES MELLITUS USING FEATURE SELECTION-BASED MACHINE LEARNING ALGORITHMS
    Yilmaz, Atinc
    HEALTH PROBLEMS OF CIVILIZATION, 2022, 16 (02) : 128 - 139
  • [40] Multiple disease prediction using Machine learning algorithms
    Arumugam K.
    Naved M.
    Shinde P.P.
    Leiva-Chauca O.
    Huaman-Osorio A.
    Gonzales-Yanac T.
    Materials Today: Proceedings, 2023, 80 : 3682 - 3685