Identifying the Main Risk Factors for Cardiovascular Diseases Prediction Using Machine Learning Algorithms

被引:16
作者
Guarneros-Nolasco, Luis Rolando [1 ]
Cruz-Ramos, Nancy Aracely [1 ]
Alor-Hernandez, Giner [1 ]
Rodriguez-Mazahua, Lisbeth [1 ]
Sanchez-Cervantes, Jose Luis [2 ]
机构
[1] Tecnol Nacl Mexico IT Orizaba, Div Estudios Posgrad & Invest, Av Oriente 9 852 Col Emiliano Zapata, Orizaba 94320, Veracruz, Mexico
[2] CONACYT, Inst Tecnol Orizaba, Av Oriente 9 852 Col Emiliano Zapata, Orizaba 94320, Veracruz, Mexico
关键词
big data; health prevention; machine learning; medical data; PERFORMANCE EVALUATION; FEATURES;
D O I
10.3390/math9202537
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Cardiovascular Diseases (CVDs) are a leading cause of death globally. In CVDs, the heart is unable to deliver enough blood to other body regions. As an effective and accurate diagnosis of CVDs is essential for CVD prevention and treatment, machine learning (ML) techniques can be effectively and reliably used to discern patients suffering from a CVD from those who do not suffer from any heart condition. Namely, machine learning algorithms (MLAs) play a key role in the diagnosis of CVDs through predictive models that allow us to identify the main risks factors influencing CVD development. In this study, we analyze the performance of ten MLAs on two datasets for CVD prediction and two for CVD diagnosis. Algorithm performance is analyzed on top-two and top-four dataset attributes/features with respect to five performance metrics -accuracy, precision, recall, f1-score, and roc-auc-using the train-test split technique and k-fold cross-validation. Our study identifies the top-two and top-four attributes from CVD datasets analyzing the performance of the accuracy metrics to determine that they are the best for predicting and diagnosing CVD. As our main findings, the ten ML classifiers exhibited appropriate diagnosis in classification and predictive performance with accuracy metric with top-two attributes, identifying three main attributes for diagnosis and prediction of a CVD such as arrhythmia and tachycardia; hence, they can be successfully implemented for improving current CVD diagnosis efforts and help patients around the world, especially in regions where medical staff is lacking.
引用
收藏
页数:25
相关论文
共 42 条
[1]   Survival analysis of heart failure patients: A case study [J].
Ahmad, Tanvir ;
Munir, Assia ;
Bhatti, Sajjad Haider ;
Aftab, Muhammad ;
Raza, Muhammad Ali .
PLOS ONE, 2017, 12 (07)
[2]   Identification of significant features and data mining techniques in predicting heart disease [J].
Amin, Mohammad Shafenoor ;
Chiam, Yin Kia ;
Varathan, Kasturi Dewi .
TELEMATICS AND INFORMATICS, 2019, 36 :82-93
[3]  
[Anonymous], 2021, HMO EPSDT Sanction Tracking Report
[4]  
[Anonymous], 2010, Commun. Surveys Tuts., DOI DOI 10.1038/nature14539
[5]  
[Anonymous], 2021, IEEE Trans. Broadcast.
[6]   Coronary Artery Heart Disease Prediction: A Comparative Study of Computational Intelligence Techniques [J].
Ayon, Safial Islam ;
Islam, Md. Milon ;
Hossain, Md. Rahat .
IETE JOURNAL OF RESEARCH, 2022, 68 (04) :2488-2507
[7]   Performance Evaluation of Supervised Machine Learning Algorithms for Intrusion Detection [J].
Belavagi, Manjula C. ;
Muniyal, Balachandra .
TWELFTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2016 / TWELFTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2016 / TWELFTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2016, 2016, 89 :117-123
[8]   Glioma Segmentation and Classification System Based on Proposed Texture Features Extraction Method and Hybrid Ensemble Learning [J].
Bhatele, Kirti Raj ;
Bhadauria, Sarita Singh .
TRAITEMENT DU SIGNAL, 2020, 37 (06) :989-1001
[9]   Accelerated gradient boosting [J].
Biau, G. ;
Cadre, B. ;
Rouviere, L. .
MACHINE LEARNING, 2019, 108 (06) :971-992
[10]  
Brownlee J, 2017, MACHINE LEARNING MAS, P91