Towards comparing and using Machine Learning techniques for detecting and predicting Heart Attack and Diseases

被引:0
作者
Obasi, Thankgod [1 ]
Shafiq, M. Omair [1 ]
机构
[1] Carleton Univ, Sch Informat Technol, Ottawa, ON, Canada
来源
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2019年
关键词
Heart Disease; Heart Attack; Classification; Random Forest; Logistic Regression; Naive Bayes Classifier; Predicted Results; Decision Support System;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Heart diseases are one of the deadly but are silent killers for humans, which results in the increase in death rate of sufferers every year. The World Health Organization (WHO), in the year 2016, reported that 17.9 million deaths that occur worldwide per year are a result of heart disease. In the health care sector, enormous data are being generated on a daily basis, which contains different types of data, and acquiring knowledge from these data is essential. This knowledge can be acquired using various data mining techniques to mine knowledge by designing models from the medical records dataset. We implement a machine learning based system that can detect and predict heart diseases in patients using the medical records of patients. The proposed solution is based on existing techniques like Random Forest Bayesian Classification and Logistic Regression, which provides a decision support system for medical professionals to detect and predict heart diseases and heart attacks in humans or individuals using risk factors of heart disease. The dataset used in our model consists of 18 features (risk factors) and 1990 observations after performing preprocessing. It was then split into 80% train sets and 20% test sets. Using real medical records of patients, a series of experiments were conducted to examine the performance and accuracy of the proposed system. "the system was implemented in RStudio platform which predicts the risk of heart disease in patients. The compared results showed that the system performance and accuracy are acceptable with heart disease classification accuracy of 92.44% for Random Forest, 61.96%, and 59.7% for Naive Bayes Classifier and Logistic Regression, respectively.
引用
收藏
页码:2393 / 2402
页数:10
相关论文
共 30 条
[21]  
Radhimeenakshi S, 2016, PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, P3107
[22]  
Raghupathi W, 2014, HEALTH INF SCI SYST, V2, DOI 10.1186/2047-2501-2-3
[23]  
Rodriguez Carlos J., 2015, FERDINAND CURR CARDI, V9, P18
[24]  
Sameh G., 2013, J INTELLIGENT LEARNI, V5, P176, DOI DOI 10.4236/JILSA.2013.53019
[25]  
Singh J., 2016, P 2016 AI SAD INT C, P1, DOI [10.1109/WECON.2016.7993480, DOI 10.1109/AIC-MITCSA.2016.7759957]
[26]   Cardiogenic Causes of Fever [J].
Smid, Jan ;
Scherner, Maximilian ;
Wolfram, Oliver ;
Groscheck, Thomas ;
Wippermann, Jens ;
Braun-Dullaeus, Ruediger C. .
DEUTSCHES ARZTEBLATT INTERNATIONAL, 2018, 115 (12) :193-199
[27]  
Soni Devin, 2018, INTRO NAIVE BAYES CL
[28]  
Soni J., 2011, INT J COMPUTER APPL, V17
[29]  
Swati S, 2015, INT J ENG RES GEN SC, V3
[30]  
Umamaheswari M, 2017, 2017 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT TECHNIQUES IN CONTROL, OPTIMIZATION AND SIGNAL PROCESSING (INCOS)