Towards comparing and using Machine Learning techniques for detecting and predicting Heart Attack and Diseases

被引:0
作者
Obasi, Thankgod [1 ]
Shafiq, M. Omair [1 ]
机构
[1] Carleton Univ, Sch Informat Technol, Ottawa, ON, Canada
来源
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2019年
关键词
Heart Disease; Heart Attack; Classification; Random Forest; Logistic Regression; Naive Bayes Classifier; Predicted Results; Decision Support System;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Heart diseases are one of the deadly but are silent killers for humans, which results in the increase in death rate of sufferers every year. The World Health Organization (WHO), in the year 2016, reported that 17.9 million deaths that occur worldwide per year are a result of heart disease. In the health care sector, enormous data are being generated on a daily basis, which contains different types of data, and acquiring knowledge from these data is essential. This knowledge can be acquired using various data mining techniques to mine knowledge by designing models from the medical records dataset. We implement a machine learning based system that can detect and predict heart diseases in patients using the medical records of patients. The proposed solution is based on existing techniques like Random Forest Bayesian Classification and Logistic Regression, which provides a decision support system for medical professionals to detect and predict heart diseases and heart attacks in humans or individuals using risk factors of heart disease. The dataset used in our model consists of 18 features (risk factors) and 1990 observations after performing preprocessing. It was then split into 80% train sets and 20% test sets. Using real medical records of patients, a series of experiments were conducted to examine the performance and accuracy of the proposed system. "the system was implemented in RStudio platform which predicts the risk of heart disease in patients. The compared results showed that the system performance and accuracy are acceptable with heart disease classification accuracy of 92.44% for Random Forest, 61.96%, and 59.7% for Naive Bayes Classifier and Logistic Regression, respectively.
引用
收藏
页码:2393 / 2402
页数:10
相关论文
共 30 条
[1]  
Amin SU, 2013, 2013 IEEE CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICT 2013), P1227
[2]  
[Anonymous], 2010, ENCY MACHINE LEARNIN
[3]  
[Anonymous], 2018, Applied predictive modeling
[4]  
[Anonymous], 2014, INT J INNOVATIVE RES
[5]  
[Anonymous], 2007, P INT C CONV INF TEC
[6]  
Babu S, 2017, 2017 INTERNATIONAL CONFERENCE OF ELECTRONICS, COMMUNICATION AND AEROSPACE TECHNOLOGY (ICECA), VOL 1, P750, DOI 10.1109/ICECA.2017.8203643
[7]  
Cooper S, 2016, SCHOOL SCI MATH, V116, P1
[8]   MODELS OF INCREMENTAL CONCEPT-FORMATION [J].
GENNARI, JH ;
LANGLEY, P ;
FISHER, D .
ARTIFICIAL INTELLIGENCE, 1989, 40 (1-3) :11-61
[9]  
Jabbar M.A., 2016, 2016 International Conference on Circuits, Controls, Communications and Computing (I4C). 2016 International Conference on Circuits, Controls, P1, DOI DOI 10.1109/CIMCA.2016.8053261
[10]  
Kalaiselvi C, 2016, PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, P3099