Heart Disease Prediction Using Modified Machine Learning Algorithm

被引:9
作者
Kaur, Bavneet [1 ]
Kaur, Gaganpreet [2 ]
机构
[1] Sri Guru Granth Sahib World Univ, Fatehgarh Sahib, India
[2] Chitkara Univ, Inst Engn & Technol, Rajpura, Punjab, India
来源
INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1 | 2023年 / 473卷
关键词
Data mining; Heart disease; K-means; Genetic algorithm; Random forest; Logistic regression;
D O I
10.1007/978-981-19-2821-5_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Heart patient number is escalating day by day, and numerous individuals lost their precious lives each year due to sudden heart attack in all over the world. Because of this, before time diagnosis of cardiovascular disease is necessary to prevent death. Some technology-based software is required to help in medical field to recognize heart patients with more accuracy and lesser time. Huge amount of heart patients are present in different hospitals in all over the world, which can be used efficiently to diagnose the heart disease by applying data mining techniques. In the process of data mining, knowledge or useful information is extracted among the large sets of raw data. In the prediction analysis, machine learning techniques are applied to discover valuable patterns and forecast future events or trends. This research work will predict the likelihood of coronary heart disorder in patients by implementing a modified machine learning algorithm. The input data are passed through various procedures comprising preprocessing, clustering, and selection of effective attributes before classification. To determine the heart illness, four algorithms which include random forest, K-means, genetic algorithm, and logistic regression are assimilated. In this technique, the irrelevant attributes of heart dataset are discarded to improve the performance and to decrease the training period time. This process is completed by random forest technique. K-means clusters are optimized by genetic algorithm in order to group all the outlier data points. At last, logistic regression is applied to classify the patients based on the heart disease. Performance comparison among various existing techniques has analyzed on the basis of some performance measures. The calculated accuracy increased up to 95%.
引用
收藏
页码:189 / 201
页数:13
相关论文
共 22 条
[1]  
Ansarullah SI, 2016, INT RES J ENG TECHNO, V3
[2]  
Bhatla N., 2012, International Journal of Engineering, V1, P1
[3]  
Chauhan A, 2018, 2018 4TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT)
[4]  
David H. Benjamin Fredrick, 2018, ICTACT Journal on Soft Computing, V9, P1817, DOI 10.21917/ijsc.2018.0253
[5]  
Dewan A, 2015, 2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), P704
[6]  
Gandhi M, 2015, 2015 1ST INTERNATIONAL CONFERENCE ON FUTURISTIC TRENDS ON COMPUTATIONAL ANALYSIS AND KNOWLEDGE MANAGEMENT (ABLAZE), P520, DOI 10.1109/ABLAZE.2015.7154917
[7]  
Gomathi K, 2016, INT J RES APPL SCI E, V4
[8]  
Jabbar MA, 2016, INT C CIRCUITS CONTR
[9]  
Masethe H.D., 2014, Proceed. World Congress Eng. Compt. Sci., V2, P1
[10]  
Mirza I, 2019, INT C ADV COMPUTING