RETRACTED ARTICLE: Optimal feature selection through a cluster-based DT learning (CDTL) in heart disease prediction

被引:0
作者
G. Magesh
P. Swarnalatha
机构
[1] VIT University,School of Information Technology and Engineering
[2] VIT University,School of Computer Science and Engineering
来源
Evolutionary Intelligence | 2021年 / 14卷
关键词
Classification; Machine learning; Heart disease; Support vector machine; Random forest;
D O I
暂无
中图分类号
学科分类号
摘要
In the rural side, there is the absence of centers for cardiovascular ailment. Due to this, around 12 million people passing worldwide reported by WHO. The principal purpose of coronary illness is a propensity for smoking. ML classifiers are applied to predict the risk of cardiovascular disease. However, the ML model has some inherent problems like it’s serene to feature selection, splitting attribute, and imbalanced datasets prediction. Most of the mass datasets have multi-class labels, but their combinations are in different proportions. In this paper, we experiment with our system with Cleveland’s heart samples from the UCI repository. Our cluster-based DT learning (CDTL) mainly includes five key stages. At first, the original set has partitioned through target label distribution. From the high distribution samples, the other possible class combination has made. For each class-set combination, the significant features have identified through entropy. With the significant critical features, an entropy-based partition has made. At last, on these entropy clusters, RF performance is made through significant and all features in the prediction of heart disease. From our CDTL approach, the RF classifier achieves 89.30% improved prediction accuracy from 76.70% accuracy (without CDTL). Hence, the error rate of RF with CDTL has significantly reduced from 23.30 to 9.70%.
引用
收藏
页码:583 / 593
页数:10
相关论文
共 106 条
[1]  
Razmjooy N(2018)A hybrid neural network–world cup optimization algorithm for melanoma detection Open Med 13 9-16
[2]  
Sheykhahmad FR(2013)Computer vision-based potato defect detection using neural networks and support vector machine Int J Robot Autom 28 137-145
[3]  
Ghadimi N(2011)Digital image segmentation using rule-base classifier Am J Sci Res 35 17-23
[4]  
Moallem P(2009)Effective diagnosis of heart disease through neural networks ensembles Expert Syst Appl 36 7675-7680
[5]  
Navid R(2016)A hybrid data mining model to predict coronary artery disease cases using non-invasive clinical data J Med Syst 40 1-7
[6]  
Mohsen A(2015)Feature analysis of coronary artery heart disease data sets Procedia Comput Sci 65 459-468
[7]  
Mousavi S(2019)Effective heart disease prediction using hybrid machine learning techniques IEEE Access 7 81542-81554
[8]  
Sargolzaei P(2016)A computational intelligence method for effective diagnosis of heart disease using genetic algorithm Int J Bio-Sci Bio-Technol 8 363-372
[9]  
Razmjooy N(2018)Hybrid recommendation system for heart disease diagnosis based on multiple kernel learning with adaptive neuro-fuzzy inference system Multimed Tools Appl 77 4379-31
[10]  
Soleymani F(2016)Analysis of supervised machine learning algorithms for heart disease prediction with reduced number of attributes using principal component analysis Int J Comput Appl 140 27-11