A Machine Learning Framework for Feature Selection in Heart Disease Classification Using Improved Particle Swarm Optimization with Support Vector Machine Classifier

被引:65
作者
Vijayashree, J. [1 ]
Sultana, H. Parveen [1 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Vellore, Tamil Nadu, India
关键词
Particle Swarm Optimization; Support Vector Machine; fitness function; ROC analysis; population diversity function; tuning function; DIFFERENTIAL EVOLUTION ALGORITHM; PRINCIPAL COMPONENT ANALYSIS; CORONARY-ARTERY-DISEASE; NEURAL-NETWORKS; DECISION-MAKING; DIAGNOSIS; SYSTEM; PREDICTION; DISCOVERY; TREE;
D O I
10.1134/S0361768818060129
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Machine learning is used as an effective support system in health diagnosis which contains large volume of data. More commonly, analyzing such a large volume of data consumes more resources and execution time. In addition, all the features present in the dataset do not support in achieving the solution of the given problem. Hence, there is a need to use an effective feature selection algorithm for finding the more important features that contribute more in diagnosing the diseases. The Particle Swarm Optimization (PSO) is one of the metaheuristic algorithms to find the best solution with less time. Nowadays, PSO algorithm is not only used to select the more significant features but also removes the irrelevant and redundant features present in the dataset. However, the traditional PSO algorithm has an issue in selecting the optimal weight to update the velocity and position of the particles. To overcome this issue, this paper presents a novel function for identifying optimal weights on the basis of population diversity function and tuning function. We have also proposed a novel fitness function for PSO with the help of Support Vector Machine (SVM). The objective of the fitness function is to minimize the number of attributes and increase the accuracy. The performance of the proposed PSO-SVM is compared with the various existing feature selection algorithms such as Info gain, Chi-squared, One attribute based, Consistency subset, Relief, CFS, Filtered subset, Filtered attribute, Gain ratio and PSO algorithm. The SVM classifier is also compared with several classifiers such as Naive Bayes, Random forest and MLP.
引用
收藏
页码:388 / 397
页数:10
相关论文
共 51 条
[1]  
[Anonymous], 2017, CARL HYBR GEN ALG
[2]   Clinical decision support system: Risk level prediction of heart disease using weighted fuzzy rules [J].
Anooj, P. K. .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2012, 24 (01) :27-40
[3]   Computer aided decision making for heart disease detection using hybrid neural network-Genetic algorithm [J].
Arabasadi, Zeinab ;
Alizadehsani, Roohallah ;
Roshanzamir, Mohamad ;
Moosaei, Hossein ;
Yarifard, Ali Asghar .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2017, 141 :19-26
[4]   Using methods from the data-mining and machine-learning literature for disease classification and prediction: a case study examining classification of heart failure subtypes [J].
Austin, Peter C. ;
Tu, Jack V. ;
Ho, Jennifer E. ;
Levy, Daniel ;
Lee, Douglas S. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2013, 66 (04) :398-407
[5]   A comparison of feature selection models utilizing binary particle swarm optimization and genetic algorithm in determining coronary artery disease using support vector machine [J].
Babaoglu, Ismail ;
Findik, Oguz ;
Ulker, Erkan .
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (04) :3177-3183
[6]   Assessment of exercise stress testing with artificial neural network in determining coronary artery disease and predicting lesion localization [J].
Babaoglu, Ismail ;
Baykan, Omer Kaan ;
Aygul, Nazif ;
Ozdemir, Kurtulus ;
Bayrak, Mehmet .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :2562-2566
[7]   A decision support system for cost-effective diagnosis [J].
Chi, Chih-Lin ;
Street, W. Nick ;
Katz, David A. .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2010, 50 (03) :149-161
[8]   Effective diagnosis of heart disease through neural networks ensembles [J].
Das, Resul ;
Turkoglu, Ibrahim ;
Sengur, Abdulkadir .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) :7675-7680
[9]   Diagnosis of valvular heart disease through neural networks ensembles [J].
Das, Resul ;
Turkoglu, Ibrahim ;
Sengur, Abdulkadir .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2009, 93 (02) :185-191
[10]   AptaCDSS-E: A classifier ensemble-based clinical decision support system for cardiovascular disease level prediction [J].
Eom, Jae-Hong ;
Kim, Sung-Chun ;
Zhang, Byoung-Tak .
EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (04) :2465-2479