MV5: A Clinical Decision Support Framework for Heart Disease Prediction Using Majority Vote Based Classifier Ensemble

Cited by: 36
Authors
Bashir, Saba [1]
Qamar, Usman [1]
Khan, Farhan Hassan [1]
Javed, M. Younus [1]
Affiliations
[1] NUST, Dept Comp Engn, Coll Elect & Mech Engn, Islamabad, Pakistan
Keywords
Ensemble; Majority vote; Cross validation; Heterogeneous classifiers; Naive Bayes; Decision tree; Gini Index; Information gain; Memory-based learner; Support vector machine
DOI
10.1007/s13369-014-1315-0
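Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract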
The medical diagnosis process can be interpreted as a decision-making process in which the physician infers the diagnosis of a new and unknown case from an available set of clinical data and his/her clinical experience. This process can be computerized in order to present medical diagnostic procedures in a rational, objective, accurate and efficient way. In the last few decades, many researchers have focused on developing effective methods for intelligent heart disease prediction and decision support systems. For such a system, high prediction accuracy is paramount. In this research, an ensemble classifier is proposed that uses a majority vote-based scheme for heart disease data classification and prediction. The five heterogeneous classifiers used to construct the ensemble model are Naïve Bayes, a decision tree based on the Gini Index, a decision tree based on information gain, a memory-based learner and a support vector machine. Five datasets from different data repositories are employed to test the effectiveness of the ensemble model. Each dataset has different types of attributes, for instance binary, real, continuous and categorical. Experimental results with stratified cross validation show that the proposed MV5 framework deals with all the attribute types. MV5 achieved an accuracy of 88.52% with 86.96% sensitivity, 90.83% specificity and 88.85% f-measure. Comparison of the proposed MV5 model with the individual classifiers shows an increase in average accuracy, sensitivity, specificity and f-measure of about 14%, 11%, 17% and 18%, respectively.
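The majority vote scheme and the four reported metrics can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`majority_vote`, `ensemble_predict`, `binary_metrics`) are hypothetical, the five prediction lists are made-up toy values standing in for the outputs of the five base classifiers, and the label `1` is assumed to mean "disease present".

```python
from collections import Counter


def majority_vote(votes):
    """Return the label chosen by the most classifiers for one sample.

    With five voters and two labels a tie cannot occur; with other
    setups Counter breaks ties by first occurrence (an assumption of
    this sketch, not something the abstract specifies).
    """
    return Counter(votes).most_common(1)[0][0]


def ensemble_predict(per_classifier_predictions):
    """Combine per-classifier prediction lists sample by sample."""
    return [majority_vote(sample_votes)
            for sample_votes in zip(*per_classifier_predictions)]


def binary_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, sensitivity, specificity and f-measure."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    denom = precision + sensitivity
    f_measure = 2 * precision * sensitivity / denom if denom else 0.0
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f_measure": f_measure,
    }


# Toy data (invented for illustration): true labels for six cases and
# the predictions of five hypothetical base classifiers.
y_true = [1, 1, 1, 0, 0, 0]
toy_predictions = [
    [1, 1, 0, 0, 0, 1],  # stand-in for Naive Bayes
    [1, 0, 1, 0, 1, 0],  # stand-in for decision tree (Gini Index)
    [1, 1, 0, 0, 0, 0],  # stand-in for decision tree (information gain)
    [0, 1, 1, 1, 0, 0],  # stand-in for memory-based learner
    [1, 0, 0, 0, 0, 1],  # stand-in for support vector machine
]

ensemble = ensemble_predict(toy_predictions)
print(ensemble)
print(binary_metrics(y_true, ensemble))
```

Using an odd number of base classifiers, as in this sketch, means a binary vote always has a strict majority, so no tie-breaking rule is needed.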
Pages: 7771-7783
Number of pages: 13