A hybrid cost-sensitive ensemble for heart disease prediction

被引:37
作者
Qi Zhenya [1 ]
Zhang, Zuoru [2 ]
机构
[1] Tianjin Univ, Coll Management & Econ, Tianjin 300072, Peoples R China
[2] Hebei Normal Univ, Sch Math Sci, Shijiazhuang 050024, Hebei, Peoples R China
关键词
Cost-sensitive; Ensemble; Heart disease; CLASSIFIER ENSEMBLE; DIAGNOSIS; OPTIMIZATION; ALGORITHM; MACHINE; SYSTEM;
D O I
10.1186/s12911-021-01436-7
中图分类号
R-058 [];
学科分类号
摘要
BackgroundHeart disease is the primary cause of morbidity and mortality in the world. It includes numerous problems and symptoms. The diagnosis of heart disease is difficult because there are too many factors to analyze. What's more, the misclassification cost could be very high.MethodsA cost-sensitive ensemble method was proposed to improve the efficiency of diagnosis and reduce the misclassification cost. The proposed method contains five heterogeneous classifiers: random forest, logistic regression, support vector machine, extreme learning machine and k-nearest neighbor. T-test was used to investigate if the performance of the ensemble was better than individual classifiers and the contribution of Relief algorithm.ResultsThe best performance was achieved by the proposed method according to ten-fold cross validation. The statistical tests demonstrated that the performance of the proposed ensemble was significantly superior to individual classifiers, and the efficiency of classification was distinctively improved by Relief algorithm.ConclusionsThe proposed ensemble gained significantly better results compared with individual classifiers and previous studies, which implies that it can be used as a promising alternative tool in medical decision making for heart disease diagnosis.
引用
收藏
页数:18
相关论文
共 64 条
[41]   Interpreting parameters in the logistic regression model with random effects [J].
Larsen, K ;
Petersen, JH ;
Budtz-Jorgensen, E ;
Endahl, L .
BIOMETRICS, 2000, 56 (03) :909-914
[42]   Neural network classifier optimization using Differential Evolution with Global Information and Back Propagation algorithm for clinical datasets [J].
Leema, N. ;
Nehemiah, H. Khanna ;
Kannan, A. .
APPLIED SOFT COMPUTING, 2016, 49 :834-844
[43]   A Hybrid Classification System for Heart Disease Diagnosis Based on the RFRS Method [J].
Liu, Xiao ;
Wang, Xiaoli ;
Su, Qiang ;
Zhang, Mo ;
Zhu, Yanhong ;
Wang, Qiugen ;
Wang, Qian .
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2017, 2017
[44]  
LiuN ShenJ, 2018, MATH PROBL ENG, V2018, P1
[45]  
Lopez-Sendon J., 2011, MEDICOGRAPHIA, V33, P363
[46]  
Marateb HR, 2015, J RES MED SCI, V20, P214
[47]  
Mejia OAV., 2018, PLOS ONE, V2018, P1
[48]  
Mokeddem S, 2016, FUZZY SYSTEMS CONCEP, P184
[49]   A fuzzy classification model for myocardial infarction risk assessment [J].
Mokeddem, Sid Ahmed .
APPLIED INTELLIGENCE, 2018, 48 (05) :1233-1250
[50]  
Safdar S., 2017, ARTIF INTELL REV, V2017, P1