A hybrid cost-sensitive ensemble for heart disease prediction

Cited by: 37
Authors
Qi, Zhenya [1]
Zhang, Zuoru [2]
Affiliations
[1] Tianjin Univ, Coll Management & Econ, Tianjin 300072, Peoples R China
[2] Hebei Normal Univ, Sch Math Sci, Shijiazhuang 050024, Hebei, Peoples R China
Keywords
Cost-sensitive; Ensemble; Heart disease; CLASSIFIER ENSEMBLE; DIAGNOSIS; OPTIMIZATION; ALGORITHM; MACHINE; SYSTEM;
DOI
10.1186/s12911-021-01436-7
Chinese Library Classification (CLC) number
R-058
Subject classification code
Abstract
Background: Heart disease is the leading cause of morbidity and mortality worldwide. It encompasses many conditions and symptoms. Diagnosis is difficult because many factors must be analyzed, and the cost of misclassification can be very high.
Methods: A cost-sensitive ensemble method was proposed to improve the efficiency of diagnosis and reduce the misclassification cost. The proposed method combines five heterogeneous classifiers: random forest, logistic regression, support vector machine, extreme learning machine, and k-nearest neighbor. A t-test was used to examine whether the ensemble outperformed the individual classifiers and to assess the contribution of the Relief algorithm.
Results: Under ten-fold cross-validation, the proposed method achieved the best performance. The statistical tests showed that the proposed ensemble performed significantly better than the individual classifiers, and that the Relief algorithm markedly improved classification efficiency.
Conclusions: The proposed ensemble achieved significantly better results than the individual classifiers and previous studies, which suggests that it is a promising alternative tool for medical decision making in heart disease diagnosis.
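The abstract does not say how the ensemble combines its members or which costs it uses, so the following is only a minimal sketch of one common way to make an ensemble decision cost-sensitive: average the members' disease probabilities, then pick the label with the lower expected misclassification cost. The cost values, the function names, and the five member probabilities are all hypothetical and are not taken from the paper.

```python
from statistics import mean

# Hypothetical misclassification costs (assumptions, not values from the paper):
# missing a sick patient is taken to be five times worse than a false alarm.
COST_FALSE_NEGATIVE = 5.0
COST_FALSE_POSITIVE = 1.0


def ensemble_probability(member_probs):
    """Average the disease probabilities reported by the ensemble members."""
    return mean(member_probs)


def cost_sensitive_label(p_disease,
                         cost_fn=COST_FALSE_NEGATIVE,
                         cost_fp=COST_FALSE_POSITIVE):
    """Return 1 (disease) or 0 (healthy), whichever has lower expected cost."""
    expected_cost_if_healthy = p_disease * cost_fn          # risk of a miss
    expected_cost_if_disease = (1.0 - p_disease) * cost_fp  # risk of a false alarm
    return 1 if expected_cost_if_disease <= expected_cost_if_healthy else 0


# Five made-up member outputs standing in for RF, LR, SVM, ELM and k-NN.
member_probs = [0.30, 0.25, 0.40, 0.20, 0.35]
p = ensemble_probability(member_probs)
print(round(p, 2), cost_sensitive_label(p), cost_sensitive_label(p, 1.0, 1.0))
```

With these made-up numbers the averaged probability is 0.30. Equal costs would label the patient healthy, while the assumed 5:1 cost ratio lowers the decision threshold to 1/6 and flags disease instead.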
Pages: 18