A Comparative Performance Evaluation of Classification Algorithms for Clinical Decision Support Systems

被引:14
作者
Tama, Bayu Adhi [1 ]
Lim, Sunghoon [2 ]
机构
[1] Inst Basic Sci IBS, Ctr Math & Computat Sci, Data Sci Grp, Daejeon 34141, South Korea
[2] Ulsan Natl Inst Sci & Technol, Dept Ind Engn, Ulsan 44919, South Korea
基金
新加坡国家研究基金会;
关键词
disease prediction; classification algorithm; multiple diseases; comparative study; significance test; BREAST-CANCER DIAGNOSIS; COMPUTATIONAL INTELLIGENCE; KNOWLEDGE DISCOVERY; CLASSIFIERS; PREDICTION; FOREST; TESTS; MODEL;
D O I
10.3390/math8101814
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Classification algorithms are widely taken into account for clinical decision support systems. However, it is not always straightforward to understand the behavior of such algorithms on a multiple disease prediction task. When a new classifier is introduced, we, in most cases, will ask ourselves whether the classifier performs well on a particular clinical dataset or not. The decision to utilize classifiers mostly relies upon the type of data and classification task, thus making it often made arbitrarily. In this study, a comparative evaluation of a wide-array classifier pertaining to six different families, i.e., tree, ensemble, neural, probability, discriminant, and rule-based classifiers are dealt with. A number of real-world publicly datasets ranging from different diseases are taken into account in the experiment in order to demonstrate the generalizability of the classifiers in multiple disease prediction. A total of 25 classifiers, 14 datasets, and three different resampling techniques are explored. This study reveals that the classifier that is likely to become the best performer is the conditional inference tree forest (cforest), followed by linear discriminant analysis, generalize linear model, random forest, and Gaussian process classifier. This work contributes to existing literature regarding a thorough benchmark of classification algorithms for multiple diseases prediction.
引用
收藏
页码:1 / 24
页数:25
相关论文
共 74 条
[1]   A new machine learning technique for an accurate diagnosis of coronary artery disease [J].
Abdar, Moloud ;
Ksiazek, Wojciech ;
Acharya, U. Rajendra ;
Tan, Ru-San ;
Makarenkov, Vladimir ;
Plawiak, Pawel .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2019, 179
[2]   A new nested ensemble technique for automated diagnosis of breast cancer [J].
Abdar, Moloud ;
Zomorodi-Moghadam, Mariam ;
Zhou, Xujuan ;
Gururajan, Raj ;
Tao, Xiaohui ;
Barua, Prabal D. ;
Gururajan, Rashmi .
PATTERN RECOGNITION LETTERS, 2020, 132 :123-131
[3]   Performance analysis of classification algorithms on early detection of liver disease [J].
Abdar, Moloud ;
Zomorodi-Moghadam, Mariam ;
Das, Resul ;
Ting, I-Hsien .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 67 :239-251
[4]   Building classification trees using the total uncertainty criterion [J].
Abellán, J ;
Moral, S .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2003, 18 (12) :1215-1225
[5]   Forest PA: Constructing a decision forest by penalizing attributes used in previous trees [J].
Adnan, Md Nasim ;
Islam, Md Zahidul .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 89 :389-403
[6]   Breast cancer diagnosis using GA feature selection and Rotation Forest [J].
Alickovic, Emina ;
Subasi, Abdulhamit .
NEURAL COMPUTING & APPLICATIONS, 2017, 28 (04) :753-763
[7]   Coronary artery disease detection using computational intelligence methods [J].
Alizadehsani, Roohallah ;
Zangooei, Mohammad Hossein ;
Hosseini, Mohammad Javad ;
Habibi, Jafar ;
Khosravi, Abbas ;
Roshanzamir, Mohamad ;
Khozeimeh, Fahime ;
Sarrafzadegan, Nizal ;
Nahavandi, Saeid .
KNOWLEDGE-BASED SYSTEMS, 2016, 109 :187-197
[8]   Identification of significant features and data mining techniques in predicting heart disease [J].
Amin, Mohammad Shafenoor ;
Chiam, Yin Kia ;
Varathan, Kasturi Dewi .
TELEMATICS AND INFORMATICS, 2019, 36 :82-93
[9]   Indications of nonlinear deterministic and finite-dimensional structures in time series of brain electrical activity: Dependence on recording region and brain state [J].
Andrzejak, RG ;
Lehnertz, K ;
Mormann, F ;
Rieke, C ;
David, P ;
Elger, CE .
PHYSICAL REVIEW E, 2001, 64 (06) :8-061907
[10]  
[Anonymous], 2007, Proc. Adv. Neural Inf. Process. Syst.