A New Kind of Nonparametric Test for Statistical Comparison of Multiple Classifiers Over Multiple Datasets

被引：38

作者：

Yu, Zhiwen ^{[1
,2
]}

Wang, Zhiqiang ^{[1
]}

You, Jane

Zhang, Jun ^{[1
]}

Liu, Jiming ^{[3
]}

Wong, Hau-San ^{[4
]}

Han, Guoqiang ^{[1
]}

机构：

[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China

[2] Hong Kong Baptist Univ, Hong Kong, Hong Kong, Peoples R China

[3] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China

[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2017年 / 47卷 / 12期

关键词：

Classification; classifier ensemble; Friedman test (FT); nonparametric test; statistical test; CLUSTER ENSEMBLE FRAMEWORK; SELECTION; INTELLIGENCE; EVOLUTIONARY; COMBINATION; PERFORMANCE; ALGORITHMS; EXPRESSION; PREDICTION; FEATURES;

D O I：

10.1109/TCYB.2016.2611020

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Nonparametric statistical analysis, such as the Friedman test (FT), is gaining more and more attention due to its useful applications in a lot of experimental studies. However, traditional FT for the comparison of multiple learning algorithms on different datasets adopts the naive ranking approach. The ranking is based on the average accuracy values obtained by the set of learning algorithms on the datasets, which neither considers the differences of the results obtained by the learning algorithms on each dataset nor takes into account the performance of the learning algorithms in each run. In this paper, we will first propose three kinds of ranking approaches, which are the weighted ranking approach, the global ranking approach (GRA), and the weighted GRA. Then, a theoretical analysis is performed to explore the properties of the proposed ranking approaches. Next, a set of the modified FTs based on the proposed ranking approaches are designed for the comparison of the learning algorithms. Finally, the modified FTs are evaluated through six classifier ensemble approaches on 34 real-world datasets. The experiments show the effectiveness of the modified FTs.

引用

页码：4418 / 4431

页数：14

共 50 条

[1] Significance of non-parametric statistical tests for comparison of classifiers over multiple datasets
Singh, Pawan Kumar
Sarkar, Ram
Nasipuri, Mita
INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2016, 7 (05) : 410 - 442
[2] Statistical comparisons of active learning strategies over multiple datasets
Reyes, Oscar
Altalhi, Abdulrahman H.
Ventura, Sebastian
KNOWLEDGE-BASED SYSTEMS, 2018, 145 : 274 - 288
[3] A hybrid genetic based functional link artificial neural network with a statistical comparison of classifiers over multiple datasets
Dehuri, Satchidananda
Cho, Sung-Bae
NEURAL COMPUTING & APPLICATIONS, 2010, 19 (02) : 317 - 328
[4] A hybrid genetic based functional link artificial neural network with a statistical comparison of classifiers over multiple datasets
Satchidananda Dehuri
Sung-Bae Cho
Neural Computing and Applications, 2010, 19 : 317 - 328
[5] NONPARAMETRIC STATISTICAL ANALYSIS FOR MULTIPLE COMPARISON OF MACHINE LEARNING REGRESSION ALGORITHMS
Trawinski, Bogdan
Smetek, Magdalena
Telec, Zbigniew
Lasota, Tadeusz
INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2012, 22 (04) : 867 - 881
[6] Comparison of multiple datasets with gridded precipitation observations over the Tibetan Plateau
You, Qinglong
Min, Jinzhong
Zhang, Wei
Pepin, Nick
Kang, Shichang
CLIMATE DYNAMICS, 2015, 45 (3-4) : 791 - 806
[7] An Extension on "Statistical Comparisons of Classifiers over Multiple Data Sets" for all Pairwise Comparisons
Garcia, Salvador
Herrera, Francisco
JOURNAL OF MACHINE LEARNING RESEARCH, 2008, 9 : 2677 - 2694
[8] Nonparametric predictive inference for comparison of multiple diagnostic tests
Alabdulhadi, Manal H.
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2025,
[9] A nonparametric hypothesis test for heteroscedasticity in multiple regression
Zambom, Adriano Z.
Kim, Seonjin
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2017, 45 (04): : 425 - 441
[10] Bayesian model averaging of Bayesian network classifiers over multiple node-orders: Application to sparse datasets
Hwang, KB
Zhang, BT
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (06): : 1302 - 1310

← 1 2 3 4 5 →