A New Kind of Nonparametric Test for Statistical Comparison of Multiple Classifiers Over Multiple Datasets

被引:38
|
作者
Yu, Zhiwen [1 ,2 ]
Wang, Zhiqiang [1 ]
You, Jane
Zhang, Jun [1 ]
Liu, Jiming [3 ]
Wong, Hau-San [4 ]
Han, Guoqiang [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
[2] Hong Kong Baptist Univ, Hong Kong, Hong Kong, Peoples R China
[3] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
关键词
Classification; classifier ensemble; Friedman test (FT); nonparametric test; statistical test; CLUSTER ENSEMBLE FRAMEWORK; SELECTION; INTELLIGENCE; EVOLUTIONARY; COMBINATION; PERFORMANCE; ALGORITHMS; EXPRESSION; PREDICTION; FEATURES;
D O I
10.1109/TCYB.2016.2611020
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nonparametric statistical analysis, such as the Friedman test (FT), is gaining more and more attention due to its useful applications in a lot of experimental studies. However, traditional FT for the comparison of multiple learning algorithms on different datasets adopts the naive ranking approach. The ranking is based on the average accuracy values obtained by the set of learning algorithms on the datasets, which neither considers the differences of the results obtained by the learning algorithms on each dataset nor takes into account the performance of the learning algorithms in each run. In this paper, we will first propose three kinds of ranking approaches, which are the weighted ranking approach, the global ranking approach (GRA), and the weighted GRA. Then, a theoretical analysis is performed to explore the properties of the proposed ranking approaches. Next, a set of the modified FTs based on the proposed ranking approaches are designed for the comparison of the learning algorithms. Finally, the modified FTs are evaluated through six classifier ensemble approaches on 34 real-world datasets. The experiments show the effectiveness of the modified FTs.
引用
收藏
页码:4418 / 4431
页数:14
相关论文
共 50 条
  • [1] Significance of non-parametric statistical tests for comparison of classifiers over multiple datasets
    Singh, Pawan Kumar
    Sarkar, Ram
    Nasipuri, Mita
    INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2016, 7 (05) : 410 - 442
  • [2] Statistical comparisons of active learning strategies over multiple datasets
    Reyes, Oscar
    Altalhi, Abdulrahman H.
    Ventura, Sebastian
    KNOWLEDGE-BASED SYSTEMS, 2018, 145 : 274 - 288
  • [3] A hybrid genetic based functional link artificial neural network with a statistical comparison of classifiers over multiple datasets
    Dehuri, Satchidananda
    Cho, Sung-Bae
    NEURAL COMPUTING & APPLICATIONS, 2010, 19 (02) : 317 - 328
  • [4] A hybrid genetic based functional link artificial neural network with a statistical comparison of classifiers over multiple datasets
    Satchidananda Dehuri
    Sung-Bae Cho
    Neural Computing and Applications, 2010, 19 : 317 - 328
  • [5] NONPARAMETRIC STATISTICAL ANALYSIS FOR MULTIPLE COMPARISON OF MACHINE LEARNING REGRESSION ALGORITHMS
    Trawinski, Bogdan
    Smetek, Magdalena
    Telec, Zbigniew
    Lasota, Tadeusz
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2012, 22 (04) : 867 - 881
  • [6] Comparison of multiple datasets with gridded precipitation observations over the Tibetan Plateau
    You, Qinglong
    Min, Jinzhong
    Zhang, Wei
    Pepin, Nick
    Kang, Shichang
    CLIMATE DYNAMICS, 2015, 45 (3-4) : 791 - 806
  • [7] An Extension on "Statistical Comparisons of Classifiers over Multiple Data Sets" for all Pairwise Comparisons
    Garcia, Salvador
    Herrera, Francisco
    JOURNAL OF MACHINE LEARNING RESEARCH, 2008, 9 : 2677 - 2694
  • [8] Nonparametric predictive inference for comparison of multiple diagnostic tests
    Alabdulhadi, Manal H.
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2025,
  • [9] A nonparametric hypothesis test for heteroscedasticity in multiple regression
    Zambom, Adriano Z.
    Kim, Seonjin
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2017, 45 (04): : 425 - 441
  • [10] Bayesian model averaging of Bayesian network classifiers over multiple node-orders: Application to sparse datasets
    Hwang, KB
    Zhang, BT
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2005, 35 (06): : 1302 - 1310