A New Kind of Nonparametric Test for Statistical Comparison of Multiple Classifiers Over Multiple Datasets

Cited by: 38
Authors
Yu, Zhiwen [1, 2]
Wang, Zhiqiang [1]
You, Jane
Zhang, Jun [1]
Liu, Jiming [3]
Wong, Hau-San [4]
Han, Guoqiang [1]
Affiliations
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
[2] Hong Kong Baptist Univ, Hong Kong, Hong Kong, Peoples R China
[3] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
Keywords
Classification; classifier ensemble; Friedman test (FT); nonparametric test; statistical test; CLUSTER ENSEMBLE FRAMEWORK; SELECTION; INTELLIGENCE; EVOLUTIONARY; COMBINATION; PERFORMANCE; ALGORITHMS; EXPRESSION; PREDICTION; FEATURES;
DOI
10.1109/TCYB.2016.2611020
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Nonparametric statistical tests, such as the Friedman test (FT), are receiving growing attention because of their usefulness in many experimental studies. However, the traditional FT for comparing multiple learning algorithms over different datasets adopts a naive ranking approach: the ranking is based on the average accuracy values obtained by the learning algorithms on the datasets, so it considers neither the differences between the results obtained by the learning algorithms on each dataset nor the performance of the learning algorithms in each run. In this paper, we first propose three ranking approaches: the weighted ranking approach, the global ranking approach (GRA), and the weighted GRA. Then, a theoretical analysis is performed to explore the properties of the proposed ranking approaches. Next, a set of modified FTs based on the proposed ranking approaches is designed for comparing the learning algorithms. Finally, the modified FTs are evaluated on six classifier ensemble approaches over 34 real-world datasets. The experiments show the effectiveness of the modified FTs.
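For orientation, the classical ranking step that the abstract calls naive can be sketched as follows. This is only a minimal illustration of the standard Friedman test, not the weighted or global ranking approaches proposed in the paper, whose definitions are not given in this record. The function names and the accuracy values are hypothetical; each entry `acc[i][j]` is assumed to be the mean accuracy of algorithm `j` on dataset `i`.

```python
def rank_row(accuracies):
    """Rank one dataset's accuracies: rank 1 = highest, ties share the average rank."""
    order = sorted(range(len(accuracies)), key=lambda j: -accuracies[j])
    ranks = [0.0] * len(accuracies)
    i = 0
    while i < len(order):
        # Extend j over the run of algorithms tied with the one at position i.
        j = i
        while j + 1 < len(order) and accuracies[order[j + 1]] == accuracies[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for t in range(i, j + 1):
            ranks[order[t]] = avg
        i = j + 1
    return ranks


def friedman_statistic(acc):
    """Return (average ranks, Friedman chi-square) for an N x k accuracy table."""
    n = len(acc)       # number of datasets
    k = len(acc[0])    # number of algorithms
    rank_rows = [rank_row(row) for row in acc]
    avg_ranks = [sum(r[j] for r in rank_rows) / n for j in range(k)]
    chi2 = 12 * n / (k * (k + 1)) * (
        sum(r * r for r in avg_ranks) - k * (k + 1) ** 2 / 4
    )
    return avg_ranks, chi2


# Hypothetical mean accuracies: 3 datasets (rows) x 3 algorithms (columns).
acc = [
    [0.90, 0.80, 0.70],
    [0.85, 0.80, 0.60],
    [0.95, 0.90, 0.70],
]
avg_ranks, chi2 = friedman_statistic(acc)
print(avg_ranks, chi2)
```

Because only the per-dataset rank order enters the statistic, a win by 0.1 and a win by 0.001 contribute identically, and the spread across individual runs is discarded once accuracies are averaged. These are the two limitations the abstract attributes to the traditional ranking.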
Pages: 4418-4431
Number of pages: 14