A New Kind of Nonparametric Test for Statistical Comparison of Multiple Classifiers Over Multiple Datasets

Cited by: 38
Authors
Yu, Zhiwen [1, 2]
Wang, Zhiqiang [1]
You, Jane
Zhang, Jun [1]
Liu, Jiming [3]
Wong, Hau-San [4]
Han, Guoqiang [1]
Affiliations
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
[2] Hong Kong Baptist Univ, Hong Kong, Hong Kong, Peoples R China
[3] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
Keywords
Classification; classifier ensemble; Friedman test (FT); nonparametric test; statistical test; CLUSTER ENSEMBLE FRAMEWORK; SELECTION; INTELLIGENCE; EVOLUTIONARY; COMBINATION; PERFORMANCE; ALGORITHMS; EXPRESSION; PREDICTION; FEATURES;
DOI
10.1109/TCYB.2016.2611020
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Nonparametric statistical analysis, such as the Friedman test (FT), is attracting growing attention because of its usefulness in many experimental studies. However, the traditional FT for comparing multiple learning algorithms on different datasets adopts a naive ranking approach. The ranking is based on the average accuracy values obtained by the set of learning algorithms on the datasets, which neither considers the differences among the results obtained by the learning algorithms on each dataset nor takes into account the performance of the learning algorithms in each run. In this paper, we first propose three ranking approaches: the weighted ranking approach, the global ranking approach (GRA), and the weighted GRA. Then, a theoretical analysis is performed to explore the properties of the proposed ranking approaches. Next, a set of modified FTs based on the proposed ranking approaches is designed for the comparison of learning algorithms. Finally, the modified FTs are evaluated using six classifier ensemble approaches on 34 real-world datasets. The experiments show the effectiveness of the modified FTs.
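
For orientation, the naive ranking that the abstract attributes to the traditional FT can be sketched as follows. This is a minimal illustration of the standard Friedman statistic only, not the weighted or global ranking approaches proposed in the paper, whose details are not given in this record. The function names and the accuracy table are hypothetical; each row is assumed to hold the mean accuracy of every algorithm on one dataset.

```python
def rank_row(accuracies):
    """Rank one dataset's accuracies: highest gets rank 1, ties share the average rank."""
    order = sorted(range(len(accuracies)), key=lambda j: -accuracies[j])
    ranks = [0.0] * len(accuracies)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of algorithms tied with the one at position i.
        while j + 1 < len(order) and accuracies[order[j + 1]] == accuracies[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for t in range(i, j + 1):
            ranks[order[t]] = avg
        i = j + 1
    return ranks


def friedman_statistic(acc):
    """Return (average ranks, Friedman chi-square) for an N-datasets x k-algorithms table."""
    n, k = len(acc), len(acc[0])
    rank_rows = [rank_row(row) for row in acc]
    avg_ranks = [sum(r[j] for r in rank_rows) / n for j in range(k)]
    chi2 = 12 * n / (k * (k + 1)) * (sum(R * R for R in avg_ranks) - k * (k + 1) ** 2 / 4)
    return avg_ranks, chi2


# Hypothetical mean accuracies: 4 datasets (rows) x 3 algorithms (columns).
acc = [
    [0.90, 0.85, 0.80],
    [0.75, 0.70, 0.65],
    [0.88, 0.86, 0.81],
    [0.93, 0.91, 0.90],
]
avg_ranks, chi2 = friedman_statistic(acc)
print(avg_ranks, chi2)
```

Because only the per-dataset ordering enters the statistic, the sketch makes the abstract's criticism concrete: a win by 0.01 and a win by 0.10 contribute the same rank, and variation across individual runs is averaged away before ranking.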
Pages: 4418-4431
Number of pages: 14