Where are the large and difficult datasets?

被引:0
作者
Adrien Jamain
David J. Hand
机构
[1] BNP-Paribas,Department of Mathematics
[2] Institute for Mathematical Sciences,undefined
来源
Advances in Data Analysis and Classification | 2009年 / 3卷
关键词
Error rate; Meta-analysis; Comparative studies; Repositories; 6207; 68T10;
D O I
暂无
中图分类号
学科分类号
摘要
A great many comparative performance assessments of classification rules have been undertaken, ranging from small ones involving just one or two methods, to large ones involving many tens of methods. We are undertaking a meta-analytic study of these studies, attempting to distil some overall conclusions. This paper describes just one of our observations. The dataset analysed in this paper contains 5,203 error rates taken from 45 articles and describing 146 datasets. One curious general relationship which was persistent in our data, despite the fact that we were looking at results mixed between distributions rather than conditional on distributions, was that error rate decreased with increasing dataset size. We believe this to be an artefact of the way datasets are collected by the research community.
引用
收藏
页码:25 / 38
页数:13
相关论文
共 50 条
  • [31] VALIDATE: A deep dive into vulnerability prediction datasets
    Esposito, Matteo
    Falessi, Davide
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2024, 170
  • [32] Findings from GitHub: Methods, Datasets and Limitations
    Cosentino, Valerio
    Luis, Javier
    Izquierdo, Canovas
    Cabot, Jordi
    [J]. 13TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2016), 2016, : 137 - 141
  • [33] A Commentary on Cognitive Behavior Therapy: Where We Have Been, Where We Are, and Where We Need to Go From Here
    Ollendick, Thomas H.
    [J]. COGNITIVE AND BEHAVIORAL PRACTICE, 2016, 23 (04) : 436 - 440
  • [34] Homeopathy: Where is the bias?
    Bellavite, Paolo
    Fisher, Peter
    [J]. EUROPEAN JOURNAL OF INTERNAL MEDICINE, 2014, 25 (05) : E66 - E66
  • [35] Where is the "Corruption?" Response
    Egilman, David
    [J]. AMERICAN JOURNAL OF INDUSTRIAL MEDICINE, 2017, 60 (10) : 915 - 920
  • [36] Negative Symptoms in Schizophrenia: Where We have been and Where We are Heading
    Azorin, Jean-Michel
    Belzeaux, Raoul
    Adida, Marc
    [J]. CNS NEUROSCIENCE & THERAPEUTICS, 2014, 20 (09) : 801 - 808
  • [37] Where is the "where" in the brain? A meta-analysis of neuroimaging studies on spatial cognition
    Cona, Giorgia
    Scarpazza, Cristina
    [J]. HUMAN BRAIN MAPPING, 2019, 40 (06) : 1867 - 1886
  • [38] Sleep, circadian rhythms, and schizophrenia: where we are and where we need to go
    Cosgrave, Jan
    Wulff, Katharina
    Gehrman, Philip
    [J]. CURRENT OPINION IN PSYCHIATRY, 2018, 31 (03) : 176 - 182
  • [39] An inflamed subtype of difficult-to-treat depression
    Suneson, Klara
    Grudet, Cecile
    Ventorp, Filip
    Malm, Johan
    Asp, Marie
    Westrin, Asa
    Lindqvist, Daniel
    [J]. PROGRESS IN NEURO-PSYCHOPHARMACOLOGY & BIOLOGICAL PSYCHIATRY, 2023, 125
  • [40] Predictors of Difficult Intubation with the Bonfils Rigid Fiberscope
    Nowakowski, Michal
    Williams, Stephan
    Gallant, Jason
    Ruel, Monique
    Robitaille, Arnaud
    [J]. ANESTHESIA AND ANALGESIA, 2016, 122 (06) : 1901 - 1906