Fair Comparison: Quantifying Variance in Results for Fine-grained Visual Categorization

被引：11

作者：

Gwilliam, Matthew ^{[1
,2
]}

Teuscher, Adam ^{[1
]}

Anderson, Connor ^{[1
]}

Farrell, Ryan ^{[1
]}

机构：

[1] Brigham Young Univ, Provo, UT 84602 USA

[2] Univ Maryland, College Pk, MD 20742 USA

来源：

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021 | 2021年

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/WACV48630.2021.00335

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For the task of image classification, researchers work arduously to develop the next state-of-the-art (SOTA) model, each bench-marking their own performance against that of their predecessors and of their peers. Unfortunately, the metric used most frequently to describe a model's performance, average categorization accuracy, is often used in isolation. As the number of classes increases, such as in fine-grained visual categorization (FGVC), the amount of information conveyed by average accuracy alone dwindles. While its most glaring weakness is its failure to describe the model's performance on a class-by-class basis, average accuracy also fails to describe how performance may vary from one trained model of the same architecture, on the same dataset, to another (both averaged across all categories and at the per-class level). We first demonstrate the magnitude of these variations across models and across class distributions based on attributes of the data, comparing results on different visual domains and different per-class image distributions, including long-tailed distributions and few-shot subsets. We then analyze the impact various FGVC methods have on overall and per-class variance. From this analysis, we both highlight the importance of reporting and comparing methods based on information beyond overall accuracy, as well as point out techniques that mitigate variance in FGVC results.

引用

页码：3308 / 3317

页数：10

共 48 条

[1]

[Anonymous], 2018, IEEE INT SYMP CIRC S, DOI DOI 10.1109/ISCAS.2018.8351436

[2]

[Anonymous], 2014, BMVC

[3]

Atlas ML., 2019, PAPERS CODE WEBSITE

[4]

Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473

[5] Destruction and Construction Learning for Fine-grained Image Recognition [J].

Chen, Yue ;

Bai, Yalong ;

Zhang, Wei ;

Mei, Tao .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5152-5161

[6] Kernel Pooling for Convolutional Neural Networks [J].

Cui, Yin ;

Zhou, Feng ;

Wang, Jiang ;

Liu, Xiao ;

Lin, Yuanqing ;

Belongie, Serge .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3049-3058

[7] Selective Sparse Sampling for Fine-grained Image Recognition [J].

Ding, Yao ;

Zhou, Yanzhao ;

Zhu, Yi ;

Ye, Qixiang ;

Jiao, Jianbin .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6598-6607

[8]

Dua D., 2017, UCI Machine Learning Repository

[9]

Dubey A., 2018, P ADV NEUR INF PROC, P635

[10]

Duchi J, 2011, J MACH LEARN RES, V12, P2121

← 1 2 3 4 5 →