Fair Comparison: Quantifying Variance in Results for Fine-grained Visual Categorization

Cited by: 11

Authors:

Gwilliam, Matthew [1,2]

Teuscher, Adam [1]

Anderson, Connor [1]

Farrell, Ryan [1]

Affiliations:

[1] Brigham Young Univ, Provo, UT 84602 USA

[2] Univ Maryland, College Pk, MD 20742 USA

Source:

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021) | 2021

Funding:

National Science Foundation (USA);

Keywords:

DOI:

10.1109/WACV48630.2021.00335

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

For the task of image classification, researchers work arduously to develop the next state-of-the-art (SOTA) model, each benchmarking their own performance against that of their predecessors and of their peers. Unfortunately, the metric used most frequently to describe a model's performance, average categorization accuracy, is often used in isolation. As the number of classes increases, such as in fine-grained visual categorization (FGVC), the amount of information conveyed by average accuracy alone dwindles. While its most glaring weakness is its failure to describe the model's performance on a class-by-class basis, average accuracy also fails to describe how performance may vary from one trained model of the same architecture, on the same dataset, to another (both averaged across all categories and at the per-class level). We first demonstrate the magnitude of these variations across models and across class distributions based on attributes of the data, comparing results on different visual domains and different per-class image distributions, including long-tailed distributions and few-shot subsets. We then analyze the impact various FGVC methods have on overall and per-class variance. From this analysis, we both highlight the importance of reporting and comparing methods based on information beyond overall accuracy and point out techniques that mitigate variance in FGVC results.
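The distinction the abstract draws between overall accuracy, per-class accuracy, and run-to-run variation can be illustrated with a small sketch. This is not the authors' code: the function names (`per_class_accuracy`, `summarize_runs`), the toy labels, and the choice of sample standard deviation as the spread measure are all assumptions made for illustration.

```python
from statistics import mean, stdev


def per_class_accuracy(y_true, y_pred, num_classes):
    """Accuracy for each class; None where a class has no test samples."""
    correct = [0] * num_classes
    total = [0] * num_classes
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    return [c / n if n else None for c, n in zip(correct, total)]


def summarize_runs(y_true, runs, num_classes):
    """Overall and per-class spread across several trained models.

    `runs` holds one prediction list per independently trained model
    (same architecture, same dataset, e.g. different random seeds).
    """
    overall = [
        mean(1.0 if t == p else 0.0 for t, p in zip(y_true, preds))
        for preds in runs
    ]
    per_class = [per_class_accuracy(y_true, preds, num_classes) for preds in runs]
    per_class_std = []
    for c in range(num_classes):
        accs = [run[c] for run in per_class if run[c] is not None]
        # Sample standard deviation needs at least two runs.
        per_class_std.append(stdev(accs) if len(accs) > 1 else None)
    return {
        "overall_mean": mean(overall),
        "overall_std": stdev(overall) if len(overall) > 1 else None,
        "per_class_std": per_class_std,
    }


# Toy data (hypothetical): 3 classes, 2 test images each, 3 trained models.
y_true = [0, 0, 1, 1, 2, 2]
runs = [
    [0, 0, 1, 1, 2, 2],
    [0, 0, 1, 0, 2, 1],
    [0, 0, 1, 1, 0, 2],
]
summary = summarize_runs(y_true, runs, num_classes=3)
print(summary)
```

In this toy case class 0 is classified identically by every run while classes 1 and 2 fluctuate, a difference that a single averaged accuracy figure would hide.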


Pages: 3308-3317

Page count: 10
