Adaptive Testing of Computer Vision Models

被引：3

作者：

Gao, Irena ^{[1
,3
]}

Ilharco, Gabriel ^{[2
]}

Lundberg, Scott ^{[3
]}

Ribeiro, Marco Tulio ^{[3
]}

机构：

[1] Stanford Univ, Stanford, CA 94305 USA

[2] Univ Washington, Seattle, WA 98195 USA

[3] Microsoft Res, Redmond, WA USA

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV | 2023年

关键词：

D O I：

10.1109/ICCV51070.2023.00370

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Vision models often fail systematically on groups of data that share common semantic characteristics (e.g., rare objects or unusual scenes), but identifying these failure modes is a challenge. We introduce AdaVision, an interactive process for testing vision models which helps users identify and fix coherent failure modes. Given a natural language description of a coherent group, AdaVision retrieves relevant images from LAION-5B with CLIP. The user then labels a small amount of data for model correctness, which is used in successive retrieval rounds to hill-climb towards high-error regions, refining the group definition. Once a group is saturated, AdaVision uses GPT-3 to suggest new group descriptions for the user to explore. We demonstrate the usefulness and generality of AdaVision in user studies, where users find major bugs in state-of-the-art classification, object detection, and image captioning models. These user-discovered groups have failure rates 2-3x higher than those surfaced by automatic error clustering methods. Finally, finetuning on examples found with AdaVision fixes the discovered bugs when evaluated on unseen examples, without degrading in-distribution accuracy, and while also improving performance on out-of-distribution datasets.

引用

页码：3980 / 3991

页数：12

共 50 条

[1] ADAPTIVE TESTING BY COMPUTER
WEISS, DJ
JOURNAL OF CONSULTING AND CLINICAL PSYCHOLOGY, 1985, 53 (06) : 774 - 789
[2] Probabilistic models in computer vision
Bowden, R
IMAGE AND VISION COMPUTING, 2003, 21 (10) : 841 - 841
[3] COMPUTER MODELS OF VISION AND SPEECH
REDDY, R
DATAMATION, 1968, 14 (11): : 109 - &
[4] An adaptive parallel computer vision system
Kim, JM
Kim, Y
Kim, SD
Han, TD
Yang, SB
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1998, 12 (03) : 311 - 334
[5] GUI Testing Using Computer Vision
Chang, Tsung-Hsiang
Yeh, Tom
Miller, Robert C.
CHI2010: PROCEEDINGS OF THE 28TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1-4, 2010, : 1535 - +
[6] Testing dynamical models of vision
Rueter, Johannes
Francis, Gregory
Frehe, Patricia
Herzog, Michael H.
VISION RESEARCH, 2011, 51 (03) : 343 - 351
[7] Computer-adaptive Testing
Bengel, Juergen
REHABILITATION, 2014, 53 (05) : 289 - 289
[8] A framework for the automation of testing computer vision systems
Wotawa, Franz
Klampfl, Lorenz
Jahaj, Ledio
2021 IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATION OF SOFTWARE TEST (AST 2021), 2021, : 121 - 124
[9] Computer animated childrens pictures for vision testing
Mueller, D.
Kandzia, C.
Roider, J.
OPHTHALMOLOGE, 2009, 106 (04): : 328 - 333
[10] X-ray Testing by Computer Vision
Mery, Domingo
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 360 - 367

← 1 2 3 4 5 →