Adaptive Testing of Computer Vision Models

Cited by: 3
Authors
Gao, Irena [1, 3]
Ilharco, Gabriel [2 ]
Lundberg, Scott [3 ]
Ribeiro, Marco Tulio [3 ]
Affiliations
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Univ Washington, Seattle, WA 98195 USA
[3] Microsoft Res, Redmond, WA USA
Source
2023 IEEE/CVF International Conference on Computer Vision (ICCV), 2023
DOI
10.1109/ICCV51070.2023.00370
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Vision models often fail systematically on groups of data that share common semantic characteristics (e.g., rare objects or unusual scenes), but identifying these failure modes is a challenge. We introduce AdaVision, an interactive process for testing vision models which helps users identify and fix coherent failure modes. Given a natural language description of a coherent group, AdaVision retrieves relevant images from LAION-5B with CLIP. The user then labels a small amount of data for model correctness, which is used in successive retrieval rounds to hill-climb towards high-error regions, refining the group definition. Once a group is saturated, AdaVision uses GPT-3 to suggest new group descriptions for the user to explore. We demonstrate the usefulness and generality of AdaVision in user studies, where users find major bugs in state-of-the-art classification, object detection, and image captioning models. These user-discovered groups have failure rates 2-3x higher than those surfaced by automatic error clustering methods. Finally, finetuning on examples found with AdaVision fixes the discovered bugs when evaluated on unseen examples, without degrading in-distribution accuracy, and while also improving performance on out-of-distribution datasets.
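The retrieve, label, and refine loop described in the abstract can be sketched roughly as follows. This is a toy illustration, not the authors' implementation: the abstract does not state how labels steer later retrieval rounds, so the sketch assumes a Rocchio-style relevance feedback update, and it uses hand-written 2-D vectors in place of CLIP embeddings of LAION-5B images. All names (`adaptive_test`, `refine`, `POOL`, `FAILING`) are hypothetical, and the GPT-3 suggestion step is omitted.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def mean(vectors, dim):
    """Component-wise mean; a zero vector if the list is empty."""
    if not vectors:
        return [0.0] * dim
    return [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]

def refine(query, failed, passed, beta=1.0, gamma=1.0):
    """Assumed Rocchio-style update: pull the original query toward labeled
    failures and push it away from labeled passes."""
    dim = len(query)
    f, p = mean(failed, dim), mean(passed, dim)
    return [q + beta * a - gamma * b for q, a, b in zip(query, f, p)]

def adaptive_test(query, pool, model_fails, rounds=2, k=2, beta=1.0, gamma=1.0):
    """Run retrieval rounds; return indices of items labeled as failures."""
    seen, failures, passes = set(), [], []
    current = list(query)
    for _ in range(rounds):
        unseen = [i for i in range(len(pool)) if i not in seen]
        # Retrieve the k unseen items most similar to the current query.
        batch = sorted(unseen, key=lambda i: cosine(current, pool[i]),
                       reverse=True)[:k]
        if not batch:
            break
        seen.update(batch)
        # model_fails stands in for the user's correctness labels.
        for i in batch:
            (failures if model_fails(i) else passes).append(i)
        current = refine(query,
                         [pool[i] for i in failures],
                         [pool[i] for i in passes],
                         beta, gamma)
    return failures

# Toy pool: items with a positive second coordinate are the hidden failure group.
POOL = [[1.0, 0.1], [1.0, -0.2], [0.6, 0.8],
        [0.7, -0.714], [0.5, 0.866], [0.65, -0.76]]
FAILING = {0, 2, 4}

with_feedback = adaptive_test([1.0, 0.0], POOL, lambda i: i in FAILING)
without_feedback = adaptive_test([1.0, 0.0], POOL, lambda i: i in FAILING,
                                 beta=0.0, gamma=0.0)
```

On this toy pool, the feedback run surfaces all three failing items within two rounds, while the run with feedback disabled (`beta=0.0, gamma=0.0`) finds only the one failure that happened to be near the initial query.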
Pages: 3980-3991
Number of pages: 12