Adaptive Testing of Computer Vision Models

被引:3
|
作者
Gao, Irena [1 ,3 ]
Ilharco, Gabriel [2 ]
Lundberg, Scott [3 ]
Ribeiro, Marco Tulio [3 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Univ Washington, Seattle, WA 98195 USA
[3] Microsoft Res, Redmond, WA USA
来源
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV | 2023年
关键词
D O I
10.1109/ICCV51070.2023.00370
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision models often fail systematically on groups of data that share common semantic characteristics (e.g., rare objects or unusual scenes), but identifying these failure modes is a challenge. We introduce AdaVision, an interactive process for testing vision models which helps users identify and fix coherent failure modes. Given a natural language description of a coherent group, AdaVision retrieves relevant images from LAION-5B with CLIP. The user then labels a small amount of data for model correctness, which is used in successive retrieval rounds to hill-climb towards high-error regions, refining the group definition. Once a group is saturated, AdaVision uses GPT-3 to suggest new group descriptions for the user to explore. We demonstrate the usefulness and generality of AdaVision in user studies, where users find major bugs in state-of-the-art classification, object detection, and image captioning models. These user-discovered groups have failure rates 2-3x higher than those surfaced by automatic error clustering methods. Finally, finetuning on examples found with AdaVision fixes the discovered bugs when evaluated on unseen examples, without degrading in-distribution accuracy, and while also improving performance on out-of-distribution datasets.
引用
收藏
页码:3980 / 3991
页数:12
相关论文
共 50 条
  • [41] ADAPTIVE GROUP TESTING WITH MISMATCHED MODELS
    Fan, Mingzhou
    Yoon, Byung-Jun
    Alexander, Francis J.
    Dougherty, Edward R.
    Qian, Xiaoning
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4533 - 4537
  • [42] Adaptive testing for hierarchical student models
    Eduardo Guzmán
    Ricardo Conejo
    José-Luis Pérez-de-la-Cruz
    User Modeling and User-Adapted Interaction, 2007, 17 : 119 - 157
  • [43] Adaptive testing for hierarchical student models
    Guzman, Eduardo
    Conejo, Ricardo
    Perez-de-la-Cruz, Jose-Luis
    USER MODELING AND USER-ADAPTED INTERACTION, 2007, 17 (1-2) : 119 - 157
  • [44] Graphical models and computerized adaptive testing
    Almond, RG
    Mislevy, RJ
    APPLIED PSYCHOLOGICAL MEASUREMENT, 1999, 23 (03) : 223 - 237
  • [45] Adaptive Testing and Debugging of NLP Models
    Ribeiro, Marco Tulio
    Lundberg, Scott
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 3253 - 3267
  • [46] SWCAT 1.0:: A SAS computer program for simulating computer adaptive testing
    Raîche, G
    Blais, JG
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2006, 30 (01) : 60 - 61
  • [47] Evaluating the Fairness of Discriminative Foundation Models in Computer Vision
    Ali, Junaid
    Kleindessner, Matthaus
    Wenzel, Florian
    Budhathoki, Kailash
    Cevher, Volkan
    Russell, Chris
    PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023, 2023, : 809 - 833
  • [48] MGMM: Multiresolution Gaussian mixture models for computer vision
    Wilson, R
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 212 - 215
  • [49] Generative and probability models in image processing and computer vision
    Potapov, A. S.
    JOURNAL OF OPTICAL TECHNOLOGY, 2015, 82 (08) : 495 - 498
  • [50] Computer Vision Models for Image Analysis in Advertising Research
    Li, Hairong
    Zhang, Nan
    JOURNAL OF ADVERTISING, 2024, 53 (05) : 771 - 790