ViM: Out-Of-Distribution with Virtual-logit Matching

被引：140

作者：

Wang, Haoqi ^{[1
]}

Li, Zhizhong ^{[1
]}

Feng, Litong ^{[1
]}

Zhang, Wayne ^{[1
,2
]}

机构：

[1] SenseTime Res, Hong Kong, Peoples R China

[2] Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Shanghai, Peoples R China

来源：

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年

关键词：

D O I：

10.1109/CVPR52688.2022.00487

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most of the existing Out-Of-Distribution (OOD) detection algorithms depend on single input source: the feature, the logit, or the softmax probability. However, the immense diversity of the OOD examples makes such methods fragile. There are OOD samples that are easy to identify in the feature space while hard to distinguish in the logit space and vice versa. Motivated by this observation, we propose a novel OOD scoring method named Virtual-logit Matching (ViM), which combines the class-agnostic score from feature space and the In-Distribution (ID) class-dependent logits. Specifically, an additional logit representing the virtual OOD class is generated from the residual of the feature against the principal space, and then matched with the original logits by a constant scaling. The probability of this virtual logit after softmax is the indicator of OOD-ness. To facilitate the evaluation of large-scale OOD detection in academia, we create a new OOD dataset for ImageNet-1K, which is human-annotated and is 8.8x the size of existing datasets. We conducted extensive experiments, including CNNs and vision transformers, to demonstrate the effectiveness of the proposed ViM score. In particular, using the BiT-S model, our method gets an average AUROC 90.91% on four difficult OOD benchmarks, which is 4% ahead of the best baseline. Code and dataset are available at https://github.com/haoqiwang/vim.

引用

页码：4911 / 4920

页数：10

共 40 条

[1] MVTec AD - A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection [J].

Bergmann, Paul ;

Fauser, Michael ;

Sattlegger, David ;

Steger, Carsten .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9584-9592

[2] Describing Textures in the Wild [J].

Cimpoi, Mircea ;

Maji, Subhransu ;

Kokkinos, Iasonas ;

Mohamed, Sammy ;

Vedaldi, Andrea .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3606-3613

[3]

Cook Matthew, 2020, ARXIV200701263

[4]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[5]

DeVries T., 2018, ARXIV180204865

[6]

Dhamija AR, 2018, ADV NEUR IN, V31

[7] RepVGG: Making VGG-style ConvNets Great Again [J].

Ding, Xiaohan ;

Zhang, Xiangyu ;

Ma, Ningning ;

Han, Jungong ;

Ding, Guiguang ;

Sun, Jian .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13728-13737

[8]

Dosovitskiy A, 2020, ARXIV

[9]

Drummond N., 2006, ESI WORKSH CLOS WORL, V15

[10]

Fort S, 2021, ADV NEUR IN, V34

← 1 2 3 4 →