Full-Spectrum Out-of-Distribution Detection

Cited by: 14

Authors:

Yang, Jingkang [1]

Zhou, Kaiyang [1]

Liu, Ziwei [1]

Affiliations:

[1] Nanyang Technol Univ, S Lab, Singapore, Singapore

Source:

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2023 / Vol. 131 / No. 10

Keywords:

Out-of-distribution detection; AI safety; Model trustworthy;

DOI:

10.1007/s11263-023-01811-z

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Existing out-of-distribution (OOD) detection literature clearly defines semantic shift as a sign of OOD but does not have a consensus over covariate shift. Samples experiencing covariate shift but not semantic shift from the in-distribution (ID) are either excluded from the test set or treated as OOD, which contradicts the primary goal in machine learning: being able to generalize beyond the training distribution. In this paper, we take into account both shift types and introduce full-spectrum OOD (F-OOD) detection, a more realistic problem setting that considers both detecting semantic shift and being tolerant to covariate shift; and design three benchmarks. These new benchmarks have a more fine-grained categorization of distributions (i.e., training ID, covariate-shifted ID, near-OOD, and far-OOD) for the purpose of more comprehensively evaluating the pros and cons of algorithms. To address the F-OOD detection problem, we propose SEM, a simple feature-based semantics score function. SEM is mainly composed of two probability measures: one is based on high-level features containing both semantic and non-semantic information, while the other is based on low-level feature statistics only capturing non-semantic image styles. With a simple combination, the non-semantic part is canceled out, which leaves only semantic information in SEM that can better handle F-OOD detection. Extensive experiments on the three new benchmarks show that SEM significantly outperforms current state-of-the-art methods. Our code and benchmarks are released in https://github.com/Jingkang50/OpenOOD.
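
The cancellation idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not say which density estimators or features SEM uses, so the diagonal Gaussian below, the two-dimensional synthetic "features", and the names `fit_gaussian`, `log_density`, and `semantic_score` are all assumptions made for illustration. The sketch treats the score as a difference of two log-densities, one over features that mix semantics and style and one over style alone, so that the style term drops out.

```python
import numpy as np

def fit_gaussian(feats):
    """Fit a diagonal Gaussian (assumed stand-in density model)."""
    mean = feats.mean(axis=0)
    var = feats.var(axis=0) + 1e-6  # small floor for numerical stability
    return mean, var

def log_density(feats, mean, var):
    """Log-density of each row under the diagonal Gaussian."""
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (feats - mean) ** 2 / var, axis=1)

def semantic_score(high, low, high_model, low_model):
    """Hypothetical score: log p(high-level feats) - log p(low-level style stats)."""
    return log_density(high, *high_model) - log_density(low, *low_model)

# Synthetic training data: one "semantic" and one "style" dimension.
rng = np.random.default_rng(0)
train_sem = rng.normal(size=(1000, 1))
train_style = rng.normal(size=(1000, 1))
high_model = fit_gaussian(np.hstack([train_sem, train_style]))  # semantics + style
low_model = fit_gaussian(train_style)                           # style only

# Three test points: plain ID, covariate-shifted ID, semantic OOD.
test_high = np.array([[0.0, 0.0], [0.0, 3.0], [5.0, 0.0]])
test_low = test_high[:, 1:]  # the style dimension alone
scores = semantic_score(test_high, test_low, high_model, low_model)
```

In this toy setup the covariate-shifted point (style moved, semantics unchanged) receives essentially the same score as the plain ID point, while the semantically shifted point scores much lower. Real image features are not cleanly separable into independent semantic and style dimensions, so the exact cancellation here is a property of the synthetic data, not a claim about the method's behavior in practice.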

Citation

Pages: 2607-2622

Page count: 16

