Accuracy on the Line: On the Strong Correlation Between Out-of-Distribution and In-Distribution Generalization

被引:0
|
作者
Miller, John [1 ]
Taori, Rohan [2 ]
Raghunathan, Aditi [2 ]
Sagawa, Shiori [2 ]
Koh, Pang Wei [2 ]
Shankar, Vaishaal [1 ]
Liang, Percy [2 ]
Carmon, Yair [3 ]
Schmidt, Ludwig [4 ]
机构
[1] Univ Calif Berkeley, Dept Comp Sci, Berkeley, CA 94720 USA
[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[3] Tel Aviv Univ, Sch Comp Sci, Tel Aviv, Israel
[4] Toyota Res Inst, Cambridge, MA USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For machine learning systems to be reliable, we must understand their performance in unseen, out-of-distribution environments. In this paper, we empirically show that out-of-distribution performance is strongly correlated with in-distribution performance for a wide range of models and distribution shifts. Specifically, we demonstrate strong correlations between in-distribution and out-of-distribution performance on variants of CIFAR-10 & ImageNet, a synthetic pose estimation task derived from YCB objects, FMoW-WILDS satellite imagery classification, and wildlife classification in iWildCam-WILDS. The correlation holds across model architectures, hyperparameters, training set size, and training duration, and is more precise than what is expected from existing domain adaptation theory. To complete the picture, we also investigate cases where the correlation is weaker, for instance some synthetic distribution shifts from CIFAR-10-C and the tissue classification dataset Camelyon17-WILDS. Finally, we provide a candidate theory based on a Gaussian data model that shows how changes in the data covariance arising from distribution shift can affect the observed correlations.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Supervision Adaptation Balancing In-Distribution Generalization and Out-of-Distribution Detection
    Zhao, Zhilin
    Cao, Longbing
    Lin, Kun-Yu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15743 - 15758
  • [2] Towards In-Distribution Compatible Out-of-Distribution Detection
    Wu, Boxi
    Jiang, Jie
    Ren, Haidong
    Du, Zifan
    Wang, Wenxiao
    Li, Zhifeng
    Cai, Deng
    He, Xiaofei
    Lin, Binbin
    Liu, Wei
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10333 - 10341
  • [3] iDECODe: In-Distribution Equivariance for Conformal Out-of-Distribution Detection
    Kaur, Ramneet
    Jha, Susmit
    Roy, Anirban
    Park, Sangdon
    Dobriban, Edgar
    Sokolsky, Oleg
    Lee, Insup
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7104 - 7114
  • [4] Key Feature Replacement of In-Distribution Samples for Out-of-Distribution Detection
    Kim, Jaeyoung
    Kong, Seo Taek
    Na, Dongbin
    Jung, Kyu-Hwan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 8246 - 8254
  • [5] Certifiable Out-of-Distribution Generalization
    Ye, Nanyang
    Zhu, Lin
    Wang, Jia
    Zeng, Zhaoyu
    Shao, Jiayao
    Peng, Chensheng
    Pan, Bikang
    Li, Kaican
    Zhu, Jun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10927 - 10935
  • [6] Out-of-Distribution Detection by Cross-Class Vicinity Distribution of In-Distribution Data
    Zhao, Zhilin
    Cao, Longbing
    Lin, Kun-Yu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) : 13777 - 13788
  • [7] STEP : Out-of-Distribution Detection in the Presence of Limited In-distribution Labeled Data
    Zhou, Zhi
    Guo, Lan-Zhe
    Cheng, Zhanzhan
    Li, Yu-Feng
    Pu, Shiliang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [8] Out-of-Distribution Generalization in Kernel Regression
    Canatar, Abdulkadir
    Bordelon, Blake
    Pehlevan, Cengiz
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] Causal softmax for out-of-distribution generalization
    Luo, Jing
    Zhao, Wanqing
    Peng, Jinye
    DIGITAL SIGNAL PROCESSING, 2025, 156
  • [10] In-Distribution and Out-of-Distribution Self-Supervised ECG Representation Learning for Arrhythmia Detection
    Soltanieh, Sahar
    Hashemi, Javad
    Etemad, Ali
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (02) : 789 - 800