Accuracy on the Line: On the Strong Correlation Between Out-of-Distribution and In-Distribution Generalization

被引:0
|
作者
Miller, John [1 ]
Taori, Rohan [2 ]
Raghunathan, Aditi [2 ]
Sagawa, Shiori [2 ]
Koh, Pang Wei [2 ]
Shankar, Vaishaal [1 ]
Liang, Percy [2 ]
Carmon, Yair [3 ]
Schmidt, Ludwig [4 ]
机构
[1] Univ Calif Berkeley, Dept Comp Sci, Berkeley, CA 94720 USA
[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[3] Tel Aviv Univ, Sch Comp Sci, Tel Aviv, Israel
[4] Toyota Res Inst, Cambridge, MA USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For machine learning systems to be reliable, we must understand their performance in unseen, out-of-distribution environments. In this paper, we empirically show that out-of-distribution performance is strongly correlated with in-distribution performance for a wide range of models and distribution shifts. Specifically, we demonstrate strong correlations between in-distribution and out-of-distribution performance on variants of CIFAR-10 & ImageNet, a synthetic pose estimation task derived from YCB objects, FMoW-WILDS satellite imagery classification, and wildlife classification in iWildCam-WILDS. The correlation holds across model architectures, hyperparameters, training set size, and training duration, and is more precise than what is expected from existing domain adaptation theory. To complete the picture, we also investigate cases where the correlation is weaker, for instance some synthetic distribution shifts from CIFAR-10-C and the tissue classification dataset Camelyon17-WILDS. Finally, we provide a candidate theory based on a Gaussian data model that shows how changes in the data covariance arising from distribution shift can affect the observed correlations.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Discovering causally invariant features for out-of-distribution generalization
    Wang, Yujie
    Yu, Kui
    Xiang, Guodu
    Cao, Fuyuan
    Liang, Jiye
    PATTERN RECOGNITION, 2024, 150
  • [32] Graph Out-of-Distribution Generalization With Controllable Data Augmentation
    Lu, Bin
    Zhao, Ze
    Gan, Xiaoying
    Liang, Shiyu
    Fu, Luoyi
    Wang, Xinbing
    Zhou, Chenghu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 6317 - 6329
  • [33] Probing out-of-distribution generalization in machine learning for materials
    Li, Kangming
    Rubungo, Andre Niyongabo
    Lei, Xiangyun
    Persaud, Daniel
    Choudhary, Kamal
    Decost, Brian
    Dieng, Adji Bousso
    Hattrick-Simpers, Jason
    COMMUNICATIONS MATERIALS, 2025, 6 (01)
  • [34] Tackling Domain Generalization for Out-of-Distribution Endoscopic Imaging
    Ali Teevno, Mansoor
    Ochoa-Ruiz, Gilberto
    Ali, Sharib
    MACHINE LEARNING IN MEDICAL IMAGING, PT II, MLMI 2024, 2025, 15242 : 43 - 52
  • [35] RetroOOD: Understanding Out-of-Distribution Generalization in Retrosynthesis Prediction
    Yu, Yemin
    Yuan, Luotian
    Wei, Ying
    Gao, Hanyu
    Wu, Fei
    Wang, Zhihua
    Ye, Xinhai
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 374 - 382
  • [36] Deep Relevant Feature Focusing for Out-of-Distribution Generalization
    Wang, Fawu
    Zhang, Kang
    Liu, Zhengyu
    Yuan, Xia
    Zhao, Chunxia
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 245 - 253
  • [37] Can Subnetwork Structure be the Key to Out-of-Distribution Generalization?
    Zhang, Dinghuai
    Ahuja, Kartik
    Xu, Yilun
    Wang, Yisen
    Courville, Aaron
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [38] Understanding and Improving Feature Learning for Out-of-Distribution Generalization
    Chen, Yongqiang
    Huang, Wei
    Zhou, Kaiwen
    Bian, Yatao
    Han, Bo
    Cheng, James
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [39] Face Reconstruction Transfer Attack as Out-of-Distribution Generalization
    June, Yoon Gyo
    Park, Jaewoo
    Dong, Xingbo
    Park, Hojin
    Teoh, Andrew Beng Jin
    Camps, Octavia
    COMPUTER VISION - ECCV 2024, PT LXXV, 2025, 15133 : 396 - 413
  • [40] Learning Invariant Graph Representations for Out-of-Distribution Generalization
    Li, Haoyang
    Zhang, Ziwei
    Wang, Xin
    Zhu, Wenwu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,