Accuracy on the Line: On the Strong Correlation Between Out-of-Distribution and In-Distribution Generalization

被引：0

作者：

Miller, John ^{[1
]}

Taori, Rohan ^{[2
]}

Raghunathan, Aditi ^{[2
]}

Sagawa, Shiori ^{[2
]}

Koh, Pang Wei ^{[2
]}

Shankar, Vaishaal ^{[1
]}

Liang, Percy ^{[2
]}

Carmon, Yair ^{[3
]}

Schmidt, Ludwig ^{[4
]}

机构：

[1] Univ Calif Berkeley, Dept Comp Sci, Berkeley, CA 94720 USA

[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

[3] Tel Aviv Univ, Sch Comp Sci, Tel Aviv, Israel

[4] Toyota Res Inst, Cambridge, MA USA

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139 | 2021年 / 139卷

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For machine learning systems to be reliable, we must understand their performance in unseen, out-of-distribution environments. In this paper, we empirically show that out-of-distribution performance is strongly correlated with in-distribution performance for a wide range of models and distribution shifts. Specifically, we demonstrate strong correlations between in-distribution and out-of-distribution performance on variants of CIFAR-10 & ImageNet, a synthetic pose estimation task derived from YCB objects, FMoW-WILDS satellite imagery classification, and wildlife classification in iWildCam-WILDS. The correlation holds across model architectures, hyperparameters, training set size, and training duration, and is more precise than what is expected from existing domain adaptation theory. To complete the picture, we also investigate cases where the correlation is weaker, for instance some synthetic distribution shifts from CIFAR-10-C and the tissue classification dataset Camelyon17-WILDS. Finally, we provide a candidate theory based on a Gaussian data model that shows how changes in the data covariance arising from distribution shift can affect the observed correlations.

引用

页数：15

共 50 条

[31] Discovering causally invariant features for out-of-distribution generalization
Wang, Yujie
Yu, Kui
Xiang, Guodu
Cao, Fuyuan
Liang, Jiye
PATTERN RECOGNITION, 2024, 150
[32] Graph Out-of-Distribution Generalization With Controllable Data Augmentation
Lu, Bin
Zhao, Ze
Gan, Xiaoying
Liang, Shiyu
Fu, Luoyi
Wang, Xinbing
Zhou, Chenghu
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 6317 - 6329
[33] Probing out-of-distribution generalization in machine learning for materials
Li, Kangming
Rubungo, Andre Niyongabo
Lei, Xiangyun
Persaud, Daniel
Choudhary, Kamal
Decost, Brian
Dieng, Adji Bousso
Hattrick-Simpers, Jason
COMMUNICATIONS MATERIALS, 2025, 6 (01)
[34] Tackling Domain Generalization for Out-of-Distribution Endoscopic Imaging
Ali Teevno, Mansoor
Ochoa-Ruiz, Gilberto
Ali, Sharib
MACHINE LEARNING IN MEDICAL IMAGING, PT II, MLMI 2024, 2025, 15242 : 43 - 52
[35] RetroOOD: Understanding Out-of-Distribution Generalization in Retrosynthesis Prediction
Yu, Yemin
Yuan, Luotian
Wei, Ying
Gao, Hanyu
Wu, Fei
Wang, Zhihua
Ye, Xinhai
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 374 - 382
[36] Deep Relevant Feature Focusing for Out-of-Distribution Generalization
Wang, Fawu
Zhang, Kang
Liu, Zhengyu
Yuan, Xia
Zhao, Chunxia
PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 245 - 253
[37] Can Subnetwork Structure be the Key to Out-of-Distribution Generalization?
Zhang, Dinghuai
Ahuja, Kartik
Xu, Yilun
Wang, Yisen
Courville, Aaron
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[38] Understanding and Improving Feature Learning for Out-of-Distribution Generalization
Chen, Yongqiang
Huang, Wei
Zhou, Kaiwen
Bian, Yatao
Han, Bo
Cheng, James
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[39] Face Reconstruction Transfer Attack as Out-of-Distribution Generalization
June, Yoon Gyo
Park, Jaewoo
Dong, Xingbo
Park, Hojin
Teoh, Andrew Beng Jin
Camps, Octavia
COMPUTER VISION - ECCV 2024, PT LXXV, 2025, 15133 : 396 - 413
[40] Learning Invariant Graph Representations for Out-of-Distribution Generalization
Li, Haoyang
Zhang, Ziwei
Wang, Xin
Zhu, Wenwu
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,

← 1 2 3 4 5 →