Deep Stable Learning for Out-Of-Distribution Generalization

Cited by: 190
Authors
Zhang, Xingxuan [1]
Cui, Peng [1, 2]
Xu, Renzhe [1]
Zhou, Linjun [1]
He, Yue [1]
Shen, Zheyan [1]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci, Beijing, Peoples R China
[2] Beijing Key Lab Networked Multimedia, Beijing, Peoples R China
Source
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021
Funding
National Key R&D Program of China; National Natural Science Foundation of China
Keywords
DOI
10.1109/CVPR46437.2021.00533
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Approaches based on deep neural networks achieve striking performance when the testing and training data share a similar distribution, but can fail significantly otherwise. Eliminating the impact of distribution shifts between training and testing data is therefore crucial for building deep models with promising performance. Conventional methods assume either that the heterogeneity of the training data is known (e.g. domain labels) or that the different domains have approximately equal capacities. In this paper, we consider a more challenging case where neither of these assumptions holds. We propose to address this problem by removing the dependencies between features through learning weights for training samples, which helps deep models get rid of spurious correlations and, in turn, concentrate on the true connection between discriminative features and labels. Extensive experiments on distribution generalization benchmarks including PACS, VLCS, MNIST-M, and NICO demonstrate the effectiveness of our method compared with state-of-the-art counterparts.
Pages: 5368-5378
Number of pages: 11