Invariance Principle Meets Information Bottleneck for Out-of-Distribution Generalization

被引:0
|
作者
Ahuja, Kartik [1 ]
Caballero, Ethan [1 ]
Zhang, Dinghuai [1 ]
Gagnon-Audet, Jean-Christophe [1 ]
Bengio, Yoshua [1 ]
Mitliagkas, Ioannis [1 ]
Rish, Irina [1 ]
机构
[1] Univ Montreal, Quebec AI Inst, Mila, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The invariance principle from causality is at the heart of notable approaches such as invariant risk minimization (IRM) that seek to address out-of-distribution (OOD) generalization failures. Despite the promising theory, invariance principle-based approaches fail in common classification tasks, where invariant (causal) features capture all the information about the label. Are these failures due to the methods failing to capture the invariance? Or is the invariance principle itself insufficient? To answer these questions, we revisit the fundamental assumptions in linear regression tasks, where invariance-based approaches were shown to provably generalize OOD. In contrast to the linear regression tasks, we show that for linear classification tasks we need much stronger restrictions on the distribution shifts, or otherwise OOD generalization is impossible. Furthermore, even with appropriate restrictions on distribution shifts in place, we show that the invariance principle alone is insufficient. We prove that a form of the information bottleneck constraint along with invariance helps address key failures when invariant features capture all the information about the label and also retains the existing success when they do not. We propose an approach that incorporates both of these principles and demonstrate its effectiveness in several experiments.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Counterfactual Supervision-Based Information Bottleneck for Out-of-Distribution Generalization
    Deng, Bin
    Jia, Kui
    ENTROPY, 2023, 25 (02)
  • [2] Certifiable Out-of-Distribution Generalization
    Ye, Nanyang
    Zhu, Lin
    Wang, Jia
    Zeng, Zhaoyu
    Shao, Jiayao
    Peng, Chensheng
    Pan, Bikang
    Li, Kaican
    Zhu, Jun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10927 - 10935
  • [3] Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generalization
    Qi, Jiaxin
    Tang, Kaihua
    Sun, Qianru
    Hua, Xian-Sheng
    Zhang, Hanwang
    COMPUTER VISION, ECCV 2022, PT XXV, 2022, 13685 : 92 - 109
  • [4] Individual and Structural Graph Information Bottlenecks for Out-of-Distribution Generalization
    Yang, Ling
    Zheng, Jiayi
    Wang, Heyuan
    Liu, Zhongyi
    Huang, Zhilin
    Hong, Shenda
    Zhang, Wentao
    Cui, Bin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (02) : 682 - 693
  • [5] Out-of-Distribution Generalization in Kernel Regression
    Canatar, Abdulkadir
    Bordelon, Blake
    Pehlevan, Cengiz
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [6] Causal softmax for out-of-distribution generalization
    Luo, Jing
    Zhao, Wanqing
    Peng, Jinye
    DIGITAL SIGNAL PROCESSING, 2025, 156
  • [7] Task-Oriented Communication with Out-of-Distribution Detection: An Information Bottleneck Framework
    Li, Hongru
    Yu, Wentao
    He, Hengtao
    Shao, Jiawei
    Song, Shenghui
    Zhang, Jun
    Letaief, Khaled B.
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 3136 - 3141
  • [8] Out-of-distribution generalization for learning quantum dynamics
    Caro, Matthias C.
    Huang, Hsin-Yuan
    Ezzell, Nicholas
    Gibbs, Joe
    Sornborger, Andrew T.
    Cincio, Lukasz
    Coles, Patrick J.
    Holmes, Zoe
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [9] On the Adversarial Robustness of Out-of-distribution Generalization Models
    Zou, Xin
    Liu, Weiwei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] On the Out-of-distribution Generalization of Probabilistic Image Modelling
    Zhang, Mingtian
    Zhang, Andi
    McDonagh, Steven
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34