ROTOGBML: TOWARDS OUT-OF-DISTRIBUTION GENERALIZATION FOR GRADIENT-BASED META-LEARNING

被引:0
|
作者
Zhang, Min [1 ,2 ]
Zhuang, Zifeng [2 ]
Wang, Zhitao [2 ]
Wang, Donglin [2 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Westlake Univ, Hangzhou, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024 | 2024年
关键词
In-distribution; Out-of-distribution; Few-shot image classification; Gradient-based meta-learning;
D O I
10.1109/ICME57554.2024.10687395
中图分类号
学科分类号
摘要
Gradient-based meta-learning (GBML) algorithms can quickly adapt to new tasks by transferring the learned meta-knowledge while assuming that all tasks come from the same distribution (in-distribution, ID). However, in the real world, they often grapple with an out-of-distribution (OOD) generalization challenge, where tasks stem from diverse distributions. OOD exacerbates discrepancies in task gradient magnitudes and directions, posing a formidable challenge for GBML in optimizing meta-knowledge by minimizing the sum of task gradients in each minibatch. To address this problem, we propose RotoGBML, a novel approach designed to homogenize OOD task gradients. RotoGBML employs reweighted vectors to dynamically balance diverse magnitudes to a standardized scale and uses rotation matrices to align conflicting directions. To reduce overhead, we homogenize gradients with the features rather than network parameters. Additionally, to circumvent the impact of non-causal features (e.g., backgrounds), we propose an Invariant Self-Information (ISI) module to extract invariant causal features (e.g., the outlines of objects). Finally, task gradients are homogenized based on these invariant causal features. Experiments demonstrate that RotoGBML outperforms state-of-the-art methods across various few-shot benchmarks.
引用
收藏
页数:6
相关论文
共 37 条
  • [1] SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
    Nguyen, Bac
    Uhlich, Stefan
    Cardinaux, Fabien
    Mauch, Lukas
    Edraki, Marzieh
    Courville, Aaron
    COMPUTER VISION - ECCV 2024, PT LXIX, 2025, 15127 : 138 - 154
  • [2] FLOOD: A Flexible Invariant Learning Framework for Out-of-Distribution Generalization on Graphs
    Liu, Yang
    Ao, Xiang
    Feng, Fuli
    Ma, Yunshan
    Li, Kuan
    Chua, Tat-Seng
    He, Qing
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 1548 - 1558
  • [3] Learning to Improve Out-of-Distribution Generalization via Self-Adaptive Language Masking
    Jiang, Shuoran
    Pan, Youcheng
    Chen, Qingcai
    Xiang, Yang
    Wu, Xiangping
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2739 - 2750
  • [4] LEARNING WITH OUT-OF-DISTRIBUTION DATA FOR AUDIO CLASSIFICATION
    Iqbal, Turab
    Cao, Yin
    Kong, Qiuqiang
    Plumbley, Mark D.
    Wang, Wenwu
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 636 - 640
  • [5] Dual-branch neural operator for enhanced out-of-distribution generalization
    Li, Jiacheng
    Yang, Min
    ENGINEERING ANALYSIS WITH BOUNDARY ELEMENTS, 2025, 171
  • [6] Diversify: A General Framework for Time Series Out-of-Distribution Detection and Generalization
    Lu, Wang
    Wang, Jindong
    Sun, Xinwei
    Chen, Yiqiang
    Ji, Xiangyang
    Yang, Qiang
    Xie, Xing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (06) : 4534 - 4550
  • [7] Enhancing Model Agnostic Meta-Learning via Gradient Similarity Loss
    Tak, Jae-Ho
    Hong, Byung-Woo
    ELECTRONICS, 2024, 13 (03)
  • [8] Towards out of distribution generalization for problems in mechanics
    Yuan, Lingxiao
    Park, Harold S.
    Lejeune, Emma
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2022, 400
  • [9] Machine and deep learning performance in out-of-distribution regressions
    Shmuel, Assaf
    Glickman, Oren
    Lazebnik, Teddy
    Machine Learning: Science and Technology, 2024, 5 (04):
  • [10] Out-of-distribution Internet Traffic Prediction Generalization Using Deep Sequence Model
    Saha, Sajal
    Haque, Anwar
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 3830 - 3835