ROTOGBML: TOWARDS OUT-OF-DISTRIBUTION GENERALIZATION FOR GRADIENT-BASED META-LEARNING

被引:0
|
作者
Zhang, Min [1 ,2 ]
Zhuang, Zifeng [2 ]
Wang, Zhitao [2 ]
Wang, Donglin [2 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Westlake Univ, Hangzhou, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024 | 2024年
关键词
In-distribution; Out-of-distribution; Few-shot image classification; Gradient-based meta-learning;
D O I
10.1109/ICME57554.2024.10687395
中图分类号
学科分类号
摘要
Gradient-based meta-learning (GBML) algorithms can quickly adapt to new tasks by transferring the learned meta-knowledge while assuming that all tasks come from the same distribution (in-distribution, ID). However, in the real world, they often grapple with an out-of-distribution (OOD) generalization challenge, where tasks stem from diverse distributions. OOD exacerbates discrepancies in task gradient magnitudes and directions, posing a formidable challenge for GBML in optimizing meta-knowledge by minimizing the sum of task gradients in each minibatch. To address this problem, we propose RotoGBML, a novel approach designed to homogenize OOD task gradients. RotoGBML employs reweighted vectors to dynamically balance diverse magnitudes to a standardized scale and uses rotation matrices to align conflicting directions. To reduce overhead, we homogenize gradients with the features rather than network parameters. Additionally, to circumvent the impact of non-causal features (e.g., backgrounds), we propose an Invariant Self-Information (ISI) module to extract invariant causal features (e.g., the outlines of objects). Finally, task gradients are homogenized based on these invariant causal features. Experiments demonstrate that RotoGBML outperforms state-of-the-art methods across various few-shot benchmarks.
引用
收藏
页数:6
相关论文
共 37 条
  • [31] Real-time Out-of-distribution Detection in Learning-Enabled Cyber-Physical Systems
    Cai, Feiyang
    Koutsoukos, Xenofon
    2020 ACM/IEEE 11TH INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS 2020), 2020, : 174 - 183
  • [32] Real-time out-of-distribution detection in cyber-physical systems with learning-enabled components
    Cai, Feiyang
    Koutsoukos, Xenofon
    IET CYBER-PHYSICAL SYSTEMS: THEORY & APPLICATIONS, 2022, 7 (04) : 212 - 234
  • [33] A deep semi-supervised learning approach to the detection of glaucoma on out-of-distribution retinal fundus image datasets
    Lei Wang
    Xiaoyun Zhang
    Zhongwen Li
    Shuchen Yu
    Yabo Wu
    Shaodan Zhang
    Gaoqiang Jiang
    Bihan Tian
    Chenyang Mei
    Jiantao Pu
    Yuanbo Liang
    Quanyong Yi
    Wencan Wu
    BMC Ophthalmology, 25 (1)
  • [34] SPN: A Method of Few-Shot Traffic Classification With Out-of-Distribution Detection Based on Siamese Prototypical Network
    Miao, Gongxun
    Wu, Guohua
    Zhang, Zhen
    Tong, Yongjie
    Lu, Bing
    IEEE ACCESS, 2023, 11 : 114403 - 114414
  • [35] Out-of-Distribution Detection Based on Feature Fusion in Neural Network Classifier Pre-Trained by PEDCC-Loss
    Zhu, Qiuyu
    Zheng, Guohui
    Shen, Jiakang
    Wang, Rui
    IEEE ACCESS, 2022, 10 : 66190 - 66197
  • [36] Navigating Out-of-Distribution Electricity Load Forecasting during COVID-19: Benchmarking energy load forecasting models without and with continual learning
    Prabowo, Arian
    Chen, Kaixuan
    Xue, Hao
    Sethuvenkatraman, Subbu
    Salim, Flora D.
    PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2023, 2023, : 41 - 50
  • [37] A graph neural network-based model with out-of-distribution robustness for enhancing antiretroviral therapy outcome prediction for HIV-1
    Di Teodoro, Giulia
    Siciliano, Federico
    Guarrasi, Valerio
    Vandamme, Anne-Mieke
    Ghisetti, Valeria
    Soennerborg, Anders
    Zazzi, Maurizio
    Silvestri, Fabrizio
    Palagi, Laura
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2025, 120