Contextual Gradient Scaling for Few-Shot Learning

Cited by: 1

Authors:

Lee, Sanghyuk [1]

Lee, Seunghyun [1]

Song, Byung Cheol [1]

Affiliations:

[1] Inha Univ, Dept Elect & Comp Engn, Incheon, South Korea

Source:

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022) | 2022

Keywords:

DOI:

10.1109/WACV51458.2022.00356

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Model-agnostic meta-learning (MAML) is a well-known optimization-based meta-learning algorithm that works well in various computer vision tasks, e.g., few-shot classification. MAML learns an initialization from which a model can adapt to a new task in a few steps. However, since the gradient norm of the classifier (head) is much larger than those of the backbone layers, the model focuses on learning the decision boundary of the classifier with similar representations. Furthermore, the gradient norms of high-level layers are smaller than those of the other layers. As a result, the backbone of MAML usually learns task-generic features, which degrades adaptation performance in the inner loop. To resolve or mitigate this problem, we propose contextual gradient scaling (CxGrad), which scales the gradient norms of the backbone to facilitate learning task-specific knowledge in the inner loop. Since the scaling factors are generated from task-conditioned parameters, the gradient norms of the backbone can be scaled in a task-wise fashion. Experimental results show that CxGrad effectively encourages the backbone to learn task-specific knowledge in the inner loop and improves the performance of MAML by a significant margin in both same- and cross-domain few-shot classification.
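The abstract describes the idea but not the exact update rule, so the following is only an illustrative sketch of a task-wise scaled inner-loop step, not the authors' implementation. It assumes that each backbone layer's gradient is multiplied by a positive factor gamma computed from a task context vector, while the head gradient is left unscaled. The names `scaling_factor`, `inner_step`, `context`, and `phi`, as well as the softplus-of-a-linear-map generator, are hypothetical choices made for illustration.

```python
import math


def l2_norm(vec):
    """Euclidean norm of a flat list of floats."""
    return math.sqrt(sum(v * v for v in vec))


def scaling_factor(context, phi):
    """Hypothetical generator: map a task context vector to a positive scalar.

    Assumption: softplus(context . phi). The paper's generator may differ.
    """
    z = sum(c * p for c, p in zip(context, phi))
    return math.log1p(math.exp(z))


def inner_step(params, grads, context, phi, lr=0.1):
    """One inner-loop gradient step with task-wise scaling of backbone gradients.

    Layers whose name starts with "backbone" get their gradient multiplied by a
    task-conditioned gamma; all other layers (e.g. the head) use plain SGD.
    """
    adapted = {}
    for name, weights in params.items():
        grad = grads[name]
        if name.startswith("backbone"):
            gamma = scaling_factor(context, phi[name])
            grad = [gamma * g for g in grad]  # gradient norm is scaled by gamma
        adapted[name] = [w - lr * g for w, g in zip(weights, grad)]
    return adapted


# Toy example: one backbone layer and a head, with made-up gradients.
params = {"backbone.0": [1.0, 2.0], "head": [0.5]}
grads = {"backbone.0": [0.3, 0.4], "head": [2.0]}
context = [1.0, 0.0]                 # hypothetical task embedding
phi = {"backbone.0": [0.0, 0.0]}     # hypothetical task-conditioned parameters
adapted = inner_step(params, grads, context, phi, lr=0.1)
```

With `phi` set to zeros, gamma equals softplus(0) = ln 2, so the backbone step is shrunk relative to plain SGD while the head step is unchanged. In a meta-learning setup, `phi` would be learned in the outer loop so that gamma can grow or shrink per task.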

Citation

Pages: 3503-3512

Page count: 10

