Meta-Learning with a Geometry-Adaptive Preconditioner

被引:14
|
作者
Kang, Suhyun [1 ]
Hwang, Duhun [1 ]
Eo, Moonjung [1 ]
Kim, Taesup [2 ]
Rhee, Wonjong [1 ,3 ,4 ]
机构
[1] Seoul Natl Univ, Dept Intelligence & Informat, Seoul, South Korea
[2] Seoul Natl Univ, Grad Sch Data Sci, Seoul, South Korea
[3] Seoul Natl Univ, IPAI, Seoul, South Korea
[4] Seoul Natl Univ, AIIS, Seoul, South Korea
关键词
D O I
10.1109/CVPR52729.2023.01543
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Model-agnostic meta-learning (MAML) is one of the most successful meta-learning algorithms. It has a bi-level optimization structure where the outer-loop process learns a shared initialization and the inner-loop process optimizes task-specific weights. Although MAML relies on the standard gradient descent in the inner-loop, recent studies have shown that controlling the inner-loop's gradient descent with a meta-learned preconditioner can be beneficial. Existing preconditioners, however, cannot simultaneously adapt in a task-specific and path-dependent way. Additionally, they do not satisfy the Riemannian metric condition, which can enable the steepest descent learning with preconditioned gradient. In this study, we propose Geometry-Adaptive Pre-conditioned gradient descent (GAP) that can overcome the limitations in MAML; GAP can efficiently meta-learn a preconditioner that is dependent on task-specific parameters, and its preconditioner can be shown to be a Riemannian metric. Thanks to the two properties, the geometryadaptive preconditioner is effective for improving the innerloop optimization. Experiment results show that GAP outperforms the state-of-the-art MAML family and preconditioned gradient descent-MAML (PGD-MAML) family in a variety of few-shot learning tasks. Code is available at: https://github.com/ Suhyun777/CVPR23-GAP.
引用
收藏
页码:16080 / 16090
页数:11
相关论文
共 50 条
  • [21] Domain Adaptive Meta-Learning for Dialogue State Tracking
    Zeng, Jiali
    Yin, Yongjing
    Liu, Yang
    Ge, Yubin
    Su, Jinsong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2493 - 2501
  • [22] Non-monotone Adaptive Submodular Meta-Learning
    Tang, Shaojie
    Yuan, Jing
    PROCEEDINGS OF THE 2021 SIAM CONFERENCE ON APPLIED AND COMPUTATIONAL DISCRETE ALGORITHMS, ACDA21, 2021, : 57 - 65
  • [23] TGOnline: Enhancing Temporal Graph Learning with Adaptive Online Meta-Learning
    Wang, Ruijie
    Huang, Jingyuan
    Zhang, Yutong
    Li, Jinyang
    Wang, Yufeng
    Zhao, Wanyu
    Liu, Shengzhong
    Mendis, Charith
    Abdelzaher, Tarek
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 1659 - 1669
  • [24] GEOMETRY-ADAPTIVE SURFACE GRID GENERATION USING A PARAMETRIC PROJECTION
    LEE, KD
    LOELLBACH, JM
    JOURNAL OF AIRCRAFT, 1989, 26 (02): : 162 - 167
  • [25] Learning Meta-Learning (LML) dataset: Survey data of meta-learning parameters
    Corraya, Sonia
    Al Mamun, Shamim
    Kaiser, M. Shamim
    DATA IN BRIEF, 2023, 51
  • [26] Adaptive Multi-Teacher Knowledge Distillation with Meta-Learning
    Zhang, Hailin
    Chen, Defang
    Wang, Can
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1943 - 1948
  • [27] A Novel Hierarchical Adaptive Feature Fusion Method for Meta-Learning
    Ding, Enjie
    Chu, Xu
    Liu, Zhongyu
    Zhang, Kai
    Yu, Qiankun
    APPLIED SCIENCES-BASEL, 2022, 12 (11):
  • [28] SPEAKER ADAPTIVE TRAINING USING MODEL AGNOSTIC META-LEARNING
    Klejch, Ondrej
    Fainberg, Joachim
    Bell, Peter
    Renals, Steve
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 881 - 888
  • [29] Bayesian Meta-Learning for Adaptive Traffic Prediction in Wireless Networks
    Wang, Zihuan
    Wong, Vincent W. S.
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (06) : 6620 - 6633
  • [30] Geometry-Adaptive Motion Partitioning Using Improved Temporal Prediction
    Blaeser, Max
    Heithausen, Cordula
    Wien, Mathias
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,