Few-shot learning with adaptively initialized task optimizer: a practical meta-learning approach

被引:34
|
作者
Ye, Han-Jia [1 ]
Sheng, Xiang-Rong [1 ]
Zhan, De-Chuan [1 ]
机构
[1] Nanjing Univ, Nanjing, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Few-shot learning; Meta-learning; Supervised-learning; Multi-task learning; Task-specific; MODEL;
D O I
10.1007/s10994-019-05838-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Considering the data collection and labeling cost in real-world applications, training a model with limited examples is an essential problem in machine learning, visual recognition, etc. Directly training a model on such few-shot learning (FSL) tasks falls into the over-fitting dilemma, which would turn to an effective task-level inductive bias as a key supervision. By treating the few-shot task as an entirety, extracting task-level pattern, and learning a task-agnostic model initialization, the model-agnostic meta-learning (MAML) framework enables the applications of various models on the FSL tasks. Given a training set with a few examples, MAML optimizes a model via fixed gradient descent steps from an initial point chosen beforehand. Although this general framework possesses empirically satisfactory results, its initialization neglects the task-specific characteristics and aggravates the computational burden as well. In this manuscript, we propose our AdaptiVely InitiAlized Task OptimizeR (Aviator) approach for few-shot learning, which incorporates task context into the determination of the model initialization. This task-specific initialization facilitates the model optimization process so that it obtains high-quality model solutions efficiently. To this end, we decouple the model and apply a set transformation over the training set to determine the initial top-layer classifier. Re-parameterization of the first-order gradient descent approximation promotes the gradient back-propagation. Experiments on synthetic and benchmark data sets validate that our Aviator approach achieves the state-of-the-art performance, and visualization results demonstrate the task-adaptive features of our proposed Aviator method.
引用
收藏
页码:643 / 664
页数:22
相关论文
共 50 条
  • [21] Few-Shot Named Entity Recognition via Meta-Learning
    Li, Jing
    Chiu, Billy
    Feng, Shanshan
    Wang, Hao
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (09) : 4245 - 4256
  • [22] Few-Shot Human Motion Prediction via Meta-learning
    Gui, Liang-Yan
    Wang, Yu-Xiong
    Ramanan, Deva
    Moura, Jose M. F.
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 441 - 459
  • [23] Meta-Learning With Relation Embedding for Few-Shot Deepfake Detection
    Liu, Xiaoyong
    Song, Pengcheng
    Lu, Pei
    Wang, Yanjun
    IEEE ACCESS, 2024, 12 : 180135 - 180145
  • [24] Learning to Diagnose: Meta-Learning for Efficient Adaptation in Few-Shot AIOps Scenarios
    Duan, Yunfeng
    Bao, Haotong
    Bai, Guotao
    Wei, Yadong
    Xue, Kaiwen
    You, Zhangzheng
    Zhang, Yuantian
    Liu, Bin
    Chen, Jiaxing
    Wang, Shenhuan
    Ou, Zhonghong
    ELECTRONICS, 2024, 13 (11)
  • [25] Prior-knowledge and attention based meta-learning for few-shot learning
    Qin, Yunxiao
    Zhang, Weiguo
    Zhao, Chenxu
    Wang, Zezheng
    Zhu, Xiangyu
    Shi, Jingping
    Qi, Guojun
    Lei, Zhen
    KNOWLEDGE-BASED SYSTEMS, 2021, 213
  • [26] Few-Shot Load Forecasting Under Data Scarcity in Smart Grids: A Meta-Learning Approach
    Tsoumplekas, Georgios
    Athanasiadis, Christos
    Doukas, Dimitrios I.
    Chrysopoulos, Antonios
    Mitkas, Pericles
    ENERGIES, 2025, 18 (03)
  • [27] A metric-based meta-learning approach combined attention mechanism and ensemble learning for few-shot learning
    Guo, Nan
    Di, Kexin
    Liu, Hongyan
    Wang, Yifei
    Qiao, Junfei
    DISPLAYS, 2021, 70
  • [28] Contrastive Meta-Learning for Few-shot Node Classification
    Wang, Song
    Tan, Zhen
    Liu, Huan
    Li, Jundong
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 2386 - 2397
  • [29] Decomposed Meta-Learning for Few-Shot Sequence Labeling
    Ma, Tingting
    Wu, Qianhui
    Jiang, Huiqiang
    Lin, Jieru
    Karlsson, Borje F.
    Zhao, Tiejun
    Lin, Chin-Yew
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1980 - 1993
  • [30] Meta-Learning for Few-Shot Plant Disease Detection
    Chen, Liangzhe
    Cui, Xiaohui
    Li, Wei
    FOODS, 2021, 10 (10)