Meta-training with Demonstration Retrieval for Efficient Few-shot Learning

被引:0
|
作者
Mueller, Aaron [1 ]
Narang, Kanika [2 ]
Mathias, Lambert [2 ]
Wang, Qifan [2 ]
Firooz, Hamed [2 ]
机构
[1] Johns Hopkins Univ, Baltimore, MD 21218 USA
[2] Meta AI, Menlo Pk, CA USA
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023 | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language models show impressive results on few-shot NLP tasks. However, these models are memory and computation-intensive. Meta-training allows one to leverage smaller models for few-shot generalization in a domain-general and task-agnostic manner (Min et al., 2022a; Wei et al., 2022; Chen et al., 2022); however, these methods alone results in models that may not have sufficient parameterization or knowledge to adapt quickly to a large variety of tasks. To overcome this issue, we propose meta-training with demonstration retrieval, where we use a dense passage retriever to retrieve semantically similar labeled demonstrations to each example for more varied supervision. By separating external knowledge from model parameters, we can use meta-training to train parameter-efficient models that generalize well on a larger variety of tasks. We construct a meta-training set from UNIFIEDQA and CROSSFIT, and propose a demonstration bank based on UNIFIEDQA tasks. To our knowledge, our work is the first to combine retrieval with meta-training, to use DPR models to retrieve demonstrations, and to leverage demonstrations from many tasks simultaneously, rather than randomly sampling demonstrations from the training set of the target task. Our approach outperforms a variety of targeted parameter-efficient and retrieval-augmented few-shot methods on QA, NLI, and text classification tasks (including SQuAD, QNLI, and TREC). Our approach can be metatrained and fine-tuned quickly on a single GPU.
引用
收藏
页码:6049 / 6064
页数:16
相关论文
共 50 条
  • [1] Regularized Meta-Training with Embedding Mixup for Improved Few-Shot Learning
    Walsh, Reece
    Shehata, Mohamed
    ADVANCES IN VISUAL COMPUTING, ISVC 2023, PT II, 2023, 14362 : 177 - 187
  • [2] Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training
    Li, Jianwu
    Shi, Kaiyue
    Xie, Guo-Sen
    Liu, Xiaofeng
    Zhang, Jian
    Zhou, Tianfei
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 4, 2024, : 3109 - 3117
  • [3] Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning
    Shu, Yang
    Cao, Zhangjie
    Gao, Jinghan
    Wang, Jianmin
    Yu, Philip S.
    Long, Mingsheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15275 - 15291
  • [4] An Adversarial Meta-Training Framework for Cross-Domain Few-Shot Learning
    Tian, Pinzhuo
    Xie, Shaorong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6881 - 6891
  • [5] Pseudo-Labeling Based Practical Semi-Supervised Meta-Training for Few-Shot Learning
    Dong, Xingping
    Ouyang, Tianran
    Liao, Shengcai
    Du, Bo
    Shao, Ling
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5663 - 5675
  • [6] EMAS: Efficient Meta Architecture Search for Few-Shot Learning
    Liu, Dongkai
    Li, Jiaxing
    Chen, Honglong
    Liu, Baodi
    Lu, Xiaoping
    Liu, Weifeng
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 638 - 643
  • [7] Few-Shot Hash Learning for Image Retrieval
    Wang, Yu-Xiong
    Gui, Liangke
    Hebert, Martial
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 1228 - 1237
  • [8] Few-Shot Learning with Embedded Class Models and Shot-Free Meta Training
    Ravichandran, Avinash
    Bhotika, Rahul
    Soatto, Stefano
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 331 - 339
  • [9] Unsupervised meta-learning for few-shot learning
    Xu, Hui
    Wang, Jiaxing
    Li, Hao
    Ouyang, Deqiang
    Shao, Jie
    PATTERN RECOGNITION, 2021, 116
  • [10] Ornament image retrieval using few-shot learning
    Sk Maidul Islam
    Subhankar Joardar
    Arif Ahmed Sekh
    International Journal of Multimedia Information Retrieval, 2023, 12