Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation

被引:0
|
作者
Chen, Xiusi [1 ]
Zhang, Yu [2 ]
Deng, Jinliang [3 ]
Jiang, Jyun-Yu [4 ]
Wang, Wei [1 ]
机构
[1] Univ Calif Los Angeles, Los Angeles, CA 90095 USA
[2] Univ Illinois, Urbana, IL USA
[3] Univ Technol Sydney, Sydney, NSW, Australia
[4] Amazon Search, Palo Alto, CA USA
关键词
question answering; knowledge base; entity; data augmentation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot question answering (QA) aims at precisely discovering answers to a set of questions from context passages while only a few training samples are available. Although existing studies have made some progress and can usually achieve proper results, they suffer from understanding deep semantics for reasoning out the questions. In this paper, we develop Gotta, a Generative prOmpTbased daTa Augmentation framework to mitigate the challenge above. Inspired by the human reasoning process, we propose to integrate the doze task to enhance few-shot QA learning. Following the recent success of prompt-tuning, we present the doze task in the same format as the main QA task, allowing the model to learn both tasks seamlessly together to fully take advantage of the power of prompt-tuning. Extensive experiments on widely used benchmarks demonstrate that Gotta consistently outperforms competitive baselines, validating the effectiveness of our proposed prompt -tuning -based doze task, which not only fine-tunes language models but also learns to guide reasoning in QA tasks. Further analysis shows that the prompt-based loss incorporates the auxiliary task better than the multi -task loss, highlighting the strength of prompt-tuning on the few-shot QA task.
引用
收藏
页码:909 / 917
页数:9
相关论文
共 50 条
  • [21] VulPrompt: Prompt-Based Vulnerability Detection Using Few-Shot Graph Learning
    Irtiza, Saquib
    Li, Xiaodi
    Zamani, Mahmoud
    Khan, Latifur
    Hamlen, Kevin W.
    DATA AND APPLICATIONS SECURITY AND PRIVACY XXXVIII, DBSEC 2024, 2024, 14901 : 221 - 240
  • [22] PMRC: Prompt-Based Machine Reading Comprehension for Few-Shot Named Entity Recognition
    Huang, Jin
    Yan, Danfeng
    Cai, Yuanqiang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 18316 - 18326
  • [23] Prompt-Based Self-training Framework for Few-Shot Named Entity Recognition
    Huang, Ganghong
    Zhong, Jiang
    Wang, Chen
    Dai, Qizhu
    Li, Rongzhen
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2022, PT III, 2022, 13370 : 91 - 103
  • [24] Few-Shot Question Answering by Pretraining Span Selection
    Ram, Ori
    Kirstain, Yuval
    Berant, Jonathan
    Globerson, Amir
    Levy, Omer
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 3066 - 3079
  • [25] Cross-language few-shot intent recognition via prompt-based tuning
    Cao, Pei
    Li, Yu
    Li, Xinlu
    APPLIED INTELLIGENCE, 2025, 55 (01)
  • [26] Few-shot Log Analysis with Prompt-based Multi-task Transfer Learning
    Zhou, Mingjie
    Yang, Weidong
    Ma, Lipeng
    Jiang, Sihang
    Xu, Bo
    Xiao, Yanghua
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT 2, 2025, 14851 : 466 - 475
  • [27] Zero- and Few-Shot Event Detection via Prompt-Based Meta Learning
    Yue, Zhenrui
    Zeng, Huimin
    Lan, Mengfei
    Ji, Heng
    Wang, Dong
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 7928 - 7943
  • [28] Few-shot imbalanced classification based on data augmentation
    Chao, Xuewei
    Zhang, Lixin
    MULTIMEDIA SYSTEMS, 2023, 29 (05) : 2843 - 2851
  • [29] OmniTab: Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering
    Jiang, Zhengbao
    Mao, Yi
    He, Pengcheng
    Neubig, Graham
    Chen, Weizhu
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 932 - 942
  • [30] Few-shot imbalanced classification based on data augmentation
    Xuewei Chao
    Lixin Zhang
    Multimedia Systems, 2023, 29 : 2843 - 2851