MINPROMPT: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering

被引:0
|
作者
Chen, Xiusi [1 ]
Jiang, Jyun-Yu [2 ]
Chang, Wei-Cheng [2 ]
Hsieh, Cho-Jui [1 ]
Yu, Hsiang-Fu [2 ]
Wang, Wei [1 ]
机构
[1] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
[2] Amazon Search, Seattle, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in few-shot question answering (QA) mostly rely on the power of pre-trained large language models (LLMs) and fine-tuning in specific settings. Although the pre-training stage has already equipped LLMs with powerful reasoning capabilities, LLMs still need to be fine-tuned to adapt to specific domains to achieve the best results. In this paper, we propose to select the most informative data for fine-tuning, thereby improving the efficiency of the fine-tuning process with comparative or even better accuracy on the open-domain QA task. We present MINPROMPT, a minimal data augmentation framework for open-domain QA based on an approximate graph algorithm and unsupervised question generation. We transform the raw text into a graph structure to build connections between different factual sentences, then apply graph algorithms to identify the minimal set of sentences needed to cover the most information in the raw text. We then generate QA pairs based on the identified sentence subset and train the model on the selected sentences to obtain the final model. Empirical results on several benchmark datasets and theoretical analysis show that MINPROMPT is able to achieve comparable or better results than baselines with a high degree of efficiency, bringing consistent improvements in F-1 scores.
引用
收藏
页码:254 / 266
页数:13
相关论文
共 50 条
  • [1] Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation
    Chen, Xiusi
    Zhang, Yu
    Deng, Jinliang
    Jiang, Jyun-Yu
    Wang, Wei
    PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 909 - 917
  • [2] Prompt Based CVAE Data Augmentation for Few-Shot Intention Detection
    Xue, Junhao
    Yin, Chuantao
    Li, Chen
    Bai, Jun
    Chen, Hui
    Rong, Wenge
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, KSEM 2024, 2024, 14886 : 312 - 323
  • [3] Prompt-Based Data Augmentation Framework for Few-Shot Named Entity Recognition
    Wang, Moyao
    Gao, Hui
    Zhang, Peng
    Zhang, Jing
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT III, ICIC 2024, 2024, 14877 : 451 - 462
  • [4] Few-Shot Question Answering by Pretraining Span Selection
    Ram, Ori
    Kirstain, Yuval
    Berant, Jonathan
    Globerson, Amir
    Levy, Omer
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 3066 - 3079
  • [5] Few-shot imbalanced classification based on data augmentation
    Chao, Xuewei
    Zhang, Lixin
    MULTIMEDIA SYSTEMS, 2023, 29 (05) : 2843 - 2851
  • [6] OmniTab: Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering
    Jiang, Zhengbao
    Mao, Yi
    He, Pengcheng
    Neubig, Graham
    Chen, Weizhu
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 932 - 942
  • [7] Few-shot imbalanced classification based on data augmentation
    Xuewei Chao
    Lixin Zhang
    Multimedia Systems, 2023, 29 : 2843 - 2851
  • [8] Domain-Specific Few-Shot Table Prompt Question Answering via Contrastive Exemplar Selection
    Mo, Tianjin
    Xiao, Qiao
    Zhang, Hongyi
    Li, Ren
    Wu, Yunsong
    ALGORITHMS, 2024, 17 (07)
  • [9] Few-Shot Multihop Question Answering over Knowledge Base
    Fan, Meihao
    Zhang, Lei
    Xiao, Siyao
    Liang, Yuru
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [10] SymKGQA: Few-Shot Knowledge Graph Question Answering via Symbolic Program Generation and Execution
    Agarwal, Prerna
    Kumar, Nishant
    Bedathur, Srikanta
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 10119 - 10140