MINPROMPT: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering

被引:0
|
作者
Chen, Xiusi [1 ]
Jiang, Jyun-Yu [2 ]
Chang, Wei-Cheng [2 ]
Hsieh, Cho-Jui [1 ]
Yu, Hsiang-Fu [2 ]
Wang, Wei [1 ]
机构
[1] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
[2] Amazon Search, Seattle, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in few-shot question answering (QA) mostly rely on the power of pre-trained large language models (LLMs) and fine-tuning in specific settings. Although the pre-training stage has already equipped LLMs with powerful reasoning capabilities, LLMs still need to be fine-tuned to adapt to specific domains to achieve the best results. In this paper, we propose to select the most informative data for fine-tuning, thereby improving the efficiency of the fine-tuning process with comparative or even better accuracy on the open-domain QA task. We present MINPROMPT, a minimal data augmentation framework for open-domain QA based on an approximate graph algorithm and unsupervised question generation. We transform the raw text into a graph structure to build connections between different factual sentences, then apply graph algorithms to identify the minimal set of sentences needed to cover the most information in the raw text. We then generate QA pairs based on the identified sentence subset and train the model on the selected sentences to obtain the final model. Empirical results on several benchmark datasets and theoretical analysis show that MINPROMPT is able to achieve comparable or better results than baselines with a high degree of efficiency, bringing consistent improvements in F-1 scores.
引用
收藏
页码:254 / 266
页数:13
相关论文
共 50 条
  • [21] VulPrompt: Prompt-Based Vulnerability Detection Using Few-Shot Graph Learning
    Irtiza, Saquib
    Li, Xiaodi
    Zamani, Mahmoud
    Khan, Latifur
    Hamlen, Kevin W.
    DATA AND APPLICATIONS SECURITY AND PRIVACY XXXVIII, DBSEC 2024, 2024, 14901 : 221 - 240
  • [22] Few-shot learning through contextual data augmentation
    Arthaud, Farid
    Bawden, Rachel
    Birch, Alexandra
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1049 - 1062
  • [23] Few-Shot Website Fingerprinting Attack with Data Augmentation
    Chen, Mantun
    Wang, Yongjun
    Qin, Zhiquan
    Zhu, Xiatian
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
  • [24] Graph-based few-shot learning with transformed feature propagation and optimal class allocation
    Zhang, Ruiheng
    Yang, Shuo
    Zhang, Qi
    Xu, Lixin
    He, Yang
    Zhang, Fan
    NEUROCOMPUTING, 2022, 470 : 247 - 256
  • [25] TabPrompt: Graph-based Pre-training and Prompting for Few-shot Table Understanding
    Jin, Rihui
    Wang, Jianan
    Tan, Wei
    Chen, YongRui
    Qi, Guilin
    Hao, Wang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 7373 - 7383
  • [26] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts
    Engin, Deniz
    Avrithis, Yannis
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2796 - 2802
  • [27] Metric Based Few-Shot Graph Classification
    Crisostomi, Donato
    Antonelli, Simone
    Maiorca, Valentino
    Moschella, Luca
    Marin, Riccardo
    Rodola, Emanuele
    LEARNING ON GRAPHS CONFERENCE, VOL 198, 2022, 198
  • [28] Explicit knowledge transfer of graph-based correlation distillation and diversity data hallucination for few-shot object detection
    Wang, Meng
    Wang, Yang
    Liu, Haipeng
    IMAGE AND VISION COMPUTING, 2024, 143
  • [29] Fast Parameter Adaptation for Few-shot Image Captioning and Visual Question Answering
    Dong, Xuanyi
    Zhu, Linchao
    Zhang, De
    Yang, Yi
    Wu, Fei
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 54 - 62
  • [30] Few-shot English Text Classification Method Based On Graph Convolutional Network And Prompt Learning
    Jin, Yunfei
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2025, 28 (09): : 1777 - 1784