MINPROMPT: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering

被引:0
|
作者
Chen, Xiusi [1 ]
Jiang, Jyun-Yu [2 ]
Chang, Wei-Cheng [2 ]
Hsieh, Cho-Jui [1 ]
Yu, Hsiang-Fu [2 ]
Wang, Wei [1 ]
机构
[1] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
[2] Amazon Search, Seattle, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in few-shot question answering (QA) mostly rely on the power of pre-trained large language models (LLMs) and fine-tuning in specific settings. Although the pre-training stage has already equipped LLMs with powerful reasoning capabilities, LLMs still need to be fine-tuned to adapt to specific domains to achieve the best results. In this paper, we propose to select the most informative data for fine-tuning, thereby improving the efficiency of the fine-tuning process with comparative or even better accuracy on the open-domain QA task. We present MINPROMPT, a minimal data augmentation framework for open-domain QA based on an approximate graph algorithm and unsupervised question generation. We transform the raw text into a graph structure to build connections between different factual sentences, then apply graph algorithms to identify the minimal set of sentences needed to cover the most information in the raw text. We then generate QA pairs based on the identified sentence subset and train the model on the selected sentences to obtain the final model. Empirical results on several benchmark datasets and theoretical analysis show that MINPROMPT is able to achieve comparable or better results than baselines with a high degree of efficiency, bringing consistent improvements in F-1 scores.
引用
收藏
页码:254 / 266
页数:13
相关论文
共 50 条
  • [41] FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning
    Zhou, Jing
    Zheng, Yanan
    Tang, Jie
    Li, Jian
    Yang, Zhilin
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 8646 - 8665
  • [42] Domain Knowledge Graph Question Answering Based on Semantic Analysis and Data Augmentation
    Hu, Shulin
    Zhang, Huajun
    Zhang, Wanying
    APPLIED SCIENCES-BASEL, 2023, 13 (15):
  • [43] OVERCOMING CHALLENGES IN LEVERAGING GANS FOR FEW-SHOT DATA AUGMENTATION
    Beckham, Christopher
    Laradji, Issam
    Rodriguez, Pau
    Vazquez, David
    Nowrouzezahrai, Derek
    Pal, Christopher
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
  • [44] Graph-Based Embedding Smoothing Network for Few-Shot Scene Classification of Remote Sensing Images
    Yuan, Zhengwu
    Huang, Wendong
    Tang, Chan
    Yang, Aixia
    Luo, Xiaobo
    REMOTE SENSING, 2022, 14 (05)
  • [45] A2-CLM: Few-Shot Malware Detection Based on Adversarial Heterogeneous Graph Augmentation
    Liu, Chen
    Li, Bo
    Zhao, Jun
    Feng, Weiwei
    Liu, Xudong
    Li, Chunpei
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 2023 - 2038
  • [46] Few-Shot Complex Knowledge Base Question Answering via Meta Reinforcement Learning
    Hua, Yuncheng
    Li, Yuan-Fang
    Haffari, Gholamreza
    Qi, Guilin
    Wu, Tongtong
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 5827 - 5837
  • [47] Cross-Modal Feature Distribution Calibration for Few-Shot Visual Question Answering
    Zhang, Jing
    Liu, Xiaoqiang
    Chen, Mingzhe
    Wang, Zhe
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7151 - 7159
  • [48] Prompt-Based Graph Convolution Adversarial Meta-Learning for Few-Shot Text Classification
    Gong, Ruwei
    Qin, Xizhong
    Ran, Wensheng
    APPLIED SCIENCES-BASEL, 2023, 13 (16):
  • [49] Combat data shift in few-shot learning with knowledge graph
    Zhu, Yongchun
    Zhuang, Fuzhen
    Zhang, Xiangliang
    Qi, Zhiyuan
    Shi, Zhiping
    Cao, Juan
    He, Qing
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (01)
  • [50] Combat data shift in few-shot learning with knowledge graph
    Yongchun Zhu
    Fuzhen Zhuang
    Xiangliang Zhang
    Zhiyuan Qi
    Zhiping Shi
    Juan Cao
    Qing He
    Frontiers of Computer Science, 2023, 17