Reinforcement learning for few-shot text generation adaptation

被引:4
|
作者
Cheng, Pengsen [1 ]
Dai, Jinqiao [1 ]
Liu, Jiamiao [1 ]
Liu, Jiayong [1 ]
Jia, Peng [1 ]
机构
[1] Sichuan Univ, Sch Cyber Sci & Engn, Chuanda Rd, Chengdu 610207, Peoples R China
关键词
Text generation; Domain adaption; Few-shot learning; Reinforcement learning;
D O I
10.1016/j.neucom.2023.126689
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel method based on reinforcement learning (RL) to control the generation model in adapting to new domains with limited samples. To avoid the problem of overfitting, the method combines maximum likelihood estimation (MLE) with RL process to improve the sample utilization rate and reduce the sample requirement of RL. The training process is divided into two parts: pre-training and fine-tuning, to effectively express the semantic of the target domain. In order to ensure the robustness of the reward function, adversarial training is introduced. A new measurement called "Net Accuracy"is proposed to better evaluate the domain relevance of the generated text and eliminate the problem of inaccurate domain relevance measurement caused by overfitting and generating a large amount of duplicate text. Finally, experimental results show the effectiveness and superiority of the proposed method in five target domains.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Prototype Reinforcement for Few-Shot Learning
    Xu, Liheng
    Xie, Qian
    Jiang, Baoqing
    Zhang, Jiashuo
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4912 - 4916
  • [2] Few-Shot System Identification for Reinforcement Learning
    Farid, Karim
    Sakr, Nourhan
    2021 6TH ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS (ACIRS), 2021, : 76 - 82
  • [3] Reinforcement Learning in Few-Shot Scenarios: A Survey
    Zhechao Wang
    Qiming Fu
    Jianping Chen
    Yunzhe Wang
    You Lu
    Hongjie Wu
    Journal of Grid Computing, 2023, 21
  • [4] Reinforcement Learning in Few-Shot Scenarios: A Survey
    Wang, Zhechao
    Fu, Qiming
    Chen, Jianping
    Wang, Yunzhe
    Lu, You
    Wu, Hongjie
    JOURNAL OF GRID COMPUTING, 2023, 21 (02)
  • [5] Few-shot learning for short text classification
    Yan, Leiming
    Zheng, Yuhui
    Cao, Jie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (22) : 29799 - 29810
  • [6] Few-shot learning for short text classification
    Leiming Yan
    Yuhui Zheng
    Jie Cao
    Multimedia Tools and Applications, 2018, 77 : 29799 - 29810
  • [7] Continual Few-Shot Learning for Text Classification
    Pasunuru, Ramakanth
    Stoyanov, Veselin
    Bansal, Mohit
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 5688 - 5702
  • [8] Few-shot Learning for New Environment Adaptation
    Wang, Ouya
    Zhou, Shenglong
    Li, Geoffrey Ye
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 351 - 356
  • [9] Subspace Adaptation Prior for Few-Shot Learning
    Mike Huisman
    Aske Plaat
    Jan N. van Rijn
    Machine Learning, 2024, 113 : 725 - 752
  • [10] Subspace Adaptation Prior for Few-Shot Learning
    Huisman, Mike
    Plaat, Aske
    van Rijn, Jan N.
    MACHINE LEARNING, 2024, 113 (02) : 725 - 752