Fine-Tuning a Large Language Model with Reinforcement Learning for Educational Question Generation

被引:0
|
作者
Lamsiyah, Salima [1 ]
El Mahdaouy, Abdelkader [2 ]
Nourbakhsh, Aria [1 ]
Schommer, Christoph [1 ]
机构
[1] Univ Luxembourg, Fac Sci Technol & Med, Dept Comp Sci, Esch Sur Alzette, Luxembourg
[2] Mohammed VI Polytech Univ, Coll Comp, Ben Guerir, Morocco
来源
ARTIFICIAL INTELLIGENCE IN EDUCATION, PT I, AIED 2024 | 2024年 / 14829卷
关键词
Educational Question Generation; Large Language Model; Google FLAN-T5; Reinforcement Learning; Self-Critical Sequence Training;
D O I
10.1007/978-3-031-64302-6_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Educational Natural Language Generation (EduQG) aims to automatically generate educational questions from textual content, which is crucial for the expansion of online education. Prior research in EduQG has predominantly relied on cross-entropy loss for training, which can lead to issues such as exposure bias and inconsistencies between training and testing metrics. To mitigate this issue, we propose a reinforcement learning (RL) based large language model (LLM) for educational question generation. In particular, we fine-tune the Google FLAN-T5 model using a mixed objective function that combines cross-entropy and RL losses to ensure the generation of questions that are syntactically and semantically accurate. The experimental results on the SciQ question generation dataset show that the proposed method is competitive with current state-of-the-art systems in terms of predictive performance and linguistic quality.
引用
收藏
页码:424 / 438
页数:15
相关论文
共 50 条
  • [41] Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
    Nagabandi, Anusha
    Kahn, Gregory
    Fearing, Ronald S.
    Levine, Sergey
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 7579 - 7586
  • [42] Fine-Tuning via Mask Language Model Enhanced Representations Based Contrastive Learning and Application
    Zhang, Dechi
    Wan, Weibing
    Computer Engineering and Applications, 60 (17): : 129 - 138
  • [43] Improving unbalanced image classification through fine-tuning method of reinforcement learning
    Wang, Jin-Qiang
    Guo, Lan
    Jiang, Yuanbo
    Zhang, Shengjie
    Zhou, Qingguo
    APPLIED SOFT COMPUTING, 2024, 163
  • [44] DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
    Fan, Ying
    Watkins, Olivia
    Du, Yuqing
    Liu, Hao
    Ryu, Moonkyung
    Boutilier, Craig
    Abbeel, Pieter
    Ghavamzadeh, Mohammad
    Lee, Kangwook
    Lee, Kimin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [45] SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning
    Feng, Jiaheng
    Feng, Mingxiao
    Song, Haolin
    Zhou, Wengang
    Li, Houqiang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 11961 - 11969
  • [46] Multi-phase Fine-Tuning: A New Fine-Tuning Approach for Sign Language Recognition
    Sarhan, Noha
    Lauri, Mikko
    Frintrop, Simone
    KUNSTLICHE INTELLIGENZ, 2022, 36 (01): : 91 - 98
  • [47] Multi-phase Fine-Tuning: A New Fine-Tuning Approach for Sign Language Recognition
    Noha Sarhan
    Mikko Lauri
    Simone Frintrop
    KI - Künstliche Intelligenz, 2022, 36 : 91 - 98
  • [48] ConoGPT: Fine-Tuning a Protein Language Model by Incorporating Disulfide Bond Information for Conotoxin Sequence Generation
    Zhao, Guohui
    Ge, Cheng
    Han, Wenzheng
    Yu, Rilei
    Liu, Hao
    TOXINS, 2025, 17 (02)
  • [49] Artificial intelligence-based data extraction for next generation risk assessment: Is fine-tuning of a large language model worth the effort?
    Sonnenburg, Anna
    van der Lugt, Benthe
    Rehn, Johannes
    Wittkowski, Paul
    Bech, Karsten
    Padberg, Florian
    Eleftheriadou, Dimitra
    Dobrikov, Todor
    Bouwmeester, Hans
    Mereu, Carla
    Graf, Ferdinand
    Kneuer, Carsten
    Kramer, Nynke I.
    Bluemmel, Tilmann
    TOXICOLOGY, 2024, 508
  • [50] Efficient Index Learning via Model Reuse and Fine-tuning
    Liu, Guanli
    Qi, Jianzhong
    Kulik, Lars
    Soga, Kazuya
    Borovica-Gajic, Renata
    Rubinstein, Benjamin I. P.
    2023 IEEE 39TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS, ICDEW, 2023, : 60 - 66