Fine-Tuning a Large Language Model with Reinforcement Learning for Educational Question Generation

被引：0

作者：

Lamsiyah, Salima ^{[1
]}

El Mahdaouy, Abdelkader ^{[2
]}

Nourbakhsh, Aria ^{[1
]}

Schommer, Christoph ^{[1
]}

机构：

[1] Univ Luxembourg, Fac Sci Technol & Med, Dept Comp Sci, Esch Sur Alzette, Luxembourg

[2] Mohammed VI Polytech Univ, Coll Comp, Ben Guerir, Morocco

来源：

ARTIFICIAL INTELLIGENCE IN EDUCATION, PT I, AIED 2024 | 2024年 / 14829卷

关键词：

Educational Question Generation; Large Language Model; Google FLAN-T5; Reinforcement Learning; Self-Critical Sequence Training;

D O I：

10.1007/978-3-031-64302-6_30

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Educational Natural Language Generation (EduQG) aims to automatically generate educational questions from textual content, which is crucial for the expansion of online education. Prior research in EduQG has predominantly relied on cross-entropy loss for training, which can lead to issues such as exposure bias and inconsistencies between training and testing metrics. To mitigate this issue, we propose a reinforcement learning (RL) based large language model (LLM) for educational question generation. In particular, we fine-tune the Google FLAN-T5 model using a mixed objective function that combines cross-entropy and RL losses to ensure the generation of questions that are syntactically and semantically accurate. The experimental results on the SciQ question generation dataset show that the proposed method is competitive with current state-of-the-art systems in terms of predictive performance and linguistic quality.

引用

页码：424 / 438

页数：15

共 50 条

[41] Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
Nagabandi, Anusha
Kahn, Gregory
Fearing, Ronald S.
Levine, Sergey
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 7579 - 7586
[42] Fine-Tuning via Mask Language Model Enhanced Representations Based Contrastive Learning and Application
Zhang, Dechi
Wan, Weibing
Computer Engineering and Applications, 60 (17): : 129 - 138
[43] Improving unbalanced image classification through fine-tuning method of reinforcement learning
Wang, Jin-Qiang
Guo, Lan
Jiang, Yuanbo
Zhang, Shengjie
Zhou, Qingguo
APPLIED SOFT COMPUTING, 2024, 163
[44] DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Fan, Ying
Watkins, Olivia
Du, Yuqing
Liu, Hao
Ryu, Moonkyung
Boutilier, Craig
Abbeel, Pieter
Ghavamzadeh, Mohammad
Lee, Kangwook
Lee, Kimin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[45] SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning
Feng, Jiaheng
Feng, Mingxiao
Song, Haolin
Zhou, Wengang
Li, Houqiang
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 11961 - 11969
[46] Multi-phase Fine-Tuning: A New Fine-Tuning Approach for Sign Language Recognition
Sarhan, Noha
Lauri, Mikko
Frintrop, Simone
KUNSTLICHE INTELLIGENZ, 2022, 36 (01): : 91 - 98
[47] Multi-phase Fine-Tuning: A New Fine-Tuning Approach for Sign Language Recognition
Noha Sarhan
Mikko Lauri
Simone Frintrop
KI - Künstliche Intelligenz, 2022, 36 : 91 - 98
[48] ConoGPT: Fine-Tuning a Protein Language Model by Incorporating Disulfide Bond Information for Conotoxin Sequence Generation
Zhao, Guohui
Ge, Cheng
Han, Wenzheng
Yu, Rilei
Liu, Hao
TOXINS, 2025, 17 (02)
[49] Artificial intelligence-based data extraction for next generation risk assessment: Is fine-tuning of a large language model worth the effort?
Sonnenburg, Anna
van der Lugt, Benthe
Rehn, Johannes
Wittkowski, Paul
Bech, Karsten
Padberg, Florian
Eleftheriadou, Dimitra
Dobrikov, Todor
Bouwmeester, Hans
Mereu, Carla
Graf, Ferdinand
Kneuer, Carsten
Kramer, Nynke I.
Bluemmel, Tilmann
TOXICOLOGY, 2024, 508
[50] Efficient Index Learning via Model Reuse and Fine-tuning
Liu, Guanli
Qi, Jianzhong
Kulik, Lars
Soga, Kazuya
Borovica-Gajic, Renata
Rubinstein, Benjamin I. P.
2023 IEEE 39TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS, ICDEW, 2023, : 60 - 66

← 1 2 3 4 5 →