Fine-Tuning a Large Language Model with Reinforcement Learning for Educational Question Generation

Cited by: 0

Authors:

Lamsiyah, Salima ^{[1]}

El Mahdaouy, Abdelkader ^{[2]}

Nourbakhsh, Aria ^{[1]}

Schommer, Christoph ^{[1]}

Affiliations:

[1] Univ Luxembourg, Fac Sci Technol & Med, Dept Comp Sci, Esch Sur Alzette, Luxembourg

[2] Mohammed VI Polytech Univ, Coll Comp, Ben Guerir, Morocco

Source:

ARTIFICIAL INTELLIGENCE IN EDUCATION, PT I, AIED 2024 | 2024 / Vol. 14829

Keywords:

Educational Question Generation; Large Language Model; Google FLAN-T5; Reinforcement Learning; Self-Critical Sequence Training;

DOI:

10.1007/978-3-031-64302-6_30

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Educational Question Generation (EduQG) aims to automatically generate educational questions from textual content, which is crucial for the expansion of online education. Prior research in EduQG has predominantly relied on cross-entropy loss for training, which can lead to issues such as exposure bias and inconsistencies between training and testing metrics. To mitigate these issues, we propose a reinforcement learning (RL) based large language model (LLM) for educational question generation. In particular, we fine-tune the Google FLAN-T5 model using a mixed objective function that combines cross-entropy and RL losses to ensure the generation of questions that are syntactically and semantically accurate. The experimental results on the SciQ question generation dataset show that the proposed method is competitive with current state-of-the-art systems in terms of predictive performance and linguistic quality.
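
The mixed objective mentioned in the abstract can be illustrated with a minimal sketch. It assumes the RL term follows the self-critical sequence training form named in the keywords, where the reward of a greedy decode serves as the baseline for a sampled question. The weight `gamma`, the function names, and the toy probabilities and rewards are hypothetical and do not come from the paper.

```python
import math

def cross_entropy_loss(reference_token_probs):
    """Negative log-likelihood of the reference question tokens."""
    return -sum(math.log(p) for p in reference_token_probs)

def scst_loss(sample_token_probs, sample_reward, greedy_reward):
    """Self-critical loss: the greedy decode's reward is the baseline."""
    sample_log_prob = sum(math.log(p) for p in sample_token_probs)
    return -(sample_reward - greedy_reward) * sample_log_prob

def mixed_loss(ce, rl, gamma=0.9):
    """Weighted sum of the RL and cross-entropy terms (gamma is an assumed value)."""
    return gamma * rl + (1.0 - gamma) * ce

# Toy values: per-token model probabilities and sequence-level rewards.
ce = cross_entropy_loss([0.5, 0.25])
rl = scst_loss([0.5, 0.5], sample_reward=0.8, greedy_reward=0.6)
loss = mixed_loss(ce, rl, gamma=0.5)
```

When the sampled question scores higher than the greedy one, minimizing the RL term raises the sampled sequence's log-probability. When it scores lower, the sign flips and that sequence is discouraged.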


Pages: 424-438

Number of pages: 15

