Fine-Tuning a Large Language Model with Reinforcement Learning for Educational Question Generation

Authors
Lamsiyah, Salima [1]
El Mahdaouy, Abdelkader [2]
Nourbakhsh, Aria [1]
Schommer, Christoph [1]
Affiliations
[1] Univ Luxembourg, Fac Sci Technol & Med, Dept Comp Sci, Esch Sur Alzette, Luxembourg
[2] Mohammed VI Polytech Univ, Coll Comp, Ben Guerir, Morocco
Source
ARTIFICIAL INTELLIGENCE IN EDUCATION, PT I, AIED 2024 | 2024, Vol. 14829
Keywords
Educational Question Generation; Large Language Model; Google FLAN-T5; Reinforcement Learning; Self-Critical Sequence Training
DOI
10.1007/978-3-031-64302-6_30
Chinese Library Classification
TP18 [Theory of Artificial Intelligence]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Educational Question Generation (EduQG) aims to automatically generate educational questions from textual content, which is crucial for the expansion of online education. Prior research in EduQG has predominantly relied on cross-entropy loss for training, which can lead to issues such as exposure bias and inconsistencies between training and testing metrics. To mitigate these issues, we propose a reinforcement learning (RL) based large language model (LLM) for educational question generation. In particular, we fine-tune the Google FLAN-T5 model using a mixed objective function that combines cross-entropy and RL losses to ensure the generation of questions that are syntactically and semantically accurate. The experimental results on the SciQ question generation dataset show that the proposed method is competitive with current state-of-the-art systems in terms of predictive performance and linguistic quality.
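The abstract does not spell out the mixed objective, so the following is only a minimal sketch of how a cross-entropy loss and a self-critical sequence training (SCST) loss are commonly combined. The weight `gamma`, the reward values, the per-token averaging of the cross-entropy term, and all function names are assumptions made for illustration; they are not taken from the paper.

```python
import math


def sequence_log_prob(token_probs):
    """Sum of log-probabilities of the tokens in one sequence."""
    return sum(math.log(p) for p in token_probs)


def cross_entropy_loss(reference_token_probs):
    """Negative log-likelihood of the reference question, averaged per token.

    reference_token_probs: probabilities the model assigns to each
    reference token under teacher forcing.
    """
    return -sequence_log_prob(reference_token_probs) / len(reference_token_probs)


def scst_loss(sampled_token_probs, sampled_reward, greedy_reward):
    """Self-critical loss for one sampled question.

    The reward of the greedy decode acts as the baseline, so a sample is
    reinforced only when it scores higher than the greedy output.
    """
    advantage = sampled_reward - greedy_reward
    return -advantage * sequence_log_prob(sampled_token_probs)


def mixed_loss(ce, rl, gamma=0.5):
    """Convex combination of both losses; gamma is a hypothetical weight."""
    return gamma * rl + (1.0 - gamma) * ce


# Toy numbers, assumed for illustration only.
ce = cross_entropy_loss([0.5, 0.5])
rl = scst_loss([0.5, 0.5], sampled_reward=0.8, greedy_reward=0.6)
total = mixed_loss(ce, rl, gamma=0.5)
```

In practice the reward would come from a sequence-level metric computed against the reference question, and the token probabilities would come from the model being fine-tuned; both are stubbed with fixed numbers here.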
Pages: 424-438
Page count: 15