Using the ITS Components in Improving the Q-Learning Policy for Instructional Sequencing

Cited by: 0
Authors
Yessad, Amel [1]
Affiliations
[1] Sorbonne Univ, LIP6, CNRS, 4 Pl Jussieu, F-75252 Paris 05, France
Source
AUGMENTED INTELLIGENCE AND INTELLIGENT TUTORING SYSTEMS, ITS 2023 | 2023 / Vol. 13891
Keywords
Instructional sequencing; Sequencing policy; Reinforcement learning; Q-learning; Student model; Domain model; Intelligent tutoring system
DOI
10.1007/978-3-031-32883-1_21
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we aim to optimize the sequencing of learning activities using Q-learning, a reinforcement learning method. At each step, the Q-learning agent decides which activity to propose to the student. The sequencing policy we propose is guided by the aim of efficiently improving the student's knowledge state. Q-learning thus learns a mapping from student knowledge states to the optimal activity to perform in each state. We tackle two main issues in implementing off-policy Q-learning: the combinatorial explosion of student knowledge states, and the definition of a reward function that makes it possible to improve the student's knowledge state efficiently. We rely on the student model and the domain model to address these two challenges. We carried out a study on simulated students to evaluate the proposed approach. We show that our approach is more efficient, since it achieves a better learning gain with fewer activities than a random policy or an expert-based policy.
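As an illustration only, the idea of learning a mapping from knowledge states to activities can be sketched with tabular Q-learning on a toy simulated student. This is not the paper's implementation: the three-skill prerequisite chain (a stand-in domain model), the binary mastery vector (a stand-in student model), the learning probability, the reward defined as knowledge gain, and all names and hyperparameters below are assumptions made for this sketch.

```python
import random
from collections import defaultdict

# All constants below are hypothetical choices for this sketch.
N_SKILLS = 3      # toy domain model: skill i requires skill i-1
P_LEARN = 0.6     # chance that a well-placed activity leads to mastery
ALPHA = 0.05      # learning rate
GAMMA = 0.9       # discount factor
EPSILON = 0.2     # exploration rate


def step(state, action, rng):
    """Simulated student: practising skill `action` may master it,
    but only if its prerequisite is already mastered."""
    prereq_ok = action == 0 or state[action - 1] == 1
    if state[action] == 0 and prereq_ok and rng.random() < P_LEARN:
        next_state = state[:action] + (1,) + state[action + 1:]
    else:
        next_state = state
    reward = sum(next_state) - sum(state)  # reward = knowledge gain
    return next_state, reward


def greedy_action(q, state):
    """Activity with the highest learned value in `state`."""
    return max(range(N_SKILLS), key=lambda a: q[(state, a)])


def train(episodes=5000, max_steps=20, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = defaultdict(float)  # (state, action) -> estimated value
    goal = (1,) * N_SKILLS
    for _ in range(episodes):
        state = (0,) * N_SKILLS
        for _ in range(max_steps):
            if state == goal:
                break
            if rng.random() < EPSILON:
                action = rng.randrange(N_SKILLS)
            else:
                action = greedy_action(q, state)
            next_state, reward = step(state, action, rng)
            # Off-policy target: bootstrap from the best next action.
            best_next = max(q[(next_state, a)] for a in range(N_SKILLS))
            td_error = reward + GAMMA * best_next - q[(state, action)]
            q[(state, action)] += ALPHA * td_error
            state = next_state
    return q


if __name__ == "__main__":
    q_table = train()
    for s in [(0, 0, 0), (1, 0, 0), (1, 1, 0)]:
        print(s, "->", greedy_action(q_table, s))
```

With only three binary skills the table has at most 8 states, so the combinatorial explosion mentioned in the abstract does not arise here. The sketch only shows the update rule and how a reward tied to knowledge gain can shape the learned sequencing policy.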
Pages: 247-256
Page count: 10