Using the ITS Components in Improving the Q-Learning Policy for Instructional Sequencing

被引：0

作者：

Yessad, Amel ^{[1
]}

机构：

[1] Sorbonne Univ, LIP6, CNRS, 4 Pl Jussieu, F-75252 Paris 05, France

来源：

AUGMENTED INTELLIGENCE AND INTELLIGENT TUTORING SYSTEMS, ITS 2023 | 2023年 / 13891卷

关键词：

Instructional Sequencing; Sequencing policy; Reinforcement learning; Q-learning; student model; domain model; Intelligent tutoring system;

D O I：

10.1007/978-3-031-32883-1_21

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we aim to optimize the sequencing of learning activities using the Q-learning, a reinforcement learning method. The Q-learning agent decides each time which activity to propose to the student. The sequencing policy we propose is guided by the aim to improve efficiently the student knowledge state. Thus, the Q-learning learns a mapping of the student knowledge states to the optimal activity to perform in that state. In this paper, we tackle two main issues in implementing the Q-learning off-policy: the combinatorial explosion of the student knowledge states and the definition of the reward function allowing to improve efficiently the student knowledge state. We rely on the student model and the domain model to answer these two challenges. We carried out a study to evaluate the approach we propose on simulated students. We show that our approach is more efficient since it achieves better learning gain with fewer activities than a random policy or an expert based policy.

引用

页码：247 / 256

页数：10

共 15 条

[1] Aleven V, 2017, EDUC PSYCHOL HANDB, P522
[2] Reinforcement Learning for the Adaptive Scheduling of Educational Activities
Bassen, Jonathan
Balaji, Bharathan
Schaarschmidt, Michael
Thille, Candace
Painter, Jay
Zimmaro, Dawn
Gamest, Alex
Fast, Ethan
Mitchell, John C.
[J]. PROCEEDINGS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'20), 2020,
[3] Clement B, 2015, Arxiv, DOI arXiv:1310.3174
[4] CORBETT AT, 1994, USER MODEL USER-ADAP, V4, P253, DOI 10.1007/BF01099821
[5] Where's the Reward?: A Review of Reinforcement Learning for Instructional Sequencing
Doroudi, Shayan
Aleven, Vincent
Brunskill, Emma
[J]. INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2019, 29 (04) : 568 - 620
[6] Efremov Aleksandr, 2020, EDM
[7] Mandel T, 2014, AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, P1077
[8] Sen A., 2018, Machine beats human at sequencing visuals for perceptual-fluency practice
[9] Singla A., 2021, arXiv, DOI DOI 10.48550/ARXIV.2107.08828
[10] Sondik Edward J, 1971, OPTIMAL CONTROL PART, P1

← 1 2 →