Where's the Reward?: A Review of Reinforcement Learning for Instructional Sequencing

被引:40
|
作者
Doroudi, Shayan [1 ,2 ,3 ]
Aleven, Vincent [4 ]
Brunskill, Emma [2 ]
机构
[1] Carnegie Mellon Univ, Comp Sci Dept, Pittsburgh, PA 15213 USA
[2] Stanford Univ, Comp Sci Dept, Stanford, CA 94305 USA
[3] Univ Calif Irvine, Sch Educ, Irvine, CA 92697 USA
[4] Carnegie Mellon Univ, Human Comp Interact Inst, Pittsburgh, PA 15213 USA
关键词
Reinforcement learning; Instructional sequencing; Adaptive instruction; History of artificial intelligence in education; TEACHING STRATEGIES; WORKED EXAMPLES; KNOWLEDGE; ALLOCATION; EXPERTISE; GAME; EFFICIENCY; RETENTION; SELECTION; IMPROVE;
D O I
10.1007/s40593-019-00187-x
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Since the 1960s, researchers have been trying to optimize the sequencing of instructional activities using the tools of reinforcement learning (RL) and sequential decision making under uncertainty. Many researchers have realized that reinforcement learning provides a natural framework for optimal instructional sequencing given a particular model of student learning, and excitement towards this area of research is as alive now as it was over fifty years ago. But does RL actually help students learn? If so, when and where might we expect it to be most helpful? To help answer these questions, we review the variety of attempts to use RL for instructional sequencing. First, we present a historical narrative of this research area. We identify three waves of research, which gives us a sense of the various communities of researchers that have been interested in this problem and where the field is going. Second, we review all of the empirical research that has compared RL-induced instructional policies to baseline methods of sequencing. We find that over half of the studies found that RL-induced policies significantly outperform baselines. Moreover, we identify five clusters of studies with different characteristics and varying levels of success in using RL to help students learn. We find that reinforcement learning has been most successful in cases where it has been constrained with ideas and theories from cognitive psychology and the learning sciences. However, given that our theories and models are limited, we also find that it has been useful to complement this approach with running more robust offline analyses that do not rely heavily on the assumptions of one particular model. Given that many researchers are turning to deep reinforcement learning and big data to tackle instructional sequencing, we believe keeping these best practices in mind can help guide the way to the reward in using RL for instructional sequencing.
引用
收藏
页码:568 / 620
页数:53
相关论文
共 50 条
  • [1] Where’s the Reward?A Review of Reinforcement Learning for Instructional Sequencing
    Shayan Doroudi
    Vincent Aleven
    Emma Brunskill
    International Journal of Artificial Intelligence in Education, 2019, 29 : 568 - 620
  • [2] A Deep Reinforcement Learning Framework for Instructional Sequencing
    Pu, Yanjun
    Wang, Caimeng
    Wu, Wenjun
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5201 - 5208
  • [3] A Review of Reward Functions for Reinforcement Learning in the context of Autonomous Driving
    Abouelazm, Ahmed
    Michel, Jonas
    Zoellner, Marius
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 156 - 163
  • [4] Reward Shaping in Episodic Reinforcement Learning
    Grzes, Marek
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 565 - 573
  • [5] Reinforcement Learning using Reward Expectations in Scenarios with Aleatoric Uncertainties
    Wang, Yubin
    Sun, Yifeng
    Wu, Jiang
    Hu, Hao
    Wu, Zhiqiang
    Huang, Weigui
    2022 IEEE CONFERENCE ON GAMES, COG, 2022, : 261 - 267
  • [6] Actively learning costly reward functions for reinforcement learning
    Eberhard, Andre
    Metni, Houssam
    Fahland, Georg
    Stroh, Alexander
    Friederich, Pascal
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (01):
  • [7] Reinforcement Learning for Data Preparation with Active Reward Learning
    Berti-Equille, Laure
    INTERNET SCIENCE, INSCI 2019, 2019, 11938 : 121 - 132
  • [8] Learning classifier system with average reward reinforcement learning
    Zang, Zhaoxiang
    Li, Dehua
    Wang, Junying
    Xia, Dan
    KNOWLEDGE-BASED SYSTEMS, 2013, 40 : 58 - 71
  • [9] Maximum reward reinforcement learning: A non-cumulative reward criterion
    Quah, K. H.
    Quek, Chai
    EXPERT SYSTEMS WITH APPLICATIONS, 2006, 31 (02) : 351 - 359
  • [10] Skill Reward for Safe Deep Reinforcement Learning
    Cheng, Jiangchang
    Yu, Fumin
    Zhang, Hongliang
    Dai, Yinglong
    UBIQUITOUS SECURITY, 2022, 1557 : 203 - 213