Reinforcement Learning Based on Contextual Bandits for Personalized Online Learning Recommendation Systems

Cited by: 1
Authors
Wacharawan Intayoad
Chayapol Kamyod
Punnarumol Temdee
Affiliation
[1] Mae Fah Luang University, Computer and Communication Engineering for Capacity Building Research Unit, School of Information Technology
Source
Wireless Personal Communications | 2020 / Vol. 115
Keywords
Reinforcement learning; Personalized learning; Recommendation
DOI
Not available
Abstract
Personalized online learning has been widely adopted in recent years and has become a promising instructional strategy in online learning. One promising way to provide it is personalized recommendation, which guides students to suitable learning content at the right time. However, this is a nontrivial problem, because online learning environments are highly flexible: students learn independently according to their own characteristics and situations. Existing recommendation methods do not work effectively in such environments. The objective of this study is therefore to provide personalized, dynamic, and continuous recommendation for online learning systems. We propose a method based on contextual bandits and reinforcement learning, which work effectively in dynamic environments. Moreover, we propose using past student behavior and the current student state as contextual information to create the policy with which the reinforcement learning agent makes the optimal decision. We use real data from an online learning system to evaluate the proposed method. The proposed method is compared with well-known methods for reinforcement learning problems, i.e., ε-greedy, greedy with optimistic initial values, and upper confidence bound (UCB) methods. The results show that the proposed method performs significantly better than these benchmark methods in our test case.
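For readers unfamiliar with the three baseline methods named in the abstract, the following is a minimal sketch of how they might look on a simulated Bernoulli bandit. It is not the authors' implementation and does not model their contextual method or their data. The function names (`run_bandit`, `epsilon_greedy`, `greedy`, `ucb1`), the reward probabilities, and all parameter values are assumptions chosen only for illustration.

```python
import math
import random


def run_bandit(select_arm, true_means, steps=2000,
               initial_value=0.0, step_size=None, seed=0):
    """Simulate a Bernoulli bandit; return the average reward per step.

    true_means are hypothetical success probabilities, one per arm.
    step_size=None uses the sample average; a constant keeps the
    influence of an optimistic initial value alive for longer.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms
    values = [initial_value] * n_arms  # estimated mean reward per arm
    total = 0.0
    for t in range(1, steps + 1):
        arm = select_arm(values, counts, t, rng)
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        alpha = step_size if step_size is not None else 1.0 / counts[arm]
        values[arm] += alpha * (reward - values[arm])
        total += reward
    return total / steps


def greedy(values, counts, t, rng):
    """Always pick the arm with the highest current estimate."""
    return max(range(len(values)), key=values.__getitem__)


def epsilon_greedy(epsilon):
    """With probability epsilon explore uniformly, otherwise act greedily."""
    def select(values, counts, t, rng):
        if rng.random() < epsilon:
            return rng.randrange(len(values))
        return greedy(values, counts, t, rng)
    return select


def ucb1(c=2.0):
    """Pick the arm maximizing estimate + sqrt(c * ln(t) / n)."""
    def select(values, counts, t, rng):
        for arm, n in enumerate(counts):
            if n == 0:  # try every arm once before using the bound
                return arm
        return max(
            range(len(values)),
            key=lambda a: values[a] + math.sqrt(c * math.log(t) / counts[a]),
        )
    return select


if __name__ == "__main__":
    means = [0.2, 0.5, 0.8]  # assumed reward probabilities
    print("epsilon-greedy:", run_bandit(epsilon_greedy(0.1), means))
    print("optimistic greedy:",
          run_bandit(greedy, means, initial_value=1.0, step_size=0.1))
    print("UCB1:", run_bandit(ucb1(), means))
```

None of these baselines uses context. A contextual bandit, as described in the abstract, would additionally condition the arm choice on features such as past student behavior and the current student state.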
Pages: 2917-2932
Number of pages: 15