CONTEXTUAL MULTI-ARMED BANDIT ALGORITHMS FOR PERSONALIZED LEARNING ACTION SELECTION

被引:0
作者
Manickam, Indu [1 ]
Lan, Andrew S. [1 ]
Baraniuk, Richard G. [1 ]
机构
[1] Rice Univ, Houston, TX 77251 USA
来源
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年
关键词
contextual bandits; personalized learning;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Optimizing the selection of learning resources and practice questions to address each individual student's needs has the potential to improve students' learning efficiency. In this paper, we study the problem of selecting a personalized learning action for each student (e. g. watching a lecture video, working on a practice question, etc.), based on their prior performance, in order to maximize their learning outcome. We formulate this problem using the contextual multi-armed bandits framework, where students' prior concept knowledge states (estimated from their responses to questions in previous assessments) correspond to contexts, the personalized learning actions correspond to arms, and their performance on future assessments correspond to rewards. We propose three new Bayesian policies to select personalized learning actions for students that each exhibits advantages over prior work, and experimentally validate them using real-world datasets.
引用
收藏
页码:6344 / 6348
页数:5
相关论文
共 14 条
[1]  
Agrawal Shipra., 2012, JMLR Workshop and Conference Proceedings, page, P26
[2]  
[Anonymous], 2011, P 4 ACM INT C WEB SE, DOI DOI 10.1145/1935826.1935878
[3]  
[Anonymous], 2006, GAUSSIAN PROCESSES M, DOI DOI 10.1142/S0129065704001899
[4]  
Chapelle O., 2011, ADV NEURAL INFORM PR, V24
[5]   A KNOWLEDGE-GRADIENT POLICY FOR SEQUENTIAL INFORMATION COLLECTION [J].
Frazier, Peter I. ;
Powell, Warren B. ;
Dayanik, Savas .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2008, 47 (05) :2410-2439
[6]  
Lan A. S., 2016, PROC 9 INT C ED DATA, P424
[7]   Time-Varying Learning and Content Analytics via Sparse Factor Analysis [J].
Lan, Andrew S. ;
Studer, Christoph ;
Baraniuk, Richard G. .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :452-461
[8]  
Lan AS, 2014, J MACH LEARN RES, V15, P1959
[9]  
Powell W. B., APPROXIMATE DYNAMIC
[10]  
Powell Warren B, 2012, Optimal learning, V841