Combining Click-Stream Data with NLP Tools to Better Understand MOOC Completion

被引:103
作者
Crossley, Scott [1 ]
Paquette, Luc [2 ]
Dascalu, Mihai [3 ]
McNamara, Danielle S. [4 ]
Baker, Ryan S. [5 ]
机构
[1] Georgia State Univ, 25 Pk Pl,Ste 1500, Atlanta, GA 30303 USA
[2] Univ Illinois, 1310 S 6th St, Champaign, IL 61820 USA
[3] Univ Politehn Bucuresti, 313 Splaiullndepententei, Bucharest, Romania
[4] Arizona State Univ, POB 872111, Tempe, AZ 85287 USA
[5] Columbia Univ, Teachers Coll, 525 West 120th St, New York, NY 10027 USA
来源
LAK '16 CONFERENCE PROCEEDINGS: THE SIXTH INTERNATIONAL LEARNING ANALYTICS & KNOWLEDGE CONFERENCE, | 2016年
基金
美国国家科学基金会;
关键词
MOOC; click-stream data; educational data mining; natural language processing; sentiment analysis; educational success; predictive analytics;
D O I
10.1145/2883851.2883931
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Completion rates for massive open online classes (MOOCs) are notoriously low. Identifying student patterns related to course completion may help to develop interventions that can improve retention and learning outcomes in MOOCs. Previous research predicting MOOC completion has focused on click-stream data, student demographics, and natural language processing (NLP) analyses. However, most of these analyses have not taken full advantage of the multiple types of data available. This study combines click-stream data and NLP approaches to examine if students' on-line activity and the language they produce in the online discussion forum is predictive of successful class completion. We study this analysis in the context of a subsample of 320 students who completed at least one graded assignment and produced at least 50 words in discussion forums, in a MOOC on educational data mining. The findings indicate that a mix of click-stream data and NLP indices can predict with substantial accuracy (78%) whether students complete the MOOC. This predictive power suggests that student interaction data and language data within a MOOC can help us both to understand student retention in MOOCs and to develop automated signals of student success.
引用
收藏
页码:6 / 14
页数:9
相关论文
共 45 条
[1]  
[Anonymous], 2010, P NAACL HLT 2010 WOR, DOI DOI 10.5555/1860631.1860635
[2]  
[Anonymous], BEHAV RES M IN PRESS
[3]  
[Anonymous], 2015, SENTIC COMPUTING COM
[4]  
[Anonymous], 2007, LIWC2007: Linguistic Inquiry and Word Count. Austin Tex
[5]  
Balakrishnan Girish., 2013, Predicting student retention in massive open online courses using hidden markov models
[6]  
Boyer S., 2015, P INT C ART INT ED
[7]  
Bradley MM., Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings. page, P49
[8]  
Cambria E, 2012, 25 INT FLAIRS C
[9]  
Cambria E., 2010, Proceedings of the 2010 AAAI Fall Symposium Series
[10]  
Chaturvedi S., 2014, P 52 M ASS COMP LING