Predicting student dropout in subscription-based online learning environments: The beneficial impact of the logit leaf model

被引:75
作者
Coussement, Kristof [1 ,2 ]
Phan, Minh [1 ,2 ]
De Caigny, Arno [1 ,2 ]
Benoit, Dries F. [3 ]
Raes, Annelies [4 ]
机构
[1] IESEG Sch Management, 3 Rue Digue, F-59000 Lille, France
[2] LEM CNRS 9221, 3 Rue Digue, F-59000 Lille, France
[3] Univ Ghent, Fac Econ & Business Adm, Tweekerkenstr 2, B-9000 Ghent, Belgium
[4] Katholieke Univ Leuven, ITEC, Imec Res Grp, Kapeldreef 75, B-3001 Leuven, Belgium
关键词
Learning analytics; Proactive student management; Subscription-based online learning; Student dropout; Logit leaf model; Machine learning; CUSTOMER CHURN PREDICTION; LOGISTIC-REGRESSION; CLASSIFICATION; PERFORMANCE; RETENTION; DECISION;
D O I
10.1016/j.dss.2020.113325
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online learning has been adopted rapidly by educational institutions and organizations. Despite its many advantages, including 24/7 access, high flexibility, rich content, and low cost, online learning suffers from high dropout rates that hamper pedagogical and economic goal outcomes. Enhanced student dropout prediction tools would help providers proactively detect students at risk of leaving and identify factors that they might address to help students continue their learning experience. Therefore, this study seeks to improve student dropout predictions, with three main contributions. First, it benchmarks a recently proposed logit leaf model (LLM) algorithm against eight other algorithms, using a real-life data set of 10,554 students of a global subscription-based online learning provider. The LLM outperforms all other methods in finding a balance between predictive performance and comprehensibility. Second, a new multilevel informative visualization of the LLM adds novel benefits, relative to a standard LLM visualization. Third, this research specifies the impacts of student demographics; classroom characteristics; and academic, cognitive, and behavioral engagement variables on student dropout. In reviewing LLM segments, these results show that different insights emerge for various student segments with different learning patterns. This notable result can be used to personalize student retention campaigns.
引用
收藏
页数:11
相关论文
共 73 条
[1]  
AL-Shabandar R, 2017, IEEE IJCNN, P713, DOI 10.1109/IJCNN.2017.7965922
[2]  
Allen I. E., 2016, ONLINE REPORT CARD T, V23, P2016
[3]   Combined 5 x 2 cv F test for comparing supervised classification learning algorithms [J].
Alpaydin, E .
NEURAL COMPUTATION, 1999, 11 (08) :1885-1892
[4]  
Amnueypornsakul B., 2014, Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), P55
[5]  
[Anonymous], 1999, Data preparation for data mining
[6]  
[Anonymous], SSRN ELECT J
[7]  
Balakrishnan G., 2013, TECH REP, P1
[8]  
Barber R., 2012, Proceedings of the 2nd International Conference on Learning Analytics and Knowledge, P259, DOI DOI 10.1145/2330601.2330664
[9]  
Bates A. W., 1997, RESTRUCTURING U TECH, P207
[10]  
Bishop C. M., 1995, Neural Networks for Pattern Recognition