Predicting student success in MOOCs: a comprehensive analysis using machine learning models

被引:0
作者
Althibyani, Hosam A. [1 ]
机构
[1] Learning Design and Technology Department, College of Education, University of Jeddah, Jeddah
关键词
Artificial intelligence; Logistic regression; Machine learning; MOOC; OULAD; Random Forest; Virtual learning environment;
D O I
10.7717/PEERJ-CS.2221
中图分类号
学科分类号
摘要
Background. This study was motivated by the increasing popularity of Massive Open Online Courses (MOOCs) and the challenges they face, such as high dropout and failure rates. The existing knowledge primarily focused on predicting student dropout, but this study aimed to go beyond that by predicting both student dropout and course results. By using machine learning models and analyzing various data sources, the study sought to improve our understanding of factors influencing student success in MOOCs. Objectives. The primary aim of this research was to develop accurate predictions of students’ course outcomes in MOOCs, specifically whether they would pass or fail. Unlike previous studies, this study took into account demographic, assessment, and student interaction data to provide comprehensive predictions. Methods. The study utilized demographic, assessment, and student interaction data to develop predictive models. Two machine learning methods, logistic regression, and random forest classification were employed to predict students’ course outcomes. The accuracy of the models was evaluated based on four-class classification (predicting four possible outcomes) and two-class classification (predicting pass or fail). Results and Conclusions. The study found that simple indicators, such as a student’s activity level on a given day, could be as effective as more complex data combinations or personal information in predicting student success. The logistic regression model achieved an accuracy of 72.1% for four-class classification and 92.4% for 2-class classification, while the random forest classifier achieved an accuracy of 74.6% for four-class classification and 95.7% for two-class classification. These findings highlight the potential of machine learning models in predicting and understanding students’ course outcomes in MOOCs, offering valuable insights for improving student engagement and success in online learning environments. Copyright 2024 Althibyani Distributed under Creative Commons CC-BY 4.0 OPEN ACCESS
引用
收藏
相关论文
共 31 条
[11]  
Haiyang L, Wang Z, Benachour P, Tubman P., A time series classification method for behaviour-based dropout prediction, 2018 IEEE 18th international conference on advanced learning technologies (ICALT), pp. 191-195, (2018)
[12]  
Hasan R, Palaniappan S, Mahmood S, Sarker KU, Abbas A., Modelling and predicting student’s academic performance using classification data mining techniques, International Journal of Business Information Systems, 34, 3, pp. 403-422, (2020)
[13]  
Hlosta M, Zdrahal Z, Zendulka J., Ouroboros: early identification of at-risk students without models based on legacy data, Proceedings of the seventh international learning analytics & knowledge conference, pp. 6-15, (2017)
[14]  
Hong B, Wei Z, Yang Y., Discovering learning behavior patterns to predict dropout in MOOC, 2017 12th international conference on computer science and education (ICCSE), pp. 700-704, (2017)
[15]  
Jha NI, Ghergulescu I, Moldovan AN., OULAD MOOC dropout and result prediction using ensemble, deep learning and regression techniques, CSEDU, 2, pp. 154-164, (2019)
[16]  
Kuzilek J, Hlosta M, Zdrahal Z., Open university learning analytics dataset, Scientific Data, 4, 1, pp. 1-8, (2017)
[17]  
Lemay DJ, Doleck T., Predicting completion of massive open online course (MOOC) assignments from video viewing behavior, Interactive Learning Environments, 30, 10, pp. 1782-1793, (2022)
[18]  
Ljubobratovic D, Matetic M., Using LMS activity logs to predict student failure with random forest algorithm, The Future of Information Sciences, 113, (2019)
[19]  
Menard S., Coefficients of determination for multiple logistic regression analysis, The American Statistician, 54, 1, pp. 17-24, (2000)
[20]  
Mourdi Y, Sadgal M, El Kabtane H, Berrada Fathi W., A machine learning-based methodology to predict learners’ dropout, success or failure in MOOCs, International Journal of Web Information Systems, 15, 5, pp. 489-509, (2019)