A predictive approach based on efficient feature selection and learning algorithms’ competition: Case of learners’ dropout in MOOCs

Cited by: 0
Authors
Mourdi Youssef
Sadgal Mohammed
El Kabtane Hamada
Berrada Fathi Wafaa
Affiliation
[1] Cadi Ayyad University, Computer Science Department
Source
Education and Information Technologies | 2019 / Vol. 24
Keywords
Dropout; Distance education; Feature selection; Algorithms competition; Educational data mining; MOOC
DOI
Not available
Abstract
MOOCs are increasingly used in the pedagogical experiments of universities whose infrastructure cannot accommodate the growing number of learners. These universities aim to complement their initial training with distance learning courses. Unfortunately, efforts to make this pedagogical model succeed face a dropout rate among enrolled learners that reaches 90% in some cases. This makes coaching, the formation of learner groups, and instructor/learner interaction challenging. In this context, this research proposes a predictive model that classifies MOOC learners into three classes: learners at risk of dropping out, those likely to fail, and those on track to succeed. Automatically determining the relevant attributes for analysis, classification, interpretation, and prediction from MOOC learner data will allow instructors to streamline interventions for each class. To this end, we present an approach based on feature selection methods and ensemble machine learning algorithms. The proposed model was tested on a dataset of over 5,500 learners in two Stanford University MOOC courses. To attest to its performance (98.6%), a comparison was carried out based on several performance measures.
Pages: 3591-3618
Page count: 27