A machine learning based model for student's dropout prediction in online training

被引:3
作者
Zerkouk, Meriem [1 ]
Mihoubi, Miloud [2 ]
Chikhaoui, Belkacem [2 ]
Wang, Shengrui [1 ]
机构
[1] Univ Sherbrooke, Dept Comp Sci, Sherbrooke, PQ, Canada
[2] Univ Teluq, Artificial Intelligence Inst, 5800 rue St Denis, Montreal, PQ H2S 3L5, Canada
关键词
Dropout school; Machine learning; Prediction; Sociodemographic data; Bihavioral data;
D O I
10.1007/s10639-024-12500-w
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
School dropout is a significant issue in distance learning, and early detection is crucial for addressing the problem. Our study aims to create a binary classification model that anticipates students' activity levels based on their current achievements and engagement on a Canadian Distance learning Platform. Predicting student dropout, a common classification problem in educational data analysis, is addressed by utilizing a comprehensive dataset that includes 49 features ranging from socio-demographic to behavioral data. This dataset provides a unique opportunity to analyze student interactions and success factors in a distance learning environment. We have developed a student profiling system and implemented a predictive approach using XGBoost, selecting the most important features for the prediction process. In this work, our methodology was developed in Python, using the widely used sci-kit-learn package. Alongside XGBoost, logistic regression was also employed as part of our combination of strategies to enhance the models predictive capabilities. Our work can accurately predict student dropout, achieving an accuracy rate of approximately 82% on unseen data from the next academic year.
引用
收藏
页码:15793 / 15812
页数:20
相关论文
共 19 条
[1]  
Alam Rizwan, 2023, Mobile Computing and Sustainable Informatics: Proceedings of ICMCSI 2023. Lecture Notes on Data Engineering and Communications Technologies (166), P549, DOI 10.1007/978-981-99-0835-6_39
[2]  
Alario-Hoyos C, 2017, INT REV RES OPEN DIS, V18, P119
[3]  
Alhramelah A., 2020, Australian Educational Computing, V35, P1
[4]  
Chen J., 2019, J HINDAWI MATH PROBL
[5]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[6]   Student Dropout Prediction [J].
Del Bonifro, Francesca ;
Gabbrielli, Maurizio ;
Lisanti, Giuseppe ;
Zingaro, Stefano Pio .
ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2020), PT I, 2020, 12163 :129-140
[7]  
Issah I., 2023, Decis Analyt J, V7, DOI [10.1016/j.dajour.2023.100204, DOI 10.1016/J.DAJOUR.2023.100204]
[8]  
Kemper L., 2020, European Journal of Higher Education, V10, P28, DOI [DOI 10.1080/21568235.2020.1718520, 10.1080/21568235.2020.1718520]
[9]  
King G., 2017, Political Anal, V9, P137, DOI DOI 10.1093/OXFORDJOURNALS.PAN.A004868
[10]   An explainable machine learning approach for student dropout prediction [J].
Krueger, Joao Gabriel Correa ;
Britto Jr, Alceu de Souza ;
Barddal, Jean Paul .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 233