Application of Data Science and Machine Learning in the Prediction of College Dropout: A Data-Driven Predictive Approach

Cited by: 1
Authors
Felix Jimenez, Axel Frederick [1]
Sanchez Lee, Vania Stephany [1]
Ibarra Belmonte, Isaul [2]
Parra Gonzalez, Ezra Federico [3]
Affiliations
[1] Natl Polytech Inst, Engn Comp Syst, Zacatecas, Mexico
[2] Ctr Res Math, Software Engn, Zacatecas, Mexico
[3] Ctr Res Math, Dept Comp Sci, Zacatecas, Mexico
Source
2023 12TH INTERNATIONAL CONFERENCE ON SOFTWARE PROCESS IMPROVEMENT, CIMPS 2023 | 2023
Keywords
data science; machine learning; education; Mexico; dropout;
DOI
10.1109/CIMPS61323.2023.10528825
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
This paper presents a study on predicting student graduation or failure using two predictive models: K-Nearest Neighbors (KNN) and a sequential feedforward Artificial Neural Network (ANN). The models, built on a selected set of independent variables, were assessed using metrics such as precision and accuracy. Both models predicted student graduation or failure with high accuracy: 0.9133 for KNN (K=3) and 0.9312 for the ANN (after 50 epochs). The models thus provide a tool for identifying students at risk of not graduating early on, so that preventive measures can be taken to improve their academic performance. Comparisons with related research showed consistent outcomes, supporting the credibility of the predictive models employed.
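As a rough illustration of the KNN setup the abstract describes (K=3, scored with accuracy and precision), a minimal sketch follows. It is not the authors' implementation: the two features (grade average, attendance rate), the toy data, and the helper names `knn_predict`, `accuracy`, and `precision` are all hypothetical, and the features are left unscaled for brevity.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, x, k=3):
    """Label x by majority vote among its k nearest training points."""
    # Rank training points by Euclidean distance to x and keep the k closest.
    nearest = sorted(
        zip(train_X, train_y),
        key=lambda pair: math.dist(pair[0], x),
    )[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def precision(y_true, y_pred, positive=1):
    """Fraction of positive predictions that are truly positive."""
    predicted_pos = [t for t, p in zip(y_true, y_pred) if p == positive]
    if not predicted_pos:
        return 0.0
    return sum(t == positive for t in predicted_pos) / len(predicted_pos)


# Hypothetical toy data: (grade average, attendance rate).
# Label 1 = graduated, 0 = did not graduate.
train_X = [(9.1, 0.95), (8.5, 0.90), (8.8, 0.92),
           (5.2, 0.60), (6.0, 0.55), (5.5, 0.50)]
train_y = [1, 1, 1, 0, 0, 0]
test_X = [(8.9, 0.93), (5.8, 0.58), (8.0, 0.85), (6.1, 0.62)]
test_y = [1, 0, 1, 0]

y_pred = [knn_predict(train_X, train_y, x, k=3) for x in test_X]
print("accuracy:", accuracy(test_y, y_pred))
print("precision:", precision(test_y, y_pred))
```

With real student records, features on different scales would normally be standardized before computing distances, since otherwise the largest-valued feature dominates the vote.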
Pages: 234-243
Page count: 10