Early segmentation of students according to their academic performance: A predictive modelling approach

被引:97
作者
Migueis, V. L. [1 ]
Freitas, Ana [2 ]
Garcia, Paulo J. V. [2 ]
Silva, Andre [2 ]
机构
[1] Univ Porto, Fac Engn, INESC TEC, Rua Dr Roberto Frias, P-4200465 Porto, Portugal
[2] Univ Porto, Fac Engn, Rua Dr Roberto Frias, P-4200465 Porto, Portugal
关键词
Educational data mining; Predictive modelling; Data mining; Academic performance; Engineering education; HONORS PROGRAM; MANAGEMENT; EDUCATION; SUCCESS; SYSTEM; ACHIEVEMENT; VALIDITY; IMPACT; GPA;
D O I
10.1016/j.dss.2018.09.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The early classification of university students according to their potential academic performance can be a useful strategy to mitigate failure, to promote the achievement of better results and to better manage resources in higher education institutions. This paper proposes a two-stage model, supported by data mining techniques, that uses the information available at the end of the first year of students' academic career (path) to predict their overall academic performance. Unlike most literature on educational data mining, academic success is inferred from both the average grade achieved and the time taken to conclude the degree. Furthermore, this study proposes to segment students based on the dichotomy between the evidence of failure or high performance at the beginning of the degree program, and the students' performance levels predicted by the model. A data set of 2459 students, spanning the years from 2003 to 2015, from a European Engineering School of a public research University, is used to validate the proposed methodology. The empirical results demonstrate the ability of the proposed model to predict the students' performance level with an accuracy above 95%, in an early stage of the students' academic path. It is found that random forests are superior to the other classification techniques that were considered (decision trees, support vector machines, naive Bayes, bagged trees and boosted trees). Together with the prediction model, the suggested segmentation framework represents a useful tool to delineate the optimum strategies to apply, in order to promote higher performance levels and mitigate academic failure, overall increasing the quality of the academic experience provided by a higher education institution.
引用
收藏
页码:36 / 51
页数:16
相关论文
共 79 条
[1]   Combination of machine learning algorithms for recommendation of courses in E-Learning System based on historical data [J].
Aher, Sunita B. ;
Lobo, L. M. R. J. .
KNOWLEDGE-BASED SYSTEMS, 2013, 51 :1-14
[2]   Predicting the academic success of architecture students by pre-enrolment requirement: using machine-learning techniques [J].
Aluko, Ralph Olusola ;
Adenuga, Olumide Afolarin ;
Kukoyi, Patricia Omega ;
Soyingbe, Aliu Adebayo ;
Oyedeji, Joseph Oyewale .
CONSTRUCTION ECONOMICS AND BUILDING, 2016, 16 (04) :86-98
[3]  
[Anonymous], LONGITUDINAL STUDY I
[4]  
[Anonymous], 2014, INT J ENG TECHNOL
[5]  
[Anonymous], 1 WORLD 2020 WHAT WI
[6]  
[Anonymous], STUDIES ED EVALUATIO
[7]  
[Anonymous], 2014, P 4 INT C LEARN AN K
[8]  
[Anonymous], STUDIES COMPUTATIONA
[9]  
[Anonymous], TECHNICAL REPORT
[10]   Factors influencing university drop out rates [J].
Araque, Francisco ;
Roldan, Concepcion ;
Salguero, Alberto .
COMPUTERS & EDUCATION, 2009, 53 (03) :563-574