Using machine learning to predict factors affecting academic performance: the case of college students on academic probation

被引:13
作者
Al-Alawi, Lamees [1 ]
Al Shaqsi, Jamil [1 ]
Tarhini, Ali [1 ]
Al-Busaidi, Adil S. [1 ,2 ,3 ]
机构
[1] Sultan Qaboos Univ, Coll Econ & Polit Sci, Dept Informat Syst, POB 20, Muscat 123, Oman
[2] Sultan Qaboos Univ, Innovat & Technol Transfer Ctr, Muscat, Oman
[3] Sultan Qaboos Univ, Dept Business Commun, POB 20, Muscat 123, Oman
关键词
Data Mining; Education Data Mining; Predictive models; Supervised learning; Higher education; Student Academic performance; Academic under probation; Oman;
D O I
10.1007/s10639-023-11700-0
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
This study aims to employ the supervised machine learning algorithms to examine factors that negatively impacted academic performance among college students on probation (underperforming students). We used the Knowledge Discovery in Databases (KDD) methodology on a sample of N = 6514 college students spanning 11 years (from 2009 to 2019) provided by a major public university in Oman. We used the Information Gain (InfoGain) algorithm to select the most effective features and ensemble methods to compare the accuracy with more robust algorithms, including Logit Boost, Vote, and Bagging. The algorithms were evaluated based on the performance evaluation metrics such as accuracy, precision, recall, F-measure, and ROC curve, and then validated using 10-folds cross-validation. The study revealed that the main identified factors affecting student academic achievement include study duration in the university and previous performance in secondary school. Based on the experimental results, these features were consistently ranked as the top factors that negatively impacted academic performance. The study also indicated that gender, estimated graduation year, cohort, and academic specialization significantly contributed to whether a student was under probation. Domain experts and other students were involved in verifying some of the results. The theoretical and practical implications of this study are discussed.
引用
收藏
页码:12407 / 12432
页数:26
相关论文
共 72 条
[31]  
James G, 2013, SPRINGER TEXTS STAT, V103, P1, DOI [10.1007/978-1-4614-7138-7, 10.1007/978-1-4614-7138-7_1]
[32]  
Jia J. W., 2013, THESIS BOWIE STATE U
[33]  
JIAO P, 2022, ARTIF INTELL REV, P1
[34]  
Kalavathy R., 2007, IET-UK International Conference on Information and Communication Technology in Electrical Sciences (ICTES 2007), P1105, DOI 10.1049/ic:20070775
[35]   Data mining based analysis to explore the effect of teaching on student performance [J].
Khan, Anupam ;
Ghosh, Soumya K. .
EDUCATION AND INFORMATION TECHNOLOGIES, 2018, 23 (04) :1677-1697
[36]  
Khan F., 2019, DESIGN THINKING HUMA
[37]  
KHANNA L., 2016, SYSTEMATIC REV, P1, DOI [DOI 10.1109/IICIP.2016.7975354, 10.1109/IICIP.2016.7975354]
[38]   A Survey on Machine Learning-Based Performance Improvement of Wireless Networks: PHY, MAC and Network Layer [J].
Kulin, Merima ;
Kazaz, Tarik ;
De Poorter, Eli ;
Moerman, Ingrid .
ELECTRONICS, 2021, 10 (03) :1-64
[39]  
KUMAR R, 2017, INT J MECH ENG INFOR, V5, P1843, DOI DOI 10.18535/IJMEIT/V5I1.02
[40]   Exploring the determinants of students' academic performance at university level: The mediating role of internet usage continuance intention [J].
Maqableh, Mahmoud ;
Jaradat, Mais ;
Azzam, Ala'a .
EDUCATION AND INFORMATION TECHNOLOGIES, 2021, 26 (04) :4003-4025