Interpretable machine learning for academic performance prediction: A SHAP-based analysis of key influencing factors

被引:0
作者
Guan, Yiming [1 ]
Wang, Fenglan [1 ]
Song, Shihao [1 ]
机构
[1] Guangdong Business & Technol Univ, Sch Finance Econ & Law, Zhaoqing, Guangdong, Peoples R China
关键词
Prediction; machine learning; academic performance; SHAP; CROSS-VALIDATION;
D O I
10.1080/14703297.2025.2532050
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
This study employs machine learning approaches to predict the final exam scores of vocational undergraduate students and analyse critical factors influencing their academic performance. Using a multidimensional feature dataset, Ridge Regression was set as a baseline model, while four mainstream machine learning models - Random Forest, XGBoost, Support Vector Machine and Neural Network - were utilised for predictive modelling, with Random Forest achieving the best performance. SHapley Additive exPlanations (SHAP) was applied to interpret global and local feature contributions, indicating monthly exam scores, admission scores and self-study time as the most influential predictors, whereas demographic features were comparatively less significant. Furthermore, Partial Dependence Plots (PDP) and Kernel Density Estimation (KDE) analyses were conducted to explore feature interactions and differences between high- and low-achieving students, offering practical insights for vocational institutions to implement precise interventions focusing on key predictive factors.
引用
收藏
页数:20
相关论文
共 49 条
[1]  
Aghbalou A, 2022, Arxiv, DOI arXiv:2202.10211
[2]   Cross-Validation Visualized: A Narrative Guide to Advanced Methods [J].
Allgaier, Johannes ;
Pryss, Ruediger .
MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2024, 6 (02) :1378-1388
[3]   Predicting Academic Outcomes: A Survey from 2007 Till 2018 [J].
Alturki, Sarah ;
Hulpus, Ioana ;
Stuckenschmidt, Heiner .
TECHNOLOGY KNOWLEDGE AND LEARNING, 2022, 27 (01) :275-307
[4]   Evidence-based peer-tutoring program to improve students' performance at the university [J].
Arco-Tirado, Jose L. ;
Fernandez-Martin, Francisco D. ;
Hervas-Torres, Miriam .
STUDIES IN HIGHER EDUCATION, 2020, 45 (11) :2190-2202
[5]  
Austin G. I., 2024, ArXiv, parXiv
[6]   Contributions of Machine Learning Models towards Student Academic Performance Prediction: A Systematic Review [J].
Balaji, Prasanalakshmi ;
Alelyani, Salem ;
Qahmash, Ayman ;
Mohana, Mohamed .
APPLIED SCIENCES-BASEL, 2021, 11 (21)
[7]  
Black P., 1998, Inside the black box: Raising standards through classroom assessment, DOI [10.1080/02671520600615612, DOI 10.1080/02671520600615612]
[8]   Student perceptions of feedback in reciprocal or nonreciprocal peer tutoring or mentoring [J].
Byl, E. ;
Topping, K. J. .
STUDIES IN EDUCATIONAL EVALUATION, 2023, 79
[9]   An intelligent tutoring system for supporting active learning: A case study on predictive parsing learning [J].
Castro-Schez, J. J. ;
Glez-Morcillo, C. ;
Albusac, J. ;
Vallejo, D. .
INFORMATION SCIENCES, 2021, 544 :446-468
[10]  
Chen F., 2021, Research on the relationship between learners' information literacy, teachers' online teaching behavior and online learning effectiveness from the perspective of educational data mining, DOI [https://doi.org/10.27159/d.cnki.ghzsu.2021.000960, DOI 10.27159/D.CNKI.GHZSU.2021.000960]