Uncovering student profiles. An explainable cluster analysis approach to PISA 2022

Cited by: 3
Authors
Alvarez-Garcia, Miguel [1]
Arenas-Parra, Mar [1]
Ibar-Alonso, Raquel [2]
Affiliations
[1] Univ Oviedo, Dept Quantitat Econ, Oviedo 33006, Spain
[2] Rey Juan Carlos Univ, Dept Appl Econ 1, Madrid 28032, Spain
Keywords
Educational data mining; Explainable cluster analysis; Student profiles; International large-scale assessments; PISA; ACHIEVEMENT; SCIENCE; SEGMENTATION; EXPLANATIONS; PERFORMANCE; SELECTION
DOI
10.1016/j.compedu.2024.105166
Chinese Library Classification
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Educational data mining (EDM) applied to the wealth of data generated from international large-scale assessments (ILSAs) shows potential for identifying successful educational initiatives. Despite limited research on clustering methods in ILSAs, leveraging these methods to uncover student profiles can help decision-making in designing tailored programs. This study aims to identify and characterize 15-year-old student profiles using PISA 2022 data and reveal insights into the relationship between these profiles and factors such as ICT availability and use, gender, academic performance, and educational expectations. We analyzed PISA 2022 Spanish student data (n = 30,800) with a selection of 74 contextual variables, applying an end-to-end explainable cluster analysis methodology that integrates different machine learning (ML) and explainable artificial intelligence (XAI) techniques. This methodology covered data pre-processing, dimensionality reduction, clustering, and classification to ensure data quality and result explainability. We obtained 16 derived variables, 7 student clusters, and an optimal XGBoost classifier with a global accuracy of 0.8643. Using local and global SHAP values, we interpreted clusters, finding that socio-economic status and ICT availability and use at home are the most important factors differentiating student profiles. Our findings suggest the need to emphasize (i) proper ICT accessibility and use, as well as student support networks to improve academic performance, (ii) gender-specific well-being programs, and (iii) the encouragement of educational expectations tailored to students' gender and their exposure to higher education. These results pave the way for personalized academic policies and programs through ML-based tools for uncovering student profiles.
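The pipeline stages named in the abstract (pre-processing, dimensionality reduction, clustering, classification) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which reduction or clustering algorithms were used, so PCA and k-means are assumptions here. scikit-learn's `GradientBoostingClassifier` stands in for XGBoost, its impurity-based feature importances stand in for SHAP values, and the data are synthetic blobs rather than PISA records. The function name `profile_students` and the choice to train the classifier on the derived variables are likewise hypothetical. Only the counts (74 input variables, 16 derived variables, 7 clusters) come from the abstract.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler


def profile_students(X, n_components=16, n_clusters=7, seed=0):
    """Hypothetical helper: scale -> reduce -> cluster -> classify."""
    # Pre-processing: put every contextual variable on the same scale.
    X_scaled = StandardScaler().fit_transform(X)

    # Dimensionality reduction (PCA is an assumption, not the paper's method).
    derived = PCA(n_components=n_components, random_state=seed).fit_transform(X_scaled)

    # Clustering (k-means is an assumption, not the paper's method).
    labels = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit_predict(derived)

    # Classification: learn to predict cluster membership so that the
    # clusters can be explained through the classifier's feature attributions.
    X_tr, X_te, y_tr, y_te = train_test_split(
        derived, labels, test_size=0.25, random_state=seed, stratify=labels
    )
    clf = GradientBoostingClassifier(n_estimators=50, random_state=seed)
    clf.fit(X_tr, y_tr)
    acc = accuracy_score(y_te, clf.predict(X_te))
    return labels, clf, acc


# Synthetic stand-in data: 700 "students", 74 "contextual variables".
X_demo, _ = make_blobs(n_samples=700, n_features=74, centers=7, random_state=0)
labels, clf, acc = profile_students(X_demo)

# Global importance of each derived variable (a rough stand-in for global SHAP).
importances = clf.feature_importances_
```

Training a supervised classifier on the cluster labels is what makes the clusters explainable: a high hold-out accuracy suggests the classifier has captured the cluster boundaries, so its feature attributions can be read as a description of what separates the profiles.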
Pages: 24