Machine learning predicts upper secondary education dropout as early as the end of primary school

Cited by: 1
Authors
Psyridou, Maria [1]
Prezja, Fabi [2]
Torppa, Minna [3]
Lerkkanen, Marja-Kristiina [3]
Poikkeus, Anna-Maija [3]
Vasalampi, Kati [4]
Affiliations
[1] Univ Jyvaskyla, Dept Psychol, Jyvaskyla 40014, Finland
[2] Univ Jyvaskyla, Fac Informat Technol, Jyvaskyla 40014, Finland
[3] Univ Jyvaskyla, Dept Teacher Educ, Jyvaskyla 40014, Finland
[4] Univ Jyvaskyla, Dept Educ, Jyvaskyla 40014, Finland
Source
SCIENTIFIC REPORTS | 2024 / Vol. 14 / Issue 01
Funding
Academy of Finland
Keywords
Machine learning; Education dropout; Longitudinal data; Upper secondary education; Comprehensive education; Kindergarten; Academic outcomes; READING-COMPREHENSION; PERFORMANCE; STUDENTS;
DOI
10.1038/s41598-024-63629-0
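Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes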
07 ; 0710 ; 09 ;
Abstract
Education plays a pivotal role in alleviating poverty, driving economic growth, and empowering individuals, thereby significantly influencing societal and personal development. However, the persistent issue of school dropout poses a significant challenge, with its effects extending beyond the individual. While previous research has employed machine learning for dropout classification, these studies often suffer from a short-term focus, relying on data collected only a few years into the study period. This study expanded the modeling horizon by utilizing a 13-year longitudinal dataset, encompassing data from kindergarten to Grade 9. Our methodology incorporated a comprehensive range of parameters, including students' academic and cognitive skills, motivation, behavior, well-being, and officially recorded dropout data. The machine learning models developed in this study demonstrated notable classification ability, achieving a mean area under the curve (AUC) of 0.61 with data up to Grade 6 and an improved AUC of 0.65 with data up to Grade 9. Further data collection and independent correlational and causal analyses are crucial. In future iterations, such models may have the potential to proactively support educators' processes and existing protocols for identifying at-risk students, thereby potentially aiding in the reinvention of student retention and success strategies and ultimately contributing to improved educational outcomes.
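The "mean AUC" figures reported in the abstract can be read as an average of per-split AUC values. The sketch below is only an illustration of that metric, not the authors' pipeline: this record does not state which models or validation scheme were used, and the names `roc_auc`, `mean_auc`, and `toy_folds`, as well as the toy labels and scores, are hypothetical.

```python
from statistics import mean


def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs in which the
    positive example gets the higher score; ties count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def mean_auc(folds):
    """Average the AUC over several (labels, scores) evaluation splits."""
    return mean(roc_auc(labels, scores) for labels, scores in folds)


# Hypothetical toy data: 1 = dropout, 0 = no dropout; scores are model outputs.
toy_folds = [
    ([1, 0, 0, 1], [0.9, 0.2, 0.6, 0.5]),
    ([0, 1, 0, 1], [0.1, 0.8, 0.3, 0.7]),
]

if __name__ == "__main__":
    print(mean_auc(toy_folds))
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect ranking, so values such as 0.61 and 0.65 indicate modest separation between students who dropped out and those who did not.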
Pages: 14