Predicting science achievement scores with machine learning algorithms: a case study of OECD PISA 2015–2018 data

被引：0

作者：

Sibel Acıslı-Celik

Cafer Mert Yesilkanat

机构：

[1] Artvin Çoruh University,Science Teaching Department

来源：

Neural Computing and Applications | 2023年 / 35卷

关键词：

Artificial intelligence; Interdisciplinary/transdisciplinary; Random forest; XGBoost;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this study, the performance of machine learning methods was examined in terms of predicting the science education achievement scores of the students who took the exam for the next term, PISA 2018, and the science average scores of the countries, using PISA 2015 data. The research sample consists of a total of 67,329 students who took the PISA 2015 exam from 13 randomly selected countries (Brazil, Chinese Taipei, Dominican Republic, Estonia, Finland, Hungary, Italy, Japan, Lithuania, Luxembourg, Peru, Singapore, Türkiye). In this study, multiple linear regression, support vector regression, random forest, and extreme gradient boosting (XGBoost) machine learning algorithms were used. For the machine learning process, a randomly determined part from the PISA-2015 data of each country researched was divided as training data and the remaining part as testing data to evaluate model performance. As a result of the research, it was determined that the XGBoost algorithm showed the best performance in estimating both PISA-2015 test data and PISA-2018 science academic achievement scores in all researched countries. Furthermore, it was determined that the highest PISA-2018 science achievement scores of the students who participated in the exam, estimated by this algorithm, were in Luxembourg (r = 0.600, RMSE = 75.06, MAE = 59.97), while the lowest were in Finland (r = 0.467, RMSE = 79.38, MAE = 63.24). In addition, the average PISA-2018 science scores of the countries were estimated with the XGBoost algorithm, and the average science scores calculated for all the countries studied were estimated with very high accuracy.

引用

页码：21201 / 21228

页数：27

共 182 条

[1]

Aydın A(2011)A comparative evaluation of pisa 2003–2006 results in reading literacy skills: an example of top-five OECD countries and Turkey Educ Sci Theory Pract 11 665-673

[2]

Erdağ C(2021)The effects of ICT-based social media on adolescents’ digital reading performance: a longitudinal study of PISA 2009, PISA 2012, PISA 2015 and PISA 2018 Comput Educ 175 104342-740

[3]

Taş N(2021)Tailoring a measurement model of socioeconomic status: applying the alignment optimization method to 15 years of PISA Int J Educ Res 106 101723-135

[4]

Hu J(2020)Student composition in the PISA assessments: evidence from Brazil Int J Educ Dev 79 102299-157

[5]

Yu R(2021)Public and private school efficiency and equity in Latin America: new evidence based on PISA for development Int J Educ Dev 84 102404-222

[6]

Rolfe V(2021)Are schools digitally inclusive for all? Profiles of school digital inclusion using PISA 2018 Comput Educ 170 104226-1085

[7]

Gomes M(2021)Teachers’ perceived societal appreciation: PISA outcomes predict whether teachers feel valued in society Int J Educ Res 109 101833-158

[8]

Hirata G(2021)Testing measurement invariance of PISA 2015 mathematics, science, and ICT scales using the alignment method Stud Educ Eval 68 100965-111

[9]

e Oliveira JBA(2020)The assessment of collaborative problem solving in PISA 2015: an investigation of the validity of the PISA 2015 CPS tasks Comput Educ 157 103964-199

[10]

Delprato M(2004)Comparison of machine learning techniques with classical statistical models in predicting health outcomes Stud Health Technol Inform 107 736-163

← 1 2 3 4 5 6 7 8 9 10 →