Taking PISA Seriously: How Accurate are Low-Stakes Exams?

Cited by: 20
Authors
Akyol, Pelin [1]
Krishna, Kala [2, 3]
Wang, Jinwen [4]
Affiliations
[1] Bilkent Univ, Ankara, Turkey
[2] Penn State Univ, CES, IFO, State Coll, PA USA
[3] NBER, State Coll, PA USA
[4] Bates White Econ Consulting, Washington, DC USA
Keywords
Low-stakes exams; Computer-based assessments; PISA; Biased rankings; Item response data;
DOI
10.1007/s12122-021-09317-8
Chinese Library Classification number
F24 [Labor economics];
Subject classification codes
020106; 020207; 1202; 120202;
Abstract
PISA is seen as the gold standard for evaluating educational outcomes worldwide. Yet because it is a low-stakes exam, students may not take it seriously, resulting in downward-biased scores and inaccurate rankings. This paper provides a method to identify and account for non-serious behavior in low-stakes exams by leveraging information in computer-based assessments in PISA 2015. Our method corrects for non-serious behavior by fully imputing scores for items not taken seriously. We compare the scores/rankings calculated by our method to the scores/rankings calculated by giving zero points to skipped items, as well as to the scores/rankings calculated by treating skipped items at the end of the exam as if they were not administered, which is the procedure followed by PISA. We show that a country can improve its ranking by up to 15 places by encouraging its own students to take the exam seriously, and that the PISA approach corrects for only about half of the bias generated by non-seriousness.
Pages: 184-243
Page count: 60
Related papers
39 in total
[1]   GENDER DIFFERENCES IN RESPONSE TO BIG STAKES [J].
Azmat, Ghazala ;
Calsamiglia, Caterina ;
Iriberri, Nagore .
JOURNAL OF THE EUROPEAN ECONOMIC ASSOCIATION, 2016, 14 (06) :1372-1400
[2]   Multiple imputation by chained equations: what is it and how does it work? [J].
Azur, Melissa J. ;
Stuart, Elizabeth A. ;
Frangakis, Constantine ;
Leaf, Philip J. .
INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH, 2011, 20 (01) :40-49
[3]   Test motivation in the assessment of student skills: The effects of incentives on motivation and performance [J].
Baumert, J ;
Demmrich, A .
EUROPEAN JOURNAL OF PSYCHOLOGY OF EDUCATION, 2001, 16 (03) :441-462
[4]  
Borghans L., 2012, The Leaning Tower of Pisa Decomposing achievement test scores into cognitive and noncognitive components
[5]   An international comparison of students' ability to endure fatigue and maintain motivation during a low-stakes test [J].
Borgonovi, Francesca ;
Biecek, Przemyslaw .
LEARNING AND INDIVIDUAL DIFFERENCES, 2016, 49 :128-137
[6]  
Butler Jayne, 2007, J Appl Meas, V8, P279
[7]   Predicting student achievement for low stakes tests with effort and task value [J].
Cole, James S. ;
Bergin, David A. ;
Whittaker, Tiffany A. .
CONTEMPORARY EDUCATIONAL PSYCHOLOGY, 2008, 33 (04) :609-624
[8]   Role of test motivation in intelligence testing [J].
Duckworth, Angela Lee ;
Quinn, Patrick D. ;
Lynam, Donald R. ;
Loeber, Rolf ;
Stouthamer-Loeber, Magda .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (19) :7716-7720
[9]   A Cross-National Comparison of Reported Effort and Mathematics Performance in TIMSS Advanced [J].
Eklof, Hanna ;
Pavesic, Barbara Japelj ;
Gronmo, Liv Sissel .
APPLIED MEASUREMENT IN EDUCATION, 2014, 27 (01) :31-45
[10]   Skill and will: test-taking motivation and assessment quality [J].
Eklof, Hanna .
ASSESSMENT IN EDUCATION-PRINCIPLES POLICY & PRACTICE, 2010, 17 (04) :345-356