Using generalizability theory to investigate the variability and reliability of EFL composition scores by human raters and e-rater

Cited by: 1
Authors
Sari, Elif [1 ]
Han, Turgay [2 ]
Affiliations
[1] Karadeniz Tech Univ, Trabzon, Turkey
[2] Ordu Univ, Ordu, Turkey
Keywords
EFL writing assessment; generalizability theory; scoring variability; scoring reliability; automated writing evaluation (AWE)
DOI
10.30827/portalin.vi38.18056
Chinese Library Classification
G40 [Education]
Discipline codes
040101; 120403
Abstract
Using generalizability theory (G-theory) as its theoretical framework, this study investigated the variability and reliability of holistic scores assigned by human raters and by e-rater to the same EFL essays. Eighty argumentative essays written on two different topics by tertiary-level Turkish EFL students were scored holistically by e-rater and by eight human raters who had received detailed rater training. The results showed that e-rater and the human raters assigned significantly different holistic scores to the same EFL essays. G-theory analyses revealed that the human raters assigned considerably inconsistent scores to the same essays despite the detailed rater training, and that more reliable ratings were obtained when e-rater was integrated into the scoring procedure. Implications for EFL writing assessment practices are discussed.
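The G-theory analysis reported in the abstract rests on decomposing score variance into components for essays (persons), raters, and their interaction, and then forming generalizability (relative) and dependability (absolute) coefficients. The sketch below is a minimal illustration of such a persons-by-raters G-study under the assumption of a fully crossed single-facet design with one holistic score per essay-rater pair; the function name, the expected-mean-square formulas for that specific design, and the toy ratings matrix are illustrative assumptions, not the authors' actual analysis or data.

```python
# Minimal p x r (essays x raters) G-study sketch, assuming a fully crossed,
# balanced design. Toy data only; not the study's ratings.
import numpy as np

def g_study(scores: np.ndarray) -> dict:
    """scores: (n_persons, n_raters) matrix of holistic ratings."""
    n_p, n_r = scores.shape
    grand = scores.mean()
    person_means = scores.mean(axis=1)
    rater_means = scores.mean(axis=0)

    # Sums of squares for the two main effects and the residual (pr, e).
    ss_p = n_r * np.sum((person_means - grand) ** 2)
    ss_r = n_p * np.sum((rater_means - grand) ** 2)
    ss_total = np.sum((scores - grand) ** 2)
    ss_pr = ss_total - ss_p - ss_r

    ms_p = ss_p / (n_p - 1)
    ms_r = ss_r / (n_r - 1)
    ms_pr = ss_pr / ((n_p - 1) * (n_r - 1))

    # Expected-mean-square equations for the random-effects p x r design.
    var_pr = ms_pr                              # interaction + error
    var_p = max((ms_p - ms_pr) / n_r, 0.0)      # true essay variance
    var_r = max((ms_r - ms_pr) / n_p, 0.0)      # rater severity variance

    # Relative (G) and absolute (Phi) coefficients for n_r raters.
    g_coef = var_p / (var_p + var_pr / n_r)
    phi = var_p / (var_p + (var_r + var_pr) / n_r)
    return {"var_p": var_p, "var_r": var_r, "var_pr,e": var_pr,
            "G": g_coef, "Phi": phi}

# Toy example: 5 essays scored by 3 raters on a 1-6 holistic scale.
ratings = np.array([[4, 5, 4],
                    [2, 3, 2],
                    [5, 5, 6],
                    [3, 2, 3],
                    [4, 4, 5]], dtype=float)
print(g_study(ratings))
```

Under this kind of decomposition, a large rater and interaction variance relative to essay variance lowers the coefficients, which is how a G-study can show that scores remain inconsistent across raters even after training, and how adding a consistent automated rater can raise the dependability of the composite score.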
Pages: 27-45
Page count: 19