Semi-automatic coding of open-ended text responses in large-scale assessments

被引：10

作者：

Andersen, Nico ^{[1
]}

Zehner, Fabian ^{[1
,2
]}

Goldhammer, Frank ^{[1
,2
]}

机构：

[1] DIPF, Leibniz Inst Res & Informat Educ, Rostocker Str 6, D-60323 Frankfurt, Germany

[2] Ctr Int Student Assessment ZIB eV, Frankfurt, Germany

来源：

JOURNAL OF COMPUTER ASSISTED LEARNING | 2023年 / 39卷 / 03期

关键词：

clustering; eco; effort reduction; exploring coding assistant; semi-automatic coding; support human raters; AGREEMENT; TRENDS;

D O I：

10.1111/jcal.12717

中图分类号：

G40 [教育学];

学科分类号：

040101 ; 120403 ;

摘要：

Background In the context of large-scale educational assessments, the effort required to code open-ended text responses is considerably more expensive and time-consuming than the evaluation of multiple-choice responses because it requires trained personnel and long manual coding sessions. Aim Our semi-supervised coding method eco (exploring coding assistant) dynamically supports human raters by automatically coding a subset of the responses. Method We map normalized response texts into a semantic space and cluster response vectors based on their semantic similarity. Assuming that similar codes represent semantically similar responses, we propagate codes to responses in optimally homogeneous clusters. Cluster homogeneity is assessed by strategically querying informative responses and presenting them to a human rater. Following each manual coding, the method estimates the code distribution respecting a certainty interval and assumes a homogeneous distribution if certainty exceeds a predefined threshold. If a cluster is determined to certainly comprise homogeneous responses, all remaining responses are coded accordingly automatically. We evaluated the method in a simulation using different data sets. Results With an average miscoding of about 3%, the method reduced the manual coding effort by an average of about 52%. Conclusion Combining the advantages of automatic and manual coding produces considerable coding accuracy and reduces the required manual effort.

引用

页码：841 / 854

页数：14

共 59 条

[1]

Aggarwal CC, 2012, Mining text data, P77

[2]

Andersen N., 2021, Psych, V3, P422, DOI DOI 10.3390/PSYCH3030030

[3] An Investigation of Students' Self-Regulated Learning Strategies: Students' Qualitative and Quantitative Accounts of Their Learning Strategies [J].