Semi-automatic coding of open-ended text responses in large-scale assessments

被引:10
作者
Andersen, Nico [1 ]
Zehner, Fabian [1 ,2 ]
Goldhammer, Frank [1 ,2 ]
机构
[1] DIPF, Leibniz Inst Res & Informat Educ, Rostocker Str 6, D-60323 Frankfurt, Germany
[2] Ctr Int Student Assessment ZIB eV, Frankfurt, Germany
关键词
clustering; eco; effort reduction; exploring coding assistant; semi-automatic coding; support human raters; AGREEMENT; TRENDS;
D O I
10.1111/jcal.12717
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Background In the context of large-scale educational assessments, the effort required to code open-ended text responses is considerably more expensive and time-consuming than the evaluation of multiple-choice responses because it requires trained personnel and long manual coding sessions. Aim Our semi-supervised coding method eco (exploring coding assistant) dynamically supports human raters by automatically coding a subset of the responses. Method We map normalized response texts into a semantic space and cluster response vectors based on their semantic similarity. Assuming that similar codes represent semantically similar responses, we propagate codes to responses in optimally homogeneous clusters. Cluster homogeneity is assessed by strategically querying informative responses and presenting them to a human rater. Following each manual coding, the method estimates the code distribution respecting a certainty interval and assumes a homogeneous distribution if certainty exceeds a predefined threshold. If a cluster is determined to certainly comprise homogeneous responses, all remaining responses are coded accordingly automatically. We evaluated the method in a simulation using different data sets. Results With an average miscoding of about 3%, the method reduced the manual coding effort by an average of about 52%. Conclusion Combining the advantages of automatic and manual coding produces considerable coding accuracy and reduces the required manual effort.
引用
收藏
页码:841 / 854
页数:14
相关论文
共 59 条
[1]  
Aggarwal CC, 2012, Mining text data, P77
[2]  
Andersen N., 2021, Psych, V3, P422, DOI DOI 10.3390/PSYCH3030030
[3]   An Investigation of Students' Self-Regulated Learning Strategies: Students' Qualitative and Quantitative Accounts of Their Learning Strategies [J].
Anthony, Jared S. ;
Clayton, Karen E. ;
Zusho, Akane .
JOURNAL OF COGNITIVE EDUCATION AND PSYCHOLOGY, 2013, 12 (03) :359-373
[4]  
Basu Sumit, 2013, Transactions of the Association for Computational Linguistics, V1, P391, DOI DOI 10.1162/TACLA00236
[5]  
Bejar I.I., 2012, EDUC MEAS-ISSUES PRA, V31, P2, DOI [DOI 10.1111/J.1745-3992.2012.00238.X, 10.1111/j.1745-3992.2012.00238.x]
[6]  
Berliner D. C., 2020, HDB ED POLICY STUDIE, P239, DOI 10.1007/978-981-13-8343-413
[7]  
Bexte M, 2021, UNIMPLICIT 2021: THE FIRST WORKSHOP ON UNDERSTANDING IMPLICIT AND UNDERSPECIFIED LANGUAGE, P11
[8]   From text to codings - Intercoder reliability assessment in qualitative content analysis [J].
Burla, Laila ;
Knierim, Birte ;
Barth, Juergen ;
Liewald, Katharina ;
Duetz, Margreet ;
Abel, Thomas .
NURSING RESEARCH, 2008, 57 (02) :113-117
[9]   The Eras and Trends of Automatic Short Answer Grading [J].
Burrows, Steven ;
Gurevych, Iryna ;
Stein, Benno .
INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2015, 25 (01) :60-117
[10]  
Cai Z., 2019, ADV QUANTITATIVE ETH, P41, DOI [10.1007/978-3-030-33232-7_4, DOI 10.1007/978-3-030-33232-7_4]