From the Automated Assessment of Student Essay Content to Highly Informative Feedback: a Case Study

被引：17

作者：

Gombert, Sebastian ^{[1
]}

Fink, Aron ^{[2
]}

Giorgashvili, Tornike ^{[2
]}

Jivet, Ioana ^{[1
,2
]}

Di Mitri, Daniele ^{[1
]}

Yau, Jane ^{[1
]}

Frey, Andreas ^{[2
]}

Drachsler, Hendrik ^{[1
,2
,3
]}

机构：

[1] DIPF Leibniz Inst Res & Informat Educ, Frankfurt, Germany

[2] Goethe Univ, Frankfurt, Germany

[3] Open Univ Netherlands, Heerlen, Netherlands

来源：

INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION | 2024年 / 34卷 / 04期

关键词：

Automated essay scoring (AES); Content scoring; Writing assessment; Automated feedback; Analytic scoring; MOTIVATION;

D O I：

10.1007/s40593-023-00387-6

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Various studies empirically proved the value of highly informative feedback for enhancing learner success. However, digital educational technology has yet to catch up as automated feedback is often provided shallowly. This paper presents a case study on implementing a pipeline that provides German-speaking university students enrolled in an introductory-level educational psychology lecture with content-specific feedback for a lecture assignment. In the assignment, students have to discuss the usefulness and educational grounding (i.e., connection to working memory, metacognition or motivation) of ten learning tips presented in a video within essays. Through our system, students received feedback on the correctness of their solutions and content areas they needed to improve. For this purpose, we implemented a natural language processing pipeline with two steps: (1) segmenting the essays and (2) predicting codes from the resulting segments used to generate feedback texts. As training data for the model in each processing step, we used 689 manually labelled essays submitted by the previous student cohort. We then evaluated approaches based on GBERT, T5, and bag-of-words baselines for scoring them. Both pipeline steps, especially the transformer-based models, demonstrated high performance. In the final step, we evaluated the feedback using a randomised controlled trial. The control group received feedback as usual (essential feedback), while the treatment group received highly informative feedback based on the natural language processing pipeline. We then used a six items long survey to test the perception of feedback. We conducted an ordinary least squares analysis to model these items as dependent variables, which showed that highly informative feedback had positive effects on helpfulness and reflection.

引用

页码：1378 / 1416

页数：39

共 99 条

[1] Connecting the dots - A literature review on learning analytics indicators from a learning design perspective [J].