Generative AI vs. instructor vs. peer assessments: a comparison of grading and feedback in higher education

Cited by: 2

Authors:

Usher, Maya [1,2]

Affiliations:

[1] Technion Israel Inst Technol, Fac Educ Sci & Technol, Haifa, Israel

[2] HIT Holon Inst Technol, Fac Instruct Technol, Holon, Israel

Source:

ASSESSMENT & EVALUATION IN HIGHER EDUCATION | 2025

Keywords:

Generative Artificial Intelligence (GenAI); higher education; peer assessment; chatbot-based assessment; METAANALYSIS; IMPACT;

DOI:

10.1080/02602938.2025.2487495

Chinese Library Classification:

G40 [Education];

Discipline classification codes:

040101 ; 120403 ;

Abstract:

The integration of Generative Artificial Intelligence (GenAI) in education has introduced innovative approaches to assessment. One such approach is AI chatbot-based assessment, which utilizes large language models to provide students with timely and consistent feedback. However, the effectiveness of AI chatbots in generating assessments comparable to human evaluators in educational contexts remains underexplored. This study compared the grades and feedback provided by AI chatbots, peers, and the course instructor for student projects in a higher education course. The participants were 76 undergraduate students who engaged in a group project involving three phases: questionnaire development, peer assessment, and chatbot-based assessment. Employing a mixed-methods approach, this study quantitatively compared project grades and qualitatively analyzed feedback quality. Results indicated that AI chatbots consistently assigned higher grades than human assessors, while peer and instructor grades were notably lower and closely aligned. Content analysis revealed that chatbots generally provided higher-quality feedback compared to peers, offering detailed insights and specific guidance for improvement, though they occasionally included irrelevant or contradictory information requiring student intervention. Conversely, peer feedback was more personalized and context-sensitive. These findings highlight the importance of human judgment, suggesting that integrating chatbot-based assessments with traditional methods can leverage their complementary strengths to enrich student learning.


Pages: 16

50 items in total

[31] An Investigation of a Joyful Peer Response System: High Ability vs. Low Ability [J].

Wang, Jen-Hang ;

Chen, Sherry Y. ;

Chan, Tak-Wai .

INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2016, 32 (06) :431-444

[32] Cost comparison of conservative vs. surgical treatment of chronic lymphedema [J].

Nuwayhid, Rima ;

Langer, Stefan ;

von Dercks, Nikolaus .

CHIRURGIE, 2025, 96 (01) :41-47

[33] Feedback and year level effects on university students' self-efficacy and emotions during self-assessment: positive impact of rubrics vs. instructor feedback [J].

Panadero, Ernesto ;

Garcia-Perez, Daniel ;

Ruiz, Javier Fernandez ;

Fraile, Juan ;

Sanchez-Iglesias, Ivan ;

Brown, Gavin T. L. .

EDUCATIONAL PSYCHOLOGY, 2023, 43 (07) :756-779

[34] Analogizing vs. simulating: how does associative learning boost AI adoption? [J].

Xiao, Wei ;

Men, Cheng-Hao ;

Li, Yi-Ling .

CURRENT PSYCHOLOGY, 2023, 42 (24) :20430-20442

[35] Hydrogen vs. Batteries: Comparative Safety Assessments for a High-Speed Passenger Ferry [J].

Mylonopoulos, Foivos ;

Boulougouris, Evangelos ;

Trivyza, Nikoletta L. ;

Priftis, Alexandros ;

Cheliotis, Michail ;

Wang, Haibin ;

Shi, Guangyu .

APPLIED SCIENCES-BASEL, 2022, 12 (06)

[36] WEB 2.0 AND HIGHER EDUCATION: ITS EDUCATIONAL USE IN THE UNIVERSITY ENVIRONMENT: CONTENT CREATOR VS. INFORMATION CONSUMER [J].

Santiago, Raul ;

Benavides, Otto ;

Navaridas, Fermin ;

Serrano, Manuel .

2013 IEEE 63RD ANNUAL CONFERENCE INTERNATIONAL COUNCIL FOR EDUCATIONAL MEDIA (ICEM), 2013

[37] Face-to-face vs. blended learning in higher education: a quantitative analysis of biological science student outcomes [J].

Harper, Claire V. ;

Mccormick, Lucy M. ;

Marron, Linda .

INTERNATIONAL JOURNAL OF EDUCATIONAL TECHNOLOGY IN HIGHER EDUCATION, 2024, 21 (01)

[38] Numerical Feedback Roundness Affects the Choice of the Self vs. Others as a Reference Point [J].

Shoham, Meyrav ;

Munichor, Nira .

FRONTIERS IN PSYCHOLOGY, 2021, 12

[39] Face-to-face vs. blended learning in higher education: a quantitative analysis of biological science student outcomes [J].

Claire V. Harper ;

Lucy M. McCormick ;

Linda Marron .

International Journal of Educational Technology in Higher Education, 21

[40] AI vs. Human Voices: How Delivery Source and Narrative Format Influence the Effectiveness of Persuasion Messages [J].

Dai, Yue ;

Lee, Jiyoung ;

Kim, Ji Won .

INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2024, 40 (24) :8735-8749
