Detecting ChatGPT-generated essays in a large-scale writing assessment: Is there a bias against non-native English speakers?

被引:16
作者
Jiang, Yang [1 ]
Hao, Jiangang [1 ]
Fauss, Michael [1 ]
Li, Chen [1 ]
机构
[1] ETS, 660 Rosedale Rd, Princeton, NJ 08541 USA
关键词
ChatGPT; Detection bias; Test security; Assessment; Education;
D O I
10.1016/j.compedu.2024.105070
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the prevalence of generative AI tools like ChatGPT, automated detectors of AI-generated texts have been increasingly used in education to detect the misuse of these tools (e.g., cheating in assessments). Recently, the responsible use of these detectors has attracted a lot of attention. Research has shown that publicly available detectors are more likely to misclassify essays written by non-native English speakers as AI-generated than those written by native English speakers. In this study, we address these concerns by leveraging carefully sampled large-scale data from the Graduate Record Examinations (GRE) writing assessment. We developed multiple detectors of ChatGPT-generated essays based on linguistic features from the ETS e-rater engine and text perplexity features, and investigated their performance and potential bias. Results showed that our carefully constructed detectors not only achieved near-perfect detection accuracy, but also showed no evidence of bias disadvantaging non-native English speakers. Findings of this study contribute to the ongoing debates surrounding the formulation of policies for utilizing AI-generated content detectors in education.
引用
收藏
页数:14
相关论文
共 42 条
[1]  
American Educational Research Association/American Psychological Association/National Council on Measurement in Education, 2014, Standards for educational and psychological testing
[2]  
[Anonymous], 2023, Reuters
[3]  
Attali Y., 2006, Journal of Technology, Learning, and Assessment, V4
[4]   Algorithmic Bias in Education [J].
Baker, Ryan S. ;
Hawn, Aaron .
INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2022, 32 (04) :1052-1092
[5]  
Bakhtin Anton, 2019, Real or fake? learning to discriminate machine from human generated text
[6]  
Bozic V., 2023, Chat GPT and education, DOI [10.13140/RG.2.2.18837.40168, DOI 10.13140/RG.2.2.18837.40168]
[7]  
Clark Elizabeth, 2021, P 59 ANN M ASS COMPU, V1, P7282, DOI [DOI 10.18653/V1/2021.ACL-LONG.565,HTTPS://ACLANTHOLOGY.ORG/2021.ACL-LONG.565, DOI 10.18653/V1/2021.ACL-LONG.565]
[8]   Chatting and cheating: Ensuring academic integrity in the era of ChatGPT [J].
Cotton, Debby R. E. ;
Cotton, Peter A. A. ;
Shipway, J. Reuben .
INNOVATIONS IN EDUCATION AND TEACHING INTERNATIONAL, 2024, 61 (02) :228-239
[9]  
Crothers E, 2023, Arxiv, DOI arXiv:2210.07321
[10]  
Eliot L., 2022, Forbes