Classification of human- and AI-generated texts for different languages and domains

被引:0
|
作者
Kristina Schaaff [1 ]
Tim Schlippe [1 ]
Lorenz Mindner [1 ]
机构
[1] IU International University of Applied Sciences,
关键词
Generative AI; ChatGPT; Natural language processing; Features; Prompting; Artificial intelligence; Text classification;
D O I
10.1007/s10772-024-10143-3
中图分类号
学科分类号
摘要
Chatbots based on large language models (LLMs) like ChatGPT are available to the wide public. These tools can for instance be used by students to generate essays or whole theses from scratch or by rephrasing an existing text. But how does for instance a teacher know whether a text is written by a student or an AI? In this paper, we investigate perplexity, semantic, list lookup, document, error-based, readability, AI feedback and text vector features to classify human-generated and AI-generated texts from the educational domain as well as news articles. We analyze two scenarios: (1) The detection of text generated by AI from scratch, and (2) the detection of text rephrased by AI. Since we assumed that classification is more difficult when the AI has been prompted to create or rephrase the text in a way that a human would not recognize that it was generated or rephrased by an AI, we also investigate this advanced prompting scenario. To train, fine-tune and test the classifiers, we created the Multilingual Human-AI-Generated Text Corpus which contains human-generated, AI-generated and AI-rephrased texts from the educational domain in English, French, German, and Spanish and English texts from the news domain. We demonstrate that the same features can be used for the detection of AI-generated and AI-rephrased texts from the educational domain in all languages and the detection of AI-generated and AI-rephrased news texts. Our best systems significantly outperform GPTZero and ZeroGPT—state-of-the-art systems for the detection of AI-generated text. Our best text rephrasing detection system even outperforms GPTZero by 181.3% relative in F1-score.
引用
收藏
页码:935 / 956
页数:21
相关论文
共 39 条
  • [31] Video Lectures With AI-Generated Instructors: Low Video Engagement, Same Performance as Human Instructors
    Arkun-Kocadere, Selay
    Caglar-Ozhan, Seyma
    INTERNATIONAL REVIEW OF RESEARCH IN OPEN AND DISTRIBUTED LEARNING, 2024, 25 (03): : 350 - 369
  • [32] A Comparison of Human-Written Versus AI-Generated Text in Discussions at Educational Settings: Investigating Features for ChatGPT, Gemini and BingAI
    Durak, Hatice Yildiz
    Egin, Figen
    Onan, Aytug
    EUROPEAN JOURNAL OF EDUCATION, 2025, 60 (01)
  • [33] People are skeptical of headlines labeled as AI-generated, even if true or human-made, because they assume full AI automation
    Altay, Sacha
    Gilardi, Fabrizio
    PNAS NEXUS, 2024, 3 (10):
  • [34] Exploring the boundaries of authorship: a comparative analysis of AI-generated text and human academic writing in English literature
    Amirjalili, Forough
    Neysani, Masoud
    Nikbakht, Ahmadreza
    FRONTIERS IN EDUCATION, 2024, 9
  • [35] Comparative Analysis of Learnersourced Human-Graded and AI-Generated Responses for Autograding Online Tutor Lessons
    Thomas, Danielle R.
    Gupta, Shivang
    Koedinger, Kenneth R.
    ARTIFICIAL INTELLIGENCE IN EDUCATION. POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2023, 2023, 1831 : 714 - 719
  • [36] Consumers' Emotional Responses to AI-Generated Versus Human-Generated Content: The Role of Perceived Agency, Affect and Gaze in Health Marketing
    Diwanji, Vaibhav Shwetangbhai
    Geana, Mugur
    Pei, Jun
    Nguyen, Nhung
    Izhar, Nazra
    Chaif, Rim H.
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2025,
  • [37] Comparison of an AI-Generated Case Report With a Human-Written Case Report: Practical Considerations for AI-Assisted Medical Writing
    Pinto, Denver S.
    Noronha, Sharon M.
    Saigal, Gaurav
    Quencer, Robert M.
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (05)
  • [38] Assessing AI-generated (GPT-4) Versus Human Created MCQs in Mathematics Education: A Comparative Inquiry into Vector Topics
    Kuusemets, Laura
    Parve, Kristin
    Ain, Kati
    Kraav, Tiina
    INTERNATIONAL JOURNAL OF EDUCATION IN MATHEMATICS SCIENCE AND TECHNOLOGY, 2024, 12 (06):
  • [39] Evaluating AI-Generated informed consent documents in oral surgery: A comparative study of ChatGPT-4, Bard gemini advanced, and human-written consents
    Vaira, Luigi Angelo
    Lechien, Jerome R.
    Maniaci, Antonino
    Tanda, Giuseppe
    Abbate, Vincenzo
    Allevi, Fabiana
    Arena, Antonio
    Beltramini, Giada Anna
    Bergonzani, Michela
    Bolzoni, Alessandro Remigio
    Crimi, Salvatore
    Frosolini, Andrea
    Gabriele, Guido
    Maglitto, Fabio
    Mayo-Yanez, Miguel
    Orru, Ludovica
    Petrocelli, Marzia
    Pucci, Resi
    Saibene, Alberto Maria
    Troise, Stefania
    Tel, Alessandro
    Vellone, Valentino
    Chiesa-Estomba, Carlos Miguel
    Boscolo-Rizzo, Paolo
    Salzano, Giovanni
    De Riu, Giacomo
    JOURNAL OF CRANIO-MAXILLOFACIAL SURGERY, 2025, 53 (01) : 18 - 23