ChatGPT-4 in the Turing Test

被引:0
作者
Echavarria, Ricardo Restrepo [1 ]
机构
[1] Univ Tecn Manabi, Dept Ciencias Sociales & Comportamiento, Portoviejo, Ecuador
关键词
Turing test; ChatGPT; Artificial intelligence; Science; Thinking; Intelligence; COMPUTERS;
D O I
10.1007/s11023-025-09711-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There has been considerable optimistic speculation on how well ChatGPT-4 would perform in a Turing Test. However, no minimally serious implementation of the test has been reported to have been carried out. This brief note documents the results of subjecting ChatGPT-4 to 10 Turing Tests, with different interrogators and participants. The outcome is tremendously disappointing for the optimists. Despite ChatGPT reportedly outperforming 99.9% of humans in a Verbal IQ test, it falls short of passing the Turing Test. In 9 out of the 10 tests conducted, the interrogators successfully identified ChatGPT-4 and the human participant. The probability of obtaining this result from a process in which the interrogator is really no better than chance at correct identification is calculated to be less than 1%. An additional question was posed to the interrogators at the end of each test: What led them to distinguish between the human and the machine? The interrogators, who effectively filtered out ChatGPT-4 from passing the Turing Test for intelligence, stated that they could identify the machine because it, in effect, responded more intelligently than the human. Subsequently, ChatGPT-4 was tasked with differentiating syntax from semantics and self-corrected when falling for the fallacy of equivocation. The curious situation is arrived at that passing the Turing Test for intelligence remains a challenge that ChatGPT-4 has yet to overcome, precisely because, as per the interrogators, its intellectual abilities surpass those of individual humans.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Enhancing infectious disease response: A demonstrative dialogue with ChatGPT and ChatGPT-4 for future outbreak preparedness
    Al-Tawfi, Jaffar A.
    Jamal, Amr
    -Morales, Alfonso J. Rodriguez
    Temsah, Mohamad-Hani
    NEW MICROBES AND NEW INFECTIONS, 2023, 53
  • [32] Evaluating the strengths and limitations of multimodal ChatGPT-4 in detecting glaucoma using fundus images
    Alryalat, Saif Aldeen
    Musleh, Ayman Mohammed
    Kahook, Malik Y.
    FRONTIERS IN OPHTHALMOLOGY, 2024, 4
  • [33] Evaluating chatGPT-4 and chatGPT-4o: performance insights from NAEP mathematics problem solving
    Wei, Xin
    FRONTIERS IN EDUCATION, 2024, 9
  • [34] Prostatakarzinomforschung verständlich gemacht: ChatGPT-4 als Werkzeug zur Verbesserung der LaienkommunikationMaking prostate cancer research accessible: chatGPT-4 as a tool to enhance lay communication
    Maximilian Haas
    Veronika Saberi
    Christopher Gossler
    Anna Schmelzer
    Anton Kravchuk
    Johannes Breyer
    Johannes Bründl
    Simon Engelmann
    Clemens Kirschner
    Christian Gilfrich
    Maximilian Burger
    Dominik von Winning
    Christian Wülfing
    Hendrik Borgmann
    Severin Rodler
    Axel S. Merseburger
    Emily Rinderknecht
    Matthias May
    Die Urologie, 2025, 64 (6) : 574 - 583
  • [35] Prompting Theory into Practice: Utilizing ChatGPT-4 in a Curriculum Planning Course
    Biberman-Shalev, Liat
    EDUCATION SCIENCES, 2025, 15 (02):
  • [36] Performance and Consistency of ChatGPT-4 Versus Otolaryngologists: A Clinical Case Series
    Lechien, Jerome R.
    Naunheim, Mattheuw R.
    Maniaci, Antonino
    Radulesco, Thomas
    Saibene, Alberto M.
    Chiesa-Estomba, Carlos M.
    Vaira, Luigi A.
    OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2024, 170 (06) : 1519 - 1526
  • [37] A Study on the Efficacy of ChatGPT-4 in Enhancing Students' English Communication Skills
    Wang, Ying
    SAGE OPEN, 2025, 15 (01):
  • [38] In Defence of a Reciprocal Turing Test
    Fintan Mallory
    Minds and Machines, 2020, 30 : 659 - 680
  • [39] Evaluation of the Appropriateness and Readability of ChatGPT-4 Responses to Patient Queries on Uveitis
    Mohammadi, S. Saeed
    Khatri, Anadi
    Jain, Tanya
    Thng, Zheng Xian
    Yoo, Woong-sun
    Yavari, Negin
    Bazojoo, Vahid
    Mobasserian, Azadeh
    Akhavanrezayat, Amir
    Than, Ngoc Trong Tuong
    Elaraby, Osama
    Ganbold, Battuya
    El Feley, Dalia
    Nguyen, Trung
    Yasar, Cigdem
    Gupta, Ankur
    Hung, Jia-Horung
    Nguyen, Quan Dong
    OPHTHALMOLOGY SCIENCE, 2025, 5 (01):
  • [40] Comparative evaluation of ChatGPT-4, ChatGPT-3.5 and Google Gemini on PCOS assessment and management based on recommendations from the 2023 guideline
    Gunesli, Irmak
    Aksun, Seren
    Fathelbab, Jana
    Yildiz, Bulent Okan
    ENDOCRINE, 2024, : 315 - 322