ChatGPT-4 in the Turing Test

被引：0

作者：

Echavarria, Ricardo Restrepo ^{[1
]}

机构：

[1] Univ Tecn Manabi, Dept Ciencias Sociales & Comportamiento, Portoviejo, Ecuador

来源：

MINDS AND MACHINES | 2025年 / 35卷 / 01期

关键词：

Turing test; ChatGPT; Artificial intelligence; Science; Thinking; Intelligence; COMPUTERS;

D O I：

10.1007/s11023-025-09711-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

There has been considerable optimistic speculation on how well ChatGPT-4 would perform in a Turing Test. However, no minimally serious implementation of the test has been reported to have been carried out. This brief note documents the results of subjecting ChatGPT-4 to 10 Turing Tests, with different interrogators and participants. The outcome is tremendously disappointing for the optimists. Despite ChatGPT reportedly outperforming 99.9% of humans in a Verbal IQ test, it falls short of passing the Turing Test. In 9 out of the 10 tests conducted, the interrogators successfully identified ChatGPT-4 and the human participant. The probability of obtaining this result from a process in which the interrogator is really no better than chance at correct identification is calculated to be less than 1%. An additional question was posed to the interrogators at the end of each test: What led them to distinguish between the human and the machine? The interrogators, who effectively filtered out ChatGPT-4 from passing the Turing Test for intelligence, stated that they could identify the machine because it, in effect, responded more intelligently than the human. Subsequently, ChatGPT-4 was tasked with differentiating syntax from semantics and self-corrected when falling for the fallacy of equivocation. The curious situation is arrived at that passing the Turing Test for intelligence remains a challenge that ChatGPT-4 has yet to overcome, precisely because, as per the interrogators, its intellectual abilities surpass those of individual humans.

引用

页数：10

共 50 条

[31] Enhancing infectious disease response: A demonstrative dialogue with ChatGPT and ChatGPT-4 for future outbreak preparedness
Al-Tawfi, Jaffar A.
Jamal, Amr
-Morales, Alfonso J. Rodriguez
Temsah, Mohamad-Hani
NEW MICROBES AND NEW INFECTIONS, 2023, 53
[32] Evaluating the strengths and limitations of multimodal ChatGPT-4 in detecting glaucoma using fundus images
Alryalat, Saif Aldeen
Musleh, Ayman Mohammed
Kahook, Malik Y.
FRONTIERS IN OPHTHALMOLOGY, 2024, 4
[33] Evaluating chatGPT-4 and chatGPT-4o: performance insights from NAEP mathematics problem solving
Wei, Xin
FRONTIERS IN EDUCATION, 2024, 9
[34] Prostatakarzinomforschung verständlich gemacht: ChatGPT-4 als Werkzeug zur Verbesserung der LaienkommunikationMaking prostate cancer research accessible: chatGPT-4 as a tool to enhance lay communication
Maximilian Haas
Veronika Saberi
Christopher Gossler
Anna Schmelzer
Anton Kravchuk
Johannes Breyer
Johannes Bründl
Simon Engelmann
Clemens Kirschner
Christian Gilfrich
Maximilian Burger
Dominik von Winning
Christian Wülfing
Hendrik Borgmann
Severin Rodler
Axel S. Merseburger
Emily Rinderknecht
Matthias May
Die Urologie, 2025, 64 (6) : 574 - 583
[35] Prompting Theory into Practice: Utilizing ChatGPT-4 in a Curriculum Planning Course
Biberman-Shalev, Liat
EDUCATION SCIENCES, 2025, 15 (02):
[36] Performance and Consistency of ChatGPT-4 Versus Otolaryngologists: A Clinical Case Series
Lechien, Jerome R.
Naunheim, Mattheuw R.
Maniaci, Antonino
Radulesco, Thomas
Saibene, Alberto M.
Chiesa-Estomba, Carlos M.
Vaira, Luigi A.
OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2024, 170 (06) : 1519 - 1526
[37] A Study on the Efficacy of ChatGPT-4 in Enhancing Students' English Communication Skills
Wang, Ying
SAGE OPEN, 2025, 15 (01):
[38] In Defence of a Reciprocal Turing Test
Fintan Mallory
Minds and Machines, 2020, 30 : 659 - 680
[39] Evaluation of the Appropriateness and Readability of ChatGPT-4 Responses to Patient Queries on Uveitis
Mohammadi, S. Saeed
Khatri, Anadi
Jain, Tanya
Thng, Zheng Xian
Yoo, Woong-sun
Yavari, Negin
Bazojoo, Vahid
Mobasserian, Azadeh
Akhavanrezayat, Amir
Than, Ngoc Trong Tuong
Elaraby, Osama
Ganbold, Battuya
El Feley, Dalia
Nguyen, Trung
Yasar, Cigdem
Gupta, Ankur
Hung, Jia-Horung
Nguyen, Quan Dong
OPHTHALMOLOGY SCIENCE, 2025, 5 (01):
[40] Comparative evaluation of ChatGPT-4, ChatGPT-3.5 and Google Gemini on PCOS assessment and management based on recommendations from the 2023 guideline
Gunesli, Irmak
Aksun, Seren
Fathelbab, Jana
Yildiz, Bulent Okan
ENDOCRINE, 2024, : 315 - 322

← 1 2 3 4 5 →