ChatGPT-4 in the Turing Test

被引：0

作者：

Echavarria, Ricardo Restrepo ^{[1
]}

机构：

[1] Univ Tecn Manabi, Dept Ciencias Sociales & Comportamiento, Portoviejo, Ecuador

来源：

MINDS AND MACHINES | 2025年 / 35卷 / 01期

关键词：

Turing test; ChatGPT; Artificial intelligence; Science; Thinking; Intelligence; COMPUTERS;

D O I：

10.1007/s11023-025-09711-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

There has been considerable optimistic speculation on how well ChatGPT-4 would perform in a Turing Test. However, no minimally serious implementation of the test has been reported to have been carried out. This brief note documents the results of subjecting ChatGPT-4 to 10 Turing Tests, with different interrogators and participants. The outcome is tremendously disappointing for the optimists. Despite ChatGPT reportedly outperforming 99.9% of humans in a Verbal IQ test, it falls short of passing the Turing Test. In 9 out of the 10 tests conducted, the interrogators successfully identified ChatGPT-4 and the human participant. The probability of obtaining this result from a process in which the interrogator is really no better than chance at correct identification is calculated to be less than 1%. An additional question was posed to the interrogators at the end of each test: What led them to distinguish between the human and the machine? The interrogators, who effectively filtered out ChatGPT-4 from passing the Turing Test for intelligence, stated that they could identify the machine because it, in effect, responded more intelligently than the human. Subsequently, ChatGPT-4 was tasked with differentiating syntax from semantics and self-corrected when falling for the fallacy of equivocation. The curious situation is arrived at that passing the Turing Test for intelligence remains a challenge that ChatGPT-4 has yet to overcome, precisely because, as per the interrogators, its intellectual abilities surpass those of individual humans.

引用

页数：10

共 50 条

[41] Comparing the performance of ChatGPT-3.5-Turbo, ChatGPT-4, and Google Bard with Iranian students in pre-internship comprehensive exams
Zare, Soolmaz
Vafaeian, Soheil
Amini, Mitra
Farhadi, Keyvan
Vali, Mohammadreza
Golestani, Ali
SCIENTIFIC REPORTS, 2024, 14 (01):
[42] In Defence of a Reciprocal Turing Test
Mallory, Fintan
MINDS AND MACHINES, 2020, 30 (04) : 659 - 680
[43] Generative AI in education: ChatGPT-4 in evaluating students' written responses
Jauhiainen, Jussi S.
Garagorry Guerra, Agustin
INNOVATIONS IN EDUCATION AND TEACHING INTERNATIONAL, 2024,
[44] Assessing the accuracy of ChatGPT in interpreting blood gas analysis results ChatGPT-4 in blood gas analysis
Turan, Engin Ihsan
Baydemir, Abdurrahman Engin
Balitatli, Anil Berkay
Sahin, Ayca Sultan
JOURNAL OF CLINICAL ANESTHESIA, 2025, 102
[45] The Turing Test*
B. Jack Copeland
Minds and Machines, 2000, 10 : 519 - 539
[46] ChatGPT-4: Transforming Medical Education and Addressing Clinical Exposure Challenges in the Post-pandemic Era
Lower, Kirk
Seth, Ishith
Lim, Bryan
Seth, Nimish
INDIAN JOURNAL OF ORTHOPAEDICS, 2023, 57 (09) : 1527 - 1544
[47] Bariatric Evaluation Through AI: a Survey of Expert Opinions Versus ChatGPT-4 (BETA-SEOV)
Jazi, Amir Hossein Davarpanah
Mahjoubi, Mohammad
Shahabi, Shahab
Alqahtani, Aayed R.
Haddad, Ashraf
Pazouki, Abdolreza
Prasad, Arun
Safadi, Bassem Y.
Chiappetta, Sonja
Taskin, Halit Eren
Billy, Helmuth Thorlakur
Kasama, Kazunori
Mahawar, Kamal
Gawdat, Khaled
Rheinwalt, Karl Peter
Miller, Karl A.
Kow, Lilian
Neto, Manoel Galvao
Yang, Wah
Palermo, Mariano
Ghanem, Omar M.
Lainas, Panagiotis
Peterli, Ralph
Kassir, Radwan
Puy, Ramon Vilallonga
Ribeiro, Rui Jose Da Silva
Verboonen, Sergio
Pintar, Tadeja
Shabbir, Asim
Musella, Mario
Kermansaravi, Mohammad
OBESITY SURGERY, 2023, 33 (12) : 3971 - 3980
[48] Large Language Models for Intraoperative Decision Support in Plastic Surgery: A Comparison between ChatGPT-4 and Gemini
Gomez-Cabello, Cesar A.
Borna, Sahar
Pressman, Sophia M.
Haider, Syed Ali
Forte, Antonio J.
MEDICINA-LITHUANIA, 2024, 60 (06):
[49] Bariatric Evaluation Through AI: a Survey of Expert Opinions Versus ChatGPT-4 (BETA-SEOV)
Amir Hossein Davarpanah Jazi
Mohammad Mahjoubi
Shahab Shahabi
Aayed R. Alqahtani
Ashraf Haddad
Abdolreza Pazouki
Arun Prasad
Bassem Y. Safadi
Sonja Chiappetta
Halit Eren Taskin
Helmuth Thorlakur Billy
Kazunori Kasama
Kamal Mahawar
Khaled Gawdat
Karl Peter Rheinwalt
Karl A. Miller
Lilian Kow
Manoel Galvao Neto
Wah Yang
Mariano Palermo
Omar M. Ghanem
Panagiotis Lainas
Ralph Peterli
Radwan Kassir
Ramon Vilallonga Puy
Rui José Da Silva Ribeiro
Sergio Verboonen
Tadeja Pintar
Asim Shabbir
Mario Musella
Mohammad Kermansaravi
Obesity Surgery, 2023, 33 : 3971 - 3980
[50] Is ChatGPT-4 Accurate in Proofread a Manuscript in Otolaryngology-Head and Neck Surgery?
Lechien, Jerome R.
Gorton, Amy
Robertson, Jean
Vaira, Luigi A.
OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2024, 170 (06) : 1527 - 1530

← 1 2 3 4 5 →