ChatGPT-4 in the Turing Test

被引:0
作者
Echavarria, Ricardo Restrepo [1 ]
机构
[1] Univ Tecn Manabi, Dept Ciencias Sociales & Comportamiento, Portoviejo, Ecuador
关键词
Turing test; ChatGPT; Artificial intelligence; Science; Thinking; Intelligence; COMPUTERS;
D O I
10.1007/s11023-025-09711-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There has been considerable optimistic speculation on how well ChatGPT-4 would perform in a Turing Test. However, no minimally serious implementation of the test has been reported to have been carried out. This brief note documents the results of subjecting ChatGPT-4 to 10 Turing Tests, with different interrogators and participants. The outcome is tremendously disappointing for the optimists. Despite ChatGPT reportedly outperforming 99.9% of humans in a Verbal IQ test, it falls short of passing the Turing Test. In 9 out of the 10 tests conducted, the interrogators successfully identified ChatGPT-4 and the human participant. The probability of obtaining this result from a process in which the interrogator is really no better than chance at correct identification is calculated to be less than 1%. An additional question was posed to the interrogators at the end of each test: What led them to distinguish between the human and the machine? The interrogators, who effectively filtered out ChatGPT-4 from passing the Turing Test for intelligence, stated that they could identify the machine because it, in effect, responded more intelligently than the human. Subsequently, ChatGPT-4 was tasked with differentiating syntax from semantics and self-corrected when falling for the fallacy of equivocation. The curious situation is arrived at that passing the Turing Test for intelligence remains a challenge that ChatGPT-4 has yet to overcome, precisely because, as per the interrogators, its intellectual abilities surpass those of individual humans.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Comparing the performance of ChatGPT-3.5-Turbo, ChatGPT-4, and Google Bard with Iranian students in pre-internship comprehensive exams
    Zare, Soolmaz
    Vafaeian, Soheil
    Amini, Mitra
    Farhadi, Keyvan
    Vali, Mohammadreza
    Golestani, Ali
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [42] In Defence of a Reciprocal Turing Test
    Mallory, Fintan
    MINDS AND MACHINES, 2020, 30 (04) : 659 - 680
  • [43] Generative AI in education: ChatGPT-4 in evaluating students' written responses
    Jauhiainen, Jussi S.
    Garagorry Guerra, Agustin
    INNOVATIONS IN EDUCATION AND TEACHING INTERNATIONAL, 2024,
  • [44] Assessing the accuracy of ChatGPT in interpreting blood gas analysis results ChatGPT-4 in blood gas analysis
    Turan, Engin Ihsan
    Baydemir, Abdurrahman Engin
    Balitatli, Anil Berkay
    Sahin, Ayca Sultan
    JOURNAL OF CLINICAL ANESTHESIA, 2025, 102
  • [45] The Turing Test*
    B. Jack Copeland
    Minds and Machines, 2000, 10 : 519 - 539
  • [46] ChatGPT-4: Transforming Medical Education and Addressing Clinical Exposure Challenges in the Post-pandemic Era
    Lower, Kirk
    Seth, Ishith
    Lim, Bryan
    Seth, Nimish
    INDIAN JOURNAL OF ORTHOPAEDICS, 2023, 57 (09) : 1527 - 1544
  • [47] Bariatric Evaluation Through AI: a Survey of Expert Opinions Versus ChatGPT-4 (BETA-SEOV)
    Jazi, Amir Hossein Davarpanah
    Mahjoubi, Mohammad
    Shahabi, Shahab
    Alqahtani, Aayed R.
    Haddad, Ashraf
    Pazouki, Abdolreza
    Prasad, Arun
    Safadi, Bassem Y.
    Chiappetta, Sonja
    Taskin, Halit Eren
    Billy, Helmuth Thorlakur
    Kasama, Kazunori
    Mahawar, Kamal
    Gawdat, Khaled
    Rheinwalt, Karl Peter
    Miller, Karl A.
    Kow, Lilian
    Neto, Manoel Galvao
    Yang, Wah
    Palermo, Mariano
    Ghanem, Omar M.
    Lainas, Panagiotis
    Peterli, Ralph
    Kassir, Radwan
    Puy, Ramon Vilallonga
    Ribeiro, Rui Jose Da Silva
    Verboonen, Sergio
    Pintar, Tadeja
    Shabbir, Asim
    Musella, Mario
    Kermansaravi, Mohammad
    OBESITY SURGERY, 2023, 33 (12) : 3971 - 3980
  • [48] Large Language Models for Intraoperative Decision Support in Plastic Surgery: A Comparison between ChatGPT-4 and Gemini
    Gomez-Cabello, Cesar A.
    Borna, Sahar
    Pressman, Sophia M.
    Haider, Syed Ali
    Forte, Antonio J.
    MEDICINA-LITHUANIA, 2024, 60 (06):
  • [49] Bariatric Evaluation Through AI: a Survey of Expert Opinions Versus ChatGPT-4 (BETA-SEOV)
    Amir Hossein Davarpanah Jazi
    Mohammad Mahjoubi
    Shahab Shahabi
    Aayed R. Alqahtani
    Ashraf Haddad
    Abdolreza Pazouki
    Arun Prasad
    Bassem Y. Safadi
    Sonja Chiappetta
    Halit Eren Taskin
    Helmuth Thorlakur Billy
    Kazunori Kasama
    Kamal Mahawar
    Khaled Gawdat
    Karl Peter Rheinwalt
    Karl A. Miller
    Lilian Kow
    Manoel Galvao Neto
    Wah Yang
    Mariano Palermo
    Omar M. Ghanem
    Panagiotis Lainas
    Ralph Peterli
    Radwan Kassir
    Ramon Vilallonga Puy
    Rui José Da Silva Ribeiro
    Sergio Verboonen
    Tadeja Pintar
    Asim Shabbir
    Mario Musella
    Mohammad Kermansaravi
    Obesity Surgery, 2023, 33 : 3971 - 3980
  • [50] Is ChatGPT-4 Accurate in Proofread a Manuscript in Otolaryngology-Head and Neck Surgery?
    Lechien, Jerome R.
    Gorton, Amy
    Robertson, Jean
    Vaira, Luigi A.
    OTOLARYNGOLOGY-HEAD AND NECK SURGERY, 2024, 170 (06) : 1527 - 1530