Assessing the accuracy of ChatGPT references in head and neck and ENT disciplines

被引：28

作者：

Frosolini, Andrea ^{[1
]}

Franz, Leonardo ^{[2
,3
]}

Benedetti, Simone ^{[1
]}

Vaira, Luigi Angelo ^{[4
,5
]}

de Filippis, Cosimo ^{[2
]}

Gennaro, Paolo ^{[1
]}

Marioni, Gino ^{[2
]}

Gabriele, Guido ^{[1
]}

机构：

[1] Univ Siena, Dept Maxillofacial Surg, Policlin Le Scotte, Siena, Italy

[2] Univ Padua, Dept Neurosci DNS, Phoniatris & Audiol Unit, Treviso, Italy

[3] Univ Brescia, Dept Clin & Expt Sci, Artificial Intelligence Med & Innovat Clin Res & M, Brescia, Italy

[4] Univ Sassari, Dept Med Surg & Pharm, Maxillofacial Surg Operat Unit, Sassari, Italy

[5] Univ Sassari, PhD Sch Biomed Sci, Dept Biomed Sci, Sassari, Italy

来源：

EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY | 2023年 / 280卷 / 11期

关键词：

Head and neck surgery; Maxillofacial; AI; Chat-GPT; Artificial intelligence;

D O I：

10.1007/s00405-023-08205-4

中图分类号：

R76 [耳鼻咽喉科学];

学科分类号：

100213 ;

摘要：

PurposeChatGPT has gained popularity as a web application since its release in 2022. While artificial intelligence (AI) systems' potential in scientific writing is widely discussed, their reliability in reviewing literature and providing accurate references remains unexplored. This study examines the reliability of references generated by ChatGPT language models in the Head and Neck field.MethodsTwenty clinical questions were generated across different Head and Neck disciplines, to prompt ChatGPT versions 3.5 and 4.0 to produce texts on the assigned topics. The generated references were categorized as "true," "erroneous," or "inexistent" based on congruence with existing records in scientific databases.ResultsChatGPT 4.0 outperformed version 3.5 in terms of reference reliability. However, both versions displayed a tendency to provide erroneous/non-existent references.ConclusionsIt is crucial to address this challenge to maintain the reliability of scientific literature. Journals and institutions should establish strategies and good-practice principles in the evolving landscape of AI-assisted scientific writing.

引用

页码：5129 / 5133

页数：5

共 50 条

[21] Assessing the Accuracy of Responses by the Language Model ChatGPT to Questions Regarding Bariatric Surgery
Samaan, Jamil S.
Yeo, Yee Hui
Rajeev, Nithya
Hawley, Lauren
Abel, Stuart
Ng, Wee Han
Srinivasan, Nitin
Park, Justin
Burch, Miguel
Watson, Rabindra
Liran, Omer
Samakar, Kamran
OBESITY SURGERY, 2023, 33 (06) : 1790 - 1796
[22] Assessing the accuracy and utility of ChatGPT responses to patient questions regarding posterior lumbar decompression
Giakas, Alec M.
Narayanan, Rajkishen
Ezeonu, Teeto
Dalton, Jonathan
Lee, Yunsoo
Henry, Tyler
Mangan, John
Schroeder, Gregory
Vaccaro, Alexander
Kepler, Christopher
ARTIFICIAL INTELLIGENCE SURGERY, 2024, 4 (03): : 233 - 246
[23] Assessing the Accuracy of Information on Medication Abortion: A Comparative Analysis of ChatGPT and Google Bard AI
Mediboina, Anjali
Badam, Rajani Kumari
Chodavarapu, Sailaja
CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (01)
[24] REASONS FOR CANCELLATION OF ENT, HEAD AND NECK SURGERIES IN A NIGERIAN TEACHING HOSPITAL
Adobamen, Paul Oserhemhen
Imarengiaye, Charles
GOMAL JOURNAL OF MEDICAL SCIENCES, 2012, 10 (02): : 190 - 193
[25] Assessing ChatGPT's Potential in HIV Prevention Communication: A Comprehensive Evaluation of Accuracy, Completeness, and Inclusivity
De Vito, Andrea
Colpani, Agnese
Moi, Giulia
Babudieri, Sergio
Calcagno, Andrea
Calvino, Valeria
Ceccarelli, Manuela
Colpani, Gianmaria
d'Ettorre, Gabriella
Di Biagio, Antonio
Farinella, Massimo
Falaguasta, Marco
Foca, Emanuele
Giupponi, Giusi
Habed, Adriano Jose
Isenia, Wigbertson Julian
Lo Caputo, Sergio
Marchetti, Giulia
Modesti, Luca
Mussini, Cristina
Nunnari, Giuseppe
Rusconi, Stefano
Russo, Daria
Saracino, Annalisa
Serra, Pier Andrea
Madeddu, Giordano
AIDS AND BEHAVIOR, 2024, : 2746 - 2754
[26] Assessing the role of advanced artificial intelligence as a tool in multidisciplinary tumor board decision-making for recurrent/metastatic head and neck cancer cases - the first study on ChatGPT 4o and a comparison to ChatGPT 4.0
Schmidl, Benedikt
Huetten, Tobias
Pigorsch, Steffi
Stoegbauer, Fabian
Hoch, Cosima C.
Hussain, Timon
Wollenberg, Barbara
Wirth, Markus
FRONTIERS IN ONCOLOGY, 2024, 14
[27] Assessing the accuracy and explainability of using ChatGPT to evaluate the quality of health news
Xiaoyu Liu
Lu He
Eman Alanazi
Echu Liu
Arianna Goss
Lionel Gumireddy
BMC Public Health, 25 (1)
[28] ENT medicine and head and neck surgery in the G-DRG system 2008
Franz, D.
Roeder, N.
Hoermann, K.
Alberty, J.
HNO, 2008, 56 (09) : 874 - 880
[29] Let's chat about cervical cancer: Assessing the accuracy of ChatGPT responses to cervical cancer questions
Hermann, Catherine E.
Patel, Jharna M.
Boyd, Leslie
Aviki, Emeline
Stasenko, Marina
GYNECOLOGIC ONCOLOGY, 2023, 179 : 164 - 168
[30] Assessing the accuracy and reproducibility of artificial intelligence-generated medical responses by ChatGPT on Scheuermann's kyphosis
Giray, Esra
Illeez, Ozge Gulsum
Korkmaz, Merve Damla
Capan, Nalan
Saygi, Evrim Karadag
Aydin, Resa
TURKISH JOURNAL OF PHYSICAL MEDICINE AND REHABILITATION, 2024,

← 1 2 3 4 5 →