Assessing the accuracy of ChatGPT references in head and neck and ENT disciplines

被引:28
作者
Frosolini, Andrea [1 ]
Franz, Leonardo [2 ,3 ]
Benedetti, Simone [1 ]
Vaira, Luigi Angelo [4 ,5 ]
de Filippis, Cosimo [2 ]
Gennaro, Paolo [1 ]
Marioni, Gino [2 ]
Gabriele, Guido [1 ]
机构
[1] Univ Siena, Dept Maxillofacial Surg, Policlin Le Scotte, Siena, Italy
[2] Univ Padua, Dept Neurosci DNS, Phoniatris & Audiol Unit, Treviso, Italy
[3] Univ Brescia, Dept Clin & Expt Sci, Artificial Intelligence Med & Innovat Clin Res & M, Brescia, Italy
[4] Univ Sassari, Dept Med Surg & Pharm, Maxillofacial Surg Operat Unit, Sassari, Italy
[5] Univ Sassari, PhD Sch Biomed Sci, Dept Biomed Sci, Sassari, Italy
关键词
Head and neck surgery; Maxillofacial; AI; Chat-GPT; Artificial intelligence;
D O I
10.1007/s00405-023-08205-4
中图分类号
R76 [耳鼻咽喉科学];
学科分类号
100213 ;
摘要
PurposeChatGPT has gained popularity as a web application since its release in 2022. While artificial intelligence (AI) systems' potential in scientific writing is widely discussed, their reliability in reviewing literature and providing accurate references remains unexplored. This study examines the reliability of references generated by ChatGPT language models in the Head and Neck field.MethodsTwenty clinical questions were generated across different Head and Neck disciplines, to prompt ChatGPT versions 3.5 and 4.0 to produce texts on the assigned topics. The generated references were categorized as "true," "erroneous," or "inexistent" based on congruence with existing records in scientific databases.ResultsChatGPT 4.0 outperformed version 3.5 in terms of reference reliability. However, both versions displayed a tendency to provide erroneous/non-existent references.ConclusionsIt is crucial to address this challenge to maintain the reliability of scientific literature. Journals and institutions should establish strategies and good-practice principles in the evolving landscape of AI-assisted scientific writing.
引用
收藏
页码:5129 / 5133
页数:5
相关论文
共 50 条
  • [21] Assessing the Accuracy of Responses by the Language Model ChatGPT to Questions Regarding Bariatric Surgery
    Samaan, Jamil S.
    Yeo, Yee Hui
    Rajeev, Nithya
    Hawley, Lauren
    Abel, Stuart
    Ng, Wee Han
    Srinivasan, Nitin
    Park, Justin
    Burch, Miguel
    Watson, Rabindra
    Liran, Omer
    Samakar, Kamran
    OBESITY SURGERY, 2023, 33 (06) : 1790 - 1796
  • [22] Assessing the accuracy and utility of ChatGPT responses to patient questions regarding posterior lumbar decompression
    Giakas, Alec M.
    Narayanan, Rajkishen
    Ezeonu, Teeto
    Dalton, Jonathan
    Lee, Yunsoo
    Henry, Tyler
    Mangan, John
    Schroeder, Gregory
    Vaccaro, Alexander
    Kepler, Christopher
    ARTIFICIAL INTELLIGENCE SURGERY, 2024, 4 (03): : 233 - 246
  • [23] Assessing the Accuracy of Information on Medication Abortion: A Comparative Analysis of ChatGPT and Google Bard AI
    Mediboina, Anjali
    Badam, Rajani Kumari
    Chodavarapu, Sailaja
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (01)
  • [24] REASONS FOR CANCELLATION OF ENT, HEAD AND NECK SURGERIES IN A NIGERIAN TEACHING HOSPITAL
    Adobamen, Paul Oserhemhen
    Imarengiaye, Charles
    GOMAL JOURNAL OF MEDICAL SCIENCES, 2012, 10 (02): : 190 - 193
  • [25] Assessing ChatGPT's Potential in HIV Prevention Communication: A Comprehensive Evaluation of Accuracy, Completeness, and Inclusivity
    De Vito, Andrea
    Colpani, Agnese
    Moi, Giulia
    Babudieri, Sergio
    Calcagno, Andrea
    Calvino, Valeria
    Ceccarelli, Manuela
    Colpani, Gianmaria
    d'Ettorre, Gabriella
    Di Biagio, Antonio
    Farinella, Massimo
    Falaguasta, Marco
    Foca, Emanuele
    Giupponi, Giusi
    Habed, Adriano Jose
    Isenia, Wigbertson Julian
    Lo Caputo, Sergio
    Marchetti, Giulia
    Modesti, Luca
    Mussini, Cristina
    Nunnari, Giuseppe
    Rusconi, Stefano
    Russo, Daria
    Saracino, Annalisa
    Serra, Pier Andrea
    Madeddu, Giordano
    AIDS AND BEHAVIOR, 2024, : 2746 - 2754
  • [26] Assessing the role of advanced artificial intelligence as a tool in multidisciplinary tumor board decision-making for recurrent/metastatic head and neck cancer cases - the first study on ChatGPT 4o and a comparison to ChatGPT 4.0
    Schmidl, Benedikt
    Huetten, Tobias
    Pigorsch, Steffi
    Stoegbauer, Fabian
    Hoch, Cosima C.
    Hussain, Timon
    Wollenberg, Barbara
    Wirth, Markus
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [27] Assessing the accuracy and explainability of using ChatGPT to evaluate the quality of health news
    Xiaoyu Liu
    Lu He
    Eman Alanazi
    Echu Liu
    Arianna Goss
    Lionel Gumireddy
    BMC Public Health, 25 (1)
  • [28] ENT medicine and head and neck surgery in the G-DRG system 2008
    Franz, D.
    Roeder, N.
    Hoermann, K.
    Alberty, J.
    HNO, 2008, 56 (09) : 874 - 880
  • [29] Let's chat about cervical cancer: Assessing the accuracy of ChatGPT responses to cervical cancer questions
    Hermann, Catherine E.
    Patel, Jharna M.
    Boyd, Leslie
    Aviki, Emeline
    Stasenko, Marina
    GYNECOLOGIC ONCOLOGY, 2023, 179 : 164 - 168
  • [30] Assessing the accuracy and reproducibility of artificial intelligence-generated medical responses by ChatGPT on Scheuermann's kyphosis
    Giray, Esra
    Illeez, Ozge Gulsum
    Korkmaz, Merve Damla
    Capan, Nalan
    Saygi, Evrim Karadag
    Aydin, Resa
    TURKISH JOURNAL OF PHYSICAL MEDICINE AND REHABILITATION, 2024,