Accuracy of ChatGPT-Generated Information on Head and Neck and Oromaxillofacial Surgery: A Multicenter Collaborative Analysis

Cited by: 48
Authors
Vaira, Luigi Angelo [1, 2, 21]
Lechien, Jerome R. [3, 4]
Abbate, Vincenzo [5]
Allevi, Fabiana [6]
Audino, Giovanni [5]
Beltramini, Giada Anna [7, 8]
Bergonzani, Michela [9]
Bolzoni, Alessandro [7]
Committeri, Umberto [5]
Crimi, Salvatore [10]
Gabriele, Guido [11]
Lonardi, Fabio [12]
Maglitto, Fabio [13]
Petrocelli, Marzia [14]
Pucci, Resi [15]
Saponaro, Gianmarco [16]
Tel, Alessandro [17]
Vellone, Valentino [18]
Chiesa-Estomba, Carlos Miguel [19]
Boscolo-Rizzo, Paolo [20]
Salzano, Giovanni [5]
De Riu, Giacomo [1]
Affiliations
[1] Univ Sassari, Dept Med Surg & Pharm, Maxillofacial Surg Operat Unit, Sassari, Italy
[2] Univ Sassari, PhD Sch Biomed Sci, Biomed Sci Dept, Sassari, Italy
[3] Univ Mons UMons, Res Inst Hlth Sci & Technol, Mons Sch Med, Dept Anat & Expt Oncol,UMONS, Mons, Belgium
[4] Elsan Polyclin Poitiers, Dept Otolaryngol Head Neck Surg, Poitiers, France
[5] Federico II Univ Naples, Dept Neurosci Reprod & Odontostomatol Sci, Head & Neck Sect, Naples, Italy
[6] Univ Milan, Maxillofacial Surg Dept, ASSt Santi Paolo & Carlo, Milan, Italy
[7] Univ Milan, Dept Biomed Surg & Dent Sci, Milan, Italy
[8] Fdn IRCCS Ca Granda Osped Maggiore Policlin, Maxillofacial & Dent Unit, Milan, Italy
[9] Univ Hosp Parma, Head & Neck Dept, Maxillo Facial Surg Div, Parma, Italy
[10] Univ Catania, Operat Unit Maxillofacial Surg, Policlin San Marco, Catania, Italy
[11] Univ Siena, Dept Maxillofacial Surg, Siena, Italy
[12] Univ Verona, Dept Maxillofacial Surg, Verona, Italy
[13] Univ Bari Aldo Moro, Maxillo Facial Surg Unit, Bari, Italy
[14] Bellaria & Maggiore Hosp, Maxillofacial Surg Operat Unit, Bologna, Italy
[15] San Camillo Forlanini Hosp, Maxillofacial Surg Unit, Rome, Italy
[16] Univ Cattolica Sacro Cuore, IRCSS A Gemelli Fdn Catholic, Maxillo Facial Surg Unit, Rome, Italy
[17] Univ Hosp Udine, Dept Head & Neck Surg & Neurosci, Clin Maxillofacial Surg, Udine, Italy
[18] S Maria Hosp, Maxillofacial Surg Unit, Terni, Italy
[19] Hosp Univ Donostia, Dept Otorhinolaryngol Head & Neck Surg, San Sebastian, Spain
[20] Univ Trieste, Dept Med Surg & Hlth Sci, Sect Otolaryngol, Trieste, Italy
[21] Univ Sassari, Viale San Pietro 43-B, I-07100 Sassari, Italy
Keywords
artificial intelligence; ChatGPT; maxillofacial surgery; otorhinolaryngology
DOI
10.1002/ohn.489
Chinese Library Classification number
R76 [Otorhinolaryngology]
Discipline classification code
100213
Abstract
Objective: To investigate the accuracy of Chat-Based Generative Pre-trained Transformer (ChatGPT) in answering questions and solving clinical scenarios of head and neck surgery.
Study Design: Observational and valuative study.
Setting: Eighteen surgeons from 14 Italian head and neck surgery units.
Methods: A total of 144 clinical questions encompassing different subspecialities of head and neck surgery and 15 comprehensive clinical scenarios were developed. Questions and scenarios were inputted into ChatGPT4, and the resulting answers were evaluated by the researchers using accuracy (range 1-6), completeness (range 1-3), and references' quality Likert scales.
Results: The overall median score of open-ended questions was 6 (interquartile range [IQR]: 5-6) for accuracy and 3 (IQR: 2-3) for completeness. Overall, the reviewers rated the answer as entirely or nearly entirely correct in 87.2% of cases and as comprehensive and covering all aspects of the question in 73% of cases. The artificial intelligence (AI) model achieved a correct response in 84.7% of the closed-ended questions (11 wrong answers). As for the clinical scenarios, ChatGPT provided a fully or nearly fully correct diagnosis in 81.7% of cases. The proposed diagnostic or therapeutic procedure was judged to be complete in 56.7% of cases. The overall quality of the bibliographic references was poor, and sources were nonexistent in 46.4% of the cases.
Conclusion: The results generally demonstrate a good level of accuracy in the AI's answers. The AI's ability to resolve complex clinical scenarios is promising, but it still falls short of being considered a reliable support for the decision-making process of specialists in head-neck surgery.
Pages: 1492-1503
Page count: 12