Assessing the use of the novel tool Claude 3 in comparison to ChatGPT 4.0 as an artificial intelligence tool in the diagnosis and therapy of primary head and neck cancer cases

被引:12
作者
Schmidl, Benedikt [1 ]
Huetten, Tobias [1 ]
Pigorsch, Steffi [2 ]
Stoegbauer, Fabian [3 ]
Hoch, Cosima C. [1 ]
Hussain, Timon [1 ]
Wollenberg, Barbara [1 ]
Wirth, Markus [1 ]
机构
[1] Tech Univ Munich, Dept Otolaryngol Head & Neck Surg, Munich, Germany
[2] Tech Univ Munich, Dept RadioOncol, Munich, Germany
[3] Tech Univ Munich, Inst Pathol, Munich, Germany
关键词
Claude; 3; Opus; HNSCC; Multidisciplinary Tumorboard; Artificial Intelligence; LLM; SQUAMOUS-CELL CARCINOMA; PHASE-II; NIVOLUMAB;
D O I
10.1007/s00405-024-08828-1
中图分类号
R76 [耳鼻咽喉科学];
学科分类号
100213 ;
摘要
ObjectivesHead and neck squamous cell carcinoma (HNSCC) is a complex malignancy that requires a multidisciplinary tumor board approach for individual treatment planning. In recent years, artificial intelligence tools have emerged to assist healthcare professionals in making informed treatment decisions. This study investigates the application of the newly published LLM Claude 3 Opus compared to the currently most advanced LLM ChatGPT 4.0 for the diagnosis and therapy planning of primary HNSCC. The results were compared to that of a conventional multidisciplinary tumor board; (2) Materials and Methods: We conducted a study in March 2024 on 50 consecutive primary head and neck cancer cases. The diagnostics and MDT recommendations were compared to the Claude 3 Opus and ChatGPT 4.0 recommendations for each patient and rated by two independent reviewers for the following parameters: clinical recommendation, explanation, and summarization in addition to the Artificial Intelligence Performance Instrument (AIPI); (3) Results: In this study, Claude 3 achieved better scores for the diagnostic workup of patients than ChatGPT 4.0 and provided treatment recommendations involving surgery, chemotherapy, and radiation therapy. In terms of clinical recommendations, explanation and summarization Claude 3 scored similar to ChatGPT 4.0, listing treatment recommendations which were congruent with the MDT, but failed to cite the source of the information; (4) Conclusion: This study is the first analysis of Claude 3 for primary head and neck cancer cases and demonstrates a superior performance in the diagnosis of HNSCC than ChatGPT 4.0 and similar results for therapy recommendations. This marks the advent of a newly launched advanced AI model that may be superior to ChatGPT 4.0 for the assessment of primary head and neck cancer cases and may assist in the clinical diagnostic and MDT setting. Claude 3 OpusHNSCCMultidisciplinary TumorboardArtificial IntelligenceLLM
引用
收藏
页码:6099 / 6109
页数:11
相关论文
共 7 条
  • [1] Assessing the role of advanced artificial intelligence as a tool in multidisciplinary tumor board decision-making for recurrent/metastatic head and neck cancer cases - the first study on ChatGPT 4o and a comparison to ChatGPT 4.0
    Schmidl, Benedikt
    Huetten, Tobias
    Pigorsch, Steffi
    Stoegbauer, Fabian
    Hoch, Cosima C.
    Hussain, Timon
    Wollenberg, Barbara
    Wirth, Markus
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [2] Assessing the role of advanced artificial intelligence as a tool in multidisciplinary tumor board decision-making for primary head and neck cancer cases
    Schmidl, Benedikt
    Huetten, Tobias
    Pigorsch, Steffi
    Stoegbauer, Fabian
    Hoch, Cosima C.
    Hussain, Timon
    Wollenberg, Barbara
    Wirth, Markus
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [3] Quality of Information Provided by Artificial Intelligence Chatbots Surrounding the Reconstructive Surgery for Head and Neck Cancer: A Comparative Analysis Between ChatGPT4 and Claude2
    Boscolo-Rizzo, Paolo
    Marcuzzo, Alberto Vito
    Lazzarin, Chiara
    Giudici, Fabiola
    Polesel, Jerry
    Stellin, Marco
    Pettorelli, Andrea
    Spinato, Giacomo
    Ottaviano, Giancarlo
    Ferrari, Marco
    Borsetto, Daniele
    Zucchini, Simone
    Trabalzini, Franco
    Sia, Egidio
    Gardenal, Nicoletta
    Baruca, Roberto
    Fortunati, Alfonso
    Vaira, Luigi Angelo
    Tirelli, Giancarlo
    CLINICAL OTOLARYNGOLOGY, 2025, 50 (02) : 330 - 335
  • [4] Use of a Novel Artificial Intelligence Tool for Evaluating Primary Stability and Immediate Loading Suitability of Dental Implants: An In Vitro Pilot Study
    Degidi, Marco
    Daprile, Giuseppe
    INTERNATIONAL JOURNAL OF ORAL & MAXILLOFACIAL IMPLANTS, 2024, 39 (04) : e129 - e135
  • [5] The Use of an Artificial Intelligence-Driven Novel Tool for the Evaluation of Dental Implants Primary Stability and Immediate Loading Feasibility: A Multicenter Retrospective Study
    Degidi, Marco
    Daprile, Giuseppe
    Battelli, Filippo
    Caselli, Ernesto
    Cisternino, Luca
    Greco, Alessandro
    Palumbo, Daniele
    Quasso, Federico
    Rossi, Fabio
    Tavelli, Corrado
    Zaccheroni, Zoran
    JOURNAL OF CLINICAL MEDICINE, 2025, 14 (06)
  • [6] Robust Artificial Intelligence Tool for Atrial Fibrillation Diagnosis: Novel Development Approach Incorporating Both Atrial Electrograms and Surface ECG and Evaluation by Head-to- Head Comparison With Hospital-Based Physician ECG Readers
    Zhang, Yuji
    Xu, Shusheng
    Xing, Wenhui
    Chen, Qiong
    Liu, Xu
    Pu, Yachuan
    Xin, Fangran
    Jiang, Hui
    Yin, Zongtao
    Tao, Dengshun
    Zhou, Dong
    Zhu, Yan
    Yuan, Binhang
    Jin, Yan
    He, Yuanchen
    Wu, Yi
    Po, Sunny S.
    Wang, Huishan
    Benditt, David G.
    JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2024, 13 (03):
  • [7] Artificial intelligence electrocardiogram as a novel screening tool to detect a newly abnormal left ventricular ejection fraction after anthracycline-based cancer therapy
    Jacobs, Johanna E. J.
    Greason, Grace
    Mangold, Kathryn E.
    Wildiers, Hans
    Willems, Rik
    Janssens, Stefan
    Noseworthy, Peter
    Lopez-Jimenez, Francisco
    Voigt, Jens-Uwe
    Friedman, Paul
    Van Aelst, Lucas
    Vandenberk, Bert
    Attia, Zachi Itzhak
    Herrmann, Joerg
    EUROPEAN JOURNAL OF PREVENTIVE CARDIOLOGY, 2023, : 560 - 566