Exploring the landscape of AI-assisted decision-making in head and neck cancer treatment: a comparative analysis of NCCN guidelines and ChatGPT responses

Cited by: 24
Authors
Marchi, Filippo [1, 2]
Bellini, Elisa [1, 2]
Iandelli, Andrea [1]
Sampieri, Claudio [3, 4, 5]
Peretti, Giorgio [1, 2]
Affiliations
[1] IRCCS Osped Policlin San Martino, Unit Otorhinolaryngol Head & Neck Surg, Largo Rosanna Benzi 10, I-16132 Genoa, Italy
[2] Univ Genoa, Dept Surg Sci & Integrated Diagnost DISC, I-16132 Genoa, Italy
[3] Univ Genoa, Dept Expt Med DIMES, Genoa, Italy
[4] Hosp Clin Univ, Dept Otolaryngol, Barcelona, Spain
[5] Hosp Clin Barcelona, Funct Unit Head Neck Tumors, Barcelona, Spain
Keywords
Machine learning; Artificial intelligence (AI) models; ChatGPT; Cancer care; National Comprehensive Cancer Network (NCCN) Guidelines; Head and neck cancers
DOI
10.1007/s00405-024-08525-z
Chinese Library Classification (CLC) number
R76 [Otorhinolaryngology]
Discipline classification code
100213
Abstract
Purpose: Recent breakthroughs in natural language processing and machine learning, exemplified by ChatGPT, have spurred a paradigm shift in healthcare. Released by OpenAI in November 2022, ChatGPT rapidly gained global attention. Trained on massive text datasets, this large language model holds immense potential to revolutionize healthcare. However, existing literature often overlooks the need for rigorous validation and real-world applicability.
Methods: This head-to-head comparative study assesses ChatGPT's capabilities in providing therapeutic recommendations for head and neck cancers. Simulating every NCCN Guidelines scenario, ChatGPT is queried on primary treatment, adjuvant treatment, and follow-up, and its responses are compared with the NCCN Guidelines. Performance metrics, including sensitivity, specificity, and F1 score, are employed for assessment.
Results: The study includes 68 hypothetical cases and 204 clinical scenarios. ChatGPT exhibits promising capabilities in addressing NCCN-related queries, achieving high sensitivity and overall accuracy across primary treatment, adjuvant treatment, and follow-up. The study's metrics showcase robustness in providing relevant suggestions. However, a few inaccuracies are noted, especially in primary treatment scenarios.
Conclusion: Our study highlights the proficiency of ChatGPT in providing treatment suggestions. The model's alignment with the NCCN Guidelines sets the stage for a nuanced exploration of AI's evolving role in oncological decision support. However, challenges related to the interpretability of AI in clinical decision-making and the importance of clinicians understanding the underlying principles of AI models remain unexplored. As AI continues to advance, collaborative efforts between models and medical experts are deemed essential for unlocking new frontiers in personalized cancer care.
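The abstract names sensitivity, specificity, and F1 score as performance metrics but does not say how individual responses were scored. The following is a minimal sketch of the standard definitions of these metrics, assuming each model suggestion has already been labeled as a true positive, false positive, true negative, or false negative against a reference standard. The `ConfusionCounts` class, the `summarize` function, and the counts in the usage lines are hypothetical illustrations, not the study's method or data.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts from comparing model suggestions with a reference standard."""
    tp: int  # suggestion made and supported by the reference
    fp: int  # suggestion made but not supported by the reference
    tn: int  # option correctly not suggested
    fn: int  # reference-supported option that the model missed


def _ratio(numerator: int, denominator: int) -> float:
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else math.nan


def summarize(c: ConfusionCounts) -> dict[str, float]:
    """Compute the standard metrics from the four confusion counts."""
    return {
        "sensitivity": _ratio(c.tp, c.tp + c.fn),  # recall, true positive rate
        "specificity": _ratio(c.tn, c.tn + c.fp),  # true negative rate
        "precision": _ratio(c.tp, c.tp + c.fp),    # positive predictive value
        "accuracy": _ratio(c.tp + c.tn, c.tp + c.fp + c.tn + c.fn),
        # F1 is the harmonic mean of precision and sensitivity,
        # which simplifies to 2*TP / (2*TP + FP + FN).
        "f1": _ratio(2 * c.tp, 2 * c.tp + c.fp + c.fn),
    }


if __name__ == "__main__":
    # Hypothetical counts chosen only to exercise the formulas.
    example = ConfusionCounts(tp=8, fp=2, tn=9, fn=1)
    for name, value in summarize(example).items():
        print(f"{name}: {value:.3f}")
```

Returning NaN for an empty denominator is one possible design choice; it keeps an undefined metric visibly undefined instead of reporting it as zero.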
Pages: 2123-2136
Number of pages: 14