Use of ChatGPT for Determining Clinical and Surgical Treatment of Lumbar Disc Herniation With Radiculopathy: A North American Spine Society Guideline Comparison

被引：1

作者：

Mejia, Mateo Restrepo ^{[1
]}

Arroyave, Juan Sebastian ^{[1
]}

Saturno, Michael ^{[1
]}

Ndjonko, Laura Chelsea Mazudie ^{[1
]}

Zaidat, Bashar ^{[1
]}

Rajjoub, Rami ^{[1
]}

Ahmed, Wasil ^{[1
]}

Zapolsky, Ivan ^{[1
]}

Cho, Samuel K. ^{[1
,2
]}

机构：

[1] Icahn Sch Med Mt Sinai, Dept Orthoped Surg, New York, NY USA

[2] Icahn Sch Med Mt Sinai, Dept Orthoped Surg, 425 West 59th St, New York, NY 10019 USA

来源：

NEUROSPINE | 2024年 / 21卷 / 01期

关键词：

Artificial intelligence; ChatGPT; Lumbar disk herniation with radiculopathy; North American Spine Society guidelines; Qualitative study;

D O I：

10.14245/ns.2448248.124

中图分类号：

R74 [神经病学与精神病学];

学科分类号：

摘要：

Objective: Large language models like chat generative pre-trained transformer (ChatGPT) have found success in various sectors, but their application in the medical field remains limited. This study aimed to assess the feasibility of using ChatGPT to provide accurate medical information to patients, specifically evaluating how well ChatGPT versions 3.5 and 4 aligned with the 2012 North American Spine Society (NASS) guidelines for lumbar disk herniation with radiculopathy. Methods: ChatGPT's responses to questions based on the NASS guidelines were analyzed for accuracy. Three new categories-overconclusiveness, supplementary information, and incompleteness-were introduced to deepen the analysis. Overconclusiveness referred to recommendations not mentioned in the NASS guidelines, supplementary information denoted additional relevant details, and incompleteness indicated omitted crucial information from the NASS guidelines. Results: Out of 29 clinical guidelines evaluated, ChatGPT-3. 5 demonstrated accuracy in 15 responses (52%), while ChatGPT-4 achieved accuracy in 17 responses (59%). ChatGPT-3. 5 was overconclusive in 14 responses (48%), while ChatGPT-4 exhibited overconclusiveness in 13 responses (45%). Additionally, ChatGPT-3. 5 provided supplementary information in 24 responses (83%), and ChatGPT-4 provided supplemental information in 27 responses (93%). In terms of incompleteness, ChatGPT-3. 5 displayed this in 11 responses (38%), while ChatGPT-4 showed incompleteness in 8 responses (23%). Conclusion: ChatGPT shows promise for clinical decision-making, but both patients and healthcare providers should exercise caution to ensure safety and quality of care. While these results are encouraging, further research is necessary to validate the use of large language models in clinical settings.

引用

页码：149 / 158

页数：10

共 23 条

[1] Artificial Hallucinations in ChatGPT: Implications in Scientific Writing [J].

Alkaissi, Hussam ;

McFarlane, Samy I. .

CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (02)

[2]

[Anonymous], 2024, NEW MODELS DEVELOPER

[3] AI chatbots not yet ready for clinical use [J].

Au Yeung, Joshua ;

Kraljevic, Zeljko ;

Luintel, Akish ;

Balston, Alfred ;

Idowu, Esther ;

Dobson, Richard J. J. ;

Teo, James T. T. .

FRONTIERS IN DIGITAL HEALTH, 2023, 5

[4] Thromboembolic prophylaxis in spine surgery: an analysis of ChatGPT recommendations [J].

Duey, Akiro H. ;

Nietsch, Katrina S. ;

Zaidat, Bashar ;

Ren, Renee ;

Ndjonko, Laura C. Mazudie ;

Shrestha, Nancy ;

Rajjoub, Rami ;

Ahmed, Wasil ;

Hoang, Timothy ;

Saturno, Michael P. ;

Tang, Justin E. ;

Gallate, Zachary S. ;

Kim, Jun S. ;

Cho, Samuel K. .

SPINE JOURNAL, 2023, 23 (11) :1684-1691

[5]

Edmonston Daniel L, 2010, J Surg Orthop Adv, V19, P174

[6]

Elkatatny Amr Abdelmonam Abdelaziz Mostafa, 2019, Open Access Maced J Med Sci, V7, P2851, DOI 10.3889/oamjms.2019.679

[7] Accuracy of ChatGPT generated diagnosis from patient's medical history and imaging findings in neuroradiology cases [J].

Horiuchi, Daisuke ;

Tatekawa, Hiroyuki ;

Shimono, Taro ;

Walston, Shannon L. ;

Takita, Hirotaka ;

Matsushita, Shu ;

Oura, Tatsushi ;

Mitsuyama, Yasuhito ;

Miki, Yukio ;

Ueda, Daiju .

NEURORADIOLOGY, 2024, 66 (01) :73-79

[8]

Institute of Medicine, 2001, Crossing the quality chasm: a new health system for the 21st century, DOI DOI 10.17226/10027

[9] An evidence-based clinical guideline for the diagnosis and treatment of lumbar disc herniation with radiculopathy [J].

Kreiner, D. Scott ;

Hwang, Steven W. ;

Easa, John E. ;

Resnick, Daniel K. ;

Baisden, Jamie L. ;

Bess, Shay ;

Cho, Charles H. ;

DePalma, Michael J. ;

Dougherty, Paul, II ;

Fernand, Robert ;

Ghiselli, Gary ;

Hanna, Amgad S. ;

Lamer, Tim ;

Lisi, Anthony J. ;

Mazanec, Daniel J. ;

Meagher, Richard J. ;

Nucci, Robert C. ;

Patel, Rakesh D. ;

Sembrano, Jonathan N. ;

Sharma, Anil K. ;

Summers, Jeffrey T. ;

Taleghani, Christopher K. ;

Tontz, William L., Jr. ;

Toton, John F. .

SPINE JOURNAL, 2014, 14 (01) :180-191

[10]

Kung T. H., 2023, PLoS digital health, V2, DOI DOI 10.1371/JOURNALPDIG.0000198

← 1 2 3 →