Arthrosis diagnosis and treatment recommendations in clinical practice: an exploratory investigation with the generative AI model GPT-4

被引：12

作者：

Pagano, Stefano ^{[1
]}

Holzapfel, Sabrina ^{[2
]}

Kappenschneider, Tobias ^{[1
]}

Meyer, Matthias ^{[1
]}

Maderbacher, Guenther ^{[1
]}

Grifka, Joachim ^{[1
]}

Holzapfel, Dominik Emanuel ^{[1
]}

机构：

[1] Univ Regensburg, Dept Orthopaed Surg, Asklepios Klinikum, Bad Abbach, Germany

[2] Univ Regensburg, Univ Childrens Hosp Regensburg, Hosp St Hedwig Order St John, Dept Neonatol, Regensburg, Germany

来源：

JOURNAL OF ORTHOPAEDICS AND TRAUMATOLOGY | 2023年 / 24卷 / 01期

关键词：

Artificial intelligence; ChatGPT-4; Large language model; Orthopaedics; Total joint replacement; Arthrosis; RELIABILITY;

D O I：

10.1186/s10195-023-00740-4

中图分类号：

R826.8 [整形外科学]; R782.2 [口腔颌面部整形外科学]; R726.2 [小儿整形外科学]; R62 [整形外科学（修复外科学）];

学科分类号：

摘要：

Background The spread of artificial intelligence (AI) has led to transformative advancements in diverse sectors, including healthcare. Specifically, generative writing systems have shown potential in various applications, but their effectiveness in clinical settings has been barely investigated. In this context, we evaluated the proficiency of ChatGPT-4 in diagnosing gonarthrosis and coxarthrosis and recommending appropriate treatments compared with orthopaedic specialists.Methods A retrospective review was conducted using anonymized medical records of 100 patients previously diagnosed with either knee or hip arthrosis. ChatGPT-4 was employed to analyse these historical records, formulating both a diagnosis and potential treatment suggestions. Subsequently, a comparative analysis was conducted to assess the concordance between the AI's conclusions and the original clinical decisions made by the physicians.Results In diagnostic evaluations, ChatGPT-4 consistently aligned with the conclusions previously drawn by physicians. In terms of treatment recommendations, there was an 83% agreement between the AI and orthopaedic specialists. The therapeutic concordance was verified by the calculation of a Cohen's Kappa coefficient of 0.580 (p < 0.001). This indicates a moderate-to-good level of agreement. In recommendations pertaining to surgical treatment, the AI demonstrated a sensitivity and specificity of 78% and 80%, respectively. Multivariable logistic regression demonstrated that the variables reduced quality of life (OR 49.97, p < 0.001) and start-up pain (OR 12.54, p = 0.028) have an influence on ChatGPT-4's recommendation for a surgery.Conclusion This study emphasises ChatGPT-4's notable potential in diagnosing conditions such as gonarthrosis and coxarthrosis and in aligning its treatment recommendations with those of orthopaedic specialists. However, it is crucial to acknowledge that AI tools such as ChatGPT-4 are not meant to replace the nuanced expertise and clinical judgment of seasoned orthopaedic surgeons, particularly in complex decision-making scenarios regarding treatment indications. Due to the exploratory nature of the study, further research with larger patient populations and more complex diagnoses is necessary to validate the findings and explore the broader potential of AI in healthcare.

引用

页数：11

共 24 条

[1] Achiam OJ, 2023, Arxiv, DOI arXiv:2303.08774
[2] Role of Chat GPT in Public Health
Biswas, Som S.
[J]. ANNALS OF BIOMEDICAL ENGINEERING, 2023, 51 (05) : 868 - 869
[3] WHO declares the end of the COVID-19 global health emergency: lessons and recommendations from the perspective of ChatGPT/GPT-4
Cheng, Kunming
Wu, Chunchun
Gu, Shuqin
Lu, Yanqiu
Wu, Haiyang
Li, Cheng
[J]. INTERNATIONAL JOURNAL OF SURGERY, 2023, 109 (09) : 2859 - 2862
[4] The Potential of GPT-4 as an AI-Powered Virtual Assistant for Surgeons Specialized in Joint Arthroplasty
Cheng, Kunming
Li, Zhiyong
Li, Cheng
Xie, Ruijie
Guo, Qiang
He, Yongbin
Wu, Haiyang
[J]. ANNALS OF BIOMEDICAL ENGINEERING, 2023, 51 (07) : 1366 - 1370
[5] The global burden of hip and knee osteoarthritis: estimates from the Global Burden of Disease 2010 study
Cross, Marita
Smith, Emma
Hoy, Damian
Nolte, Sandra
Ackerman, Ilana
Fransen, Marlene
Bridgett, Lisa
Williams, Sean
Guillemin, Francis
Hill, Catherine L.
Laslett, Laura L.
Jones, Graeme
Cicuttini, Flavia M.
Osborne, Richard
Vos, Theo
Buchbinder, Rachelle
Woolf, Anthony
March, Lyn
[J]. ANNALS OF THE RHEUMATIC DISEASES, 2014, 73 (07) : 1323 - 1330
[6] Davenport Thomas, 2019, Future Healthc J, V6, P94, DOI 10.7861/futurehosp.6-2-94
[7] Eloundou T, 2023, Arxiv, DOI arXiv:2303.10130
[8] Harskamp RE, 2023, medRxiv, DOI [10.1101/2023.03.25.23285475, 10.1101/2023.03.25.23285475, DOI 10.1101/2023.03.25.23285475, https://doi.org/10.1101/2023.03.25.23285475, DOI 10.1101/2023.03.25.23285475V1]
[9] ChatGPT Passes German State Examination in Medicine With Picture Questions Omitted
Jung, Leonard B.
Gudera, Jonas A.
Wiegand, Tim L. T.
Allmendinger, Simeon
Dimitriadis, Konstantinos
Koerte, Inga K.
[J]. DEUTSCHES ARZTEBLATT INTERNATIONAL, 2023, 120 (21-22): : 373 - 374
[10] Exploring the potential of ChatGPT as a supplementary tool for providing orthopaedic information
Kaarre, Janina
Feldt, Robert
Keeling, Laura E.
Dadoo, Sahil
Zsidai, Balint
Hughes, Jonathan D.
Samuelsson, Kristian
Musahl, Volker
[J]. KNEE SURGERY SPORTS TRAUMATOLOGY ARTHROSCOPY, 2023, 31 (11) : 5190 - 5198

← 1 2 3 →