Arthrosis diagnosis and treatment recommendations in clinical practice: an exploratory investigation with the generative AI model GPT-4

Cited by: 12
Authors
Pagano, Stefano [1]
Holzapfel, Sabrina [2]
Kappenschneider, Tobias [1]
Meyer, Matthias [1]
Maderbacher, Guenther [1]
Grifka, Joachim [1]
Holzapfel, Dominik Emanuel [1]
Affiliations
[1] Univ Regensburg, Dept Orthopaed Surg, Asklepios Klinikum, Bad Abbach, Germany
[2] Univ Regensburg, Univ Childrens Hosp Regensburg, Hosp St Hedwig Order St John, Dept Neonatol, Regensburg, Germany
Keywords
Artificial intelligence; ChatGPT-4; Large language model; Orthopaedics; Total joint replacement; Arthrosis; RELIABILITY;
DOI
10.1186/s10195-023-00740-4
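Chinese Library Classification (CLC) number
R826.8 [Plastic surgery]; R782.2 [Oral and maxillofacial plastic surgery]; R726.2 [Paediatric plastic surgery]; R62 [Plastic surgery (reconstructive surgery)];
Subject classification code
Abstract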
Background: The spread of artificial intelligence (AI) has led to transformative advancements in diverse sectors, including healthcare. Specifically, generative writing systems have shown potential in various applications, but their effectiveness in clinical settings has been barely investigated. In this context, we evaluated the proficiency of ChatGPT-4 in diagnosing gonarthrosis and coxarthrosis and recommending appropriate treatments compared with orthopaedic specialists.

Methods: A retrospective review was conducted using anonymized medical records of 100 patients previously diagnosed with either knee or hip arthrosis. ChatGPT-4 was employed to analyse these historical records, formulating both a diagnosis and potential treatment suggestions. Subsequently, a comparative analysis was conducted to assess the concordance between the AI's conclusions and the original clinical decisions made by the physicians.

Results: In diagnostic evaluations, ChatGPT-4 consistently aligned with the conclusions previously drawn by physicians. In terms of treatment recommendations, there was an 83% agreement between the AI and orthopaedic specialists. The therapeutic concordance was verified by the calculation of a Cohen's Kappa coefficient of 0.580 (p < 0.001). This indicates a moderate-to-good level of agreement. In recommendations pertaining to surgical treatment, the AI demonstrated a sensitivity and specificity of 78% and 80%, respectively. Multivariable logistic regression demonstrated that the variables reduced quality of life (OR 49.97, p < 0.001) and start-up pain (OR 12.54, p = 0.028) have an influence on ChatGPT-4's recommendation for a surgery.

Conclusion: This study emphasises ChatGPT-4's notable potential in diagnosing conditions such as gonarthrosis and coxarthrosis and in aligning its treatment recommendations with those of orthopaedic specialists. However, it is crucial to acknowledge that AI tools such as ChatGPT-4 are not meant to replace the nuanced expertise and clinical judgment of seasoned orthopaedic surgeons, particularly in complex decision-making scenarios regarding treatment indications. Due to the exploratory nature of the study, further research with larger patient populations and more complex diagnoses is necessary to validate the findings and explore the broader potential of AI in healthcare.
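The agreement statistics named in the abstract (percentage agreement, Cohen's Kappa, sensitivity, specificity) can all be derived from a cross-tabulation of AI and specialist decisions. The following is a minimal sketch for the binary case (surgery vs. no surgery), not the authors' analysis code. The function name `agreement_metrics` and the counts in the usage example are hypothetical and are not taken from the study; the study's own Kappa may have been computed over more than two treatment categories.

```python
def agreement_metrics(tp, fp, fn, tn):
    """Agreement statistics from a 2x2 table of AI vs. specialist decisions.

    tp: AI and specialist both recommend surgery
    fp: AI recommends surgery, specialist does not
    fn: specialist recommends surgery, AI does not
    tn: neither recommends surgery
    The specialist decision is treated as the reference standard.
    """
    n = tp + fp + fn + tn

    # Observed agreement: share of cases where both raters decide the same.
    observed = (tp + tn) / n

    # Agreement expected by chance, from each rater's marginal proportions.
    p_both_yes = ((tp + fp) / n) * ((tp + fn) / n)
    p_both_no = ((fn + tn) / n) * ((fp + tn) / n)
    expected = p_both_yes + p_both_no

    # Cohen's Kappa: agreement beyond chance, scaled by its maximum.
    kappa = (observed - expected) / (1 - expected)

    return {
        "observed_agreement": observed,
        "kappa": kappa,
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Hypothetical counts for illustration only (NOT the study data).
result = agreement_metrics(tp=40, fp=10, fn=10, tn=40)
for name, value in result.items():
    print(f"{name}: {value:.3f}")
```

With these illustrative counts, observed agreement is 80% while chance agreement is 50%, which is why Kappa comes out lower than the raw agreement rate. The same relationship explains how an 83% agreement can correspond to a Kappa of 0.580.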
Pages: 11
Related papers
24 in total
  • [1] Achiam OJ, 2023, Arxiv, DOI arXiv:2303.08774
  • [2] Role of Chat GPT in Public Health
    Biswas, Som S.
    [J]. ANNALS OF BIOMEDICAL ENGINEERING, 2023, 51 (05) : 868 - 869
  • [3] WHO declares the end of the COVID-19 global health emergency: lessons and recommendations from the perspective of ChatGPT/GPT-4
    Cheng, Kunming
    Wu, Chunchun
    Gu, Shuqin
    Lu, Yanqiu
    Wu, Haiyang
    Li, Cheng
    [J]. INTERNATIONAL JOURNAL OF SURGERY, 2023, 109 (09) : 2859 - 2862
  • [4] The Potential of GPT-4 as an AI-Powered Virtual Assistant for Surgeons Specialized in Joint Arthroplasty
    Cheng, Kunming
    Li, Zhiyong
    Li, Cheng
    Xie, Ruijie
    Guo, Qiang
    He, Yongbin
    Wu, Haiyang
    [J]. ANNALS OF BIOMEDICAL ENGINEERING, 2023, 51 (07) : 1366 - 1370
  • [5] The global burden of hip and knee osteoarthritis: estimates from the Global Burden of Disease 2010 study
    Cross, Marita
    Smith, Emma
    Hoy, Damian
    Nolte, Sandra
    Ackerman, Ilana
    Fransen, Marlene
    Bridgett, Lisa
    Williams, Sean
    Guillemin, Francis
    Hill, Catherine L.
    Laslett, Laura L.
    Jones, Graeme
    Cicuttini, Flavia M.
    Osborne, Richard
    Vos, Theo
    Buchbinder, Rachelle
    Woolf, Anthony
    March, Lyn
    [J]. ANNALS OF THE RHEUMATIC DISEASES, 2014, 73 (07) : 1323 - 1330
  • [6] Davenport Thomas, 2019, Future Healthc J, V6, P94, DOI 10.7861/futurehosp.6-2-94
  • [7] Eloundou T, 2023, Arxiv, DOI arXiv:2303.10130
  • [8] Harskamp RE, 2023, medRxiv, DOI 10.1101/2023.03.25.23285475
  • [9] ChatGPT Passes German State Examination in Medicine With Picture Questions Omitted
    Jung, Leonard B.
    Gudera, Jonas A.
    Wiegand, Tim L. T.
    Allmendinger, Simeon
    Dimitriadis, Konstantinos
    Koerte, Inga K.
    [J]. DEUTSCHES ARZTEBLATT INTERNATIONAL, 2023, 120 (21-22) : 373 - 374
  • [10] Exploring the potential of ChatGPT as a supplementary tool for providing orthopaedic information
    Kaarre, Janina
    Feldt, Robert
    Keeling, Laura E.
    Dadoo, Sahil
    Zsidai, Balint
    Hughes, Jonathan D.
    Samuelsson, Kristian
    Musahl, Volker
    [J]. KNEE SURGERY SPORTS TRAUMATOLOGY ARTHROSCOPY, 2023, 31 (11) : 5190 - 5198