Evaluating the accuracy and relevance of ChatGPT responses to frequently asked questions regarding total knee replacement

被引：8

作者：

Zhang, Siyuan ^{[1
]}

Liau, Zi Qiang Glen ^{[1
]}

Tan, Kian Loong Melvin ^{[1
]}

Chua, Wei Liang ^{[1
]}

机构：

[1] Natl Univ Hlth Syst, Dept Orthopaed Surg, Level 11,NUHS Tower Block,1E Kent Ridge Rd, Singapore 119228, Singapore

来源：

KNEE SURGERY & RELATED RESEARCH | 2024年 / 36卷 / 01期

关键词：

ChatGPT; Artificial intelligence; Chatbot; Large language model; Total knee replacement; Total knee arthroplasty; ARTHROPLASTY;

D O I：

10.1186/s43019-024-00218-5

中图分类号：

R826.8 [整形外科学]; R782.2 [口腔颌面部整形外科学]; R726.2 [小儿整形外科学]; R62 [整形外科学（修复外科学）];

学科分类号：

摘要：

Background Chat Generative Pretrained Transformer (ChatGPT), a generative artificial intelligence chatbot, may have broad applications in healthcare delivery and patient education due to its ability to provide human-like responses to a wide range of patient queries. However, there is limited evidence regarding its ability to provide reliable and useful information on orthopaedic procedures. This study seeks to evaluate the accuracy and relevance of responses provided by ChatGPT to frequently asked questions (FAQs) regarding total knee replacement (TKR).Methods A list of 50 clinically-relevant FAQs regarding TKR was collated. Each question was individually entered as a prompt to ChatGPT (version 3.5), and the first response generated was recorded. Responses were then reviewed by two independent orthopaedic surgeons and graded on a Likert scale for their factual accuracy and relevance. These responses were then classified into accurate versus inaccurate and relevant versus irrelevant responses using preset thresholds on the Likert scale.Results Most responses were accurate, while all responses were relevant. Of the 50 FAQs, 44/50 (88%) of ChatGPT responses were classified as accurate, achieving a mean Likert grade of 4.6/5 for factual accuracy. On the other hand, 50/50 (100%) of responses were classified as relevant, achieving a mean Likert grade of 4.9/5 for relevance.Conclusion ChatGPT performed well in providing accurate and relevant responses to FAQs regarding TKR, demonstrating great potential as a tool for patient education. However, it is not infallible and can occasionally provide inaccurate medical information. Patients and clinicians intending to utilize this technology should be mindful of its limitations and ensure adequate supervision and verification of information provided.

引用

页数：8

共 50 条

[1] Evaluating the accuracy and relevance of ChatGPT responses to frequently asked questions regarding total knee replacement
Siyuan Zhang
Zi Qiang Glen Liau
Kian Loong Melvin Tan
Wei Liang Chua
Knee Surgery & Related Research, 36
[2] Evaluating ChatGPT responses to frequently asked patient questions regarding periprosthetic joint infection after total hip and knee arthroplasty
Hu, Xiaojun
Niemann, Marcel
Kienzle, Arne
Braun, Karl
Back, David Alexander
Gwinner, Clemens
Renz, Nora
Stoeckle, Ulrich
Trampuz, Andrej
Meller, Sebastian
DIGITAL HEALTH, 2024, 10
[3] ChatGPT is capable of providing satisfactory responses to frequently asked questions regarding total shoulder arthroplasty
Yeramosu, Teja
Johns, William L.
Onor, Gabriel
Menendez, Mariano E.
Namdari, Surena
Hammoud, Sommer
SHOULDER & ELBOW, 2024, 16 (04) : 407 - 412
[4] Accuracy assessment of ChatGPT responses to frequently asked questions regarding anterior cruciate ligament surgery
Villarreal-Espinosa, Juan Bernardo
Berreta, Rodrigo Saad
Allende, Felicitas
Garcia, Jose Rafael
Ayala, Salvador
Familiari, Filippo
Chahla, Jorge
KNEE, 2024, 51 : 84 - 92
[5] Evaluation of information accuracy and clarity: ChatGPT responses to the most frequently asked questions about premature ejaculation
Sahin, Mehmet Fatih
Keles, Anil
Ozcan, Ridvan
Dogan, Cagri
Topkac, Erdem Can
Akgul, Murat
Yazici, Cenk Murat
SEXUAL MEDICINE, 2024, 12 (03)
[6] An assessment of ChatGPT's responses to frequently asked questions about cervical and breast cancer
Ye, Zichen
Zhang, Bo
Zhang, Kun
Mendez, Maria Jose Gonzalez
Yan, Huijiao
Wu, Tong
Qu, Yimin
Jiang, Yu
Xue, Peng
Qiao, Youlin
BMC WOMENS HEALTH, 2024, 24 (01)
[7] Language-adaptive artificial intelligence: assessing CHATGPT'S answer to frequently asked questions on total hip arthroplasty questions
Ibrahim, Muhammad Talal
Khaskheli, Sarah Ashraf
Shahzad, Hania
Noordin, Shahryar
JOURNAL OF THE PAKISTAN MEDICAL ASSOCIATION, 2024, 74 (04) : S161 - S164
[8] Evaluating ChatGPT's Ability to Address Frequently Asked Questions in Gender-Affirmation Surgery
Rothchild, Evan
Jung, Geena
Ricci, Joseph A.
JOURNAL OF HOMOSEXUALITY, 2025,
[9] Assessing ChatGPT Ability to Answer Frequently Asked Questions About Essential Tremor
Sorrentino, Cristiano
Canoro, Vincenzo
Russo, Maria
Giordano, Caterina
Barone, Paolo
Erro, Roberto
TREMOR AND OTHER HYPERKINETIC MOVEMENTS, 2024, 14 : 1 - 10
[10] Can ChatGPT Answer Patient Questions Regarding Total Knee Arthroplasty?
Mika, Aleksander P.
Mulvey, Hillary E.
Engstrom, Stephen M.
Polkowski, Gregory G.
Martin, J. Ryan
Wilson, Jacob M.
JOURNAL OF KNEE SURGERY, 2024, 37 (09) : 664 - 673

← 1 2 3 4 5 →