Performance of ChatGPT on the Plastic Surgery Inservice Training Examination

Cited by: 61
Authors
Gupta, Rohun [1 ,3 ]
Herzog, Isabel [2 ]
Park, John B. [2 ]
Weisberger, Joseph
Firouzbakht, Peter [1 ]
Ocon, Vanessa
Chao, John [2 ]
Lee, Edward S.
Mailey, Brian A.
Affiliations
[1] St Louis Univ, Dept Surg, Div Plast Surg, Sch Med, St Louis, MO USA
[2] Rutgers New Jersey Sch Med, Dept Plast Surg, Newark, NJ USA
[3] SLUCare Acad Pavil, 1008 S Spring Ave, Suite 1500, St Louis, MO 63110 USA
DOI
10.1093/asj/sjad128
CLC number
R61 [Operative Surgery]
Abstract
Background: Originally developed as a tool for resident self-evaluation, the Plastic Surgery Inservice Training Examination (PSITE) has become a standardized assessment adopted by Plastic Surgery residency programs. The introduction of large language models (LLMs) such as ChatGPT (OpenAI, San Francisco, CA) has shown potential to help propel the field of Plastic Surgery.
Objectives: The authors sought to assess whether ChatGPT could be used as a tool in resident education by evaluating its accuracy on the PSITE.
Methods: Questions were obtained from the 2022 PSITE, available on the American Council of Academic Plastic Surgeons (ACAPS) website. Questions containing images or tables were carefully inspected and flagged before being input into ChatGPT. All ChatGPT responses were qualified using the properties of natural coherence, and incorrect responses were classified as logical, informational, or explicit fallacies.
Results: ChatGPT answered a total of 242 questions with an accuracy of 54.96%. The software incorporated logical reasoning in 88.8% of questions, internal information in 95.5%, and external information in 92.1%. When responses were stratified by correctness, there was a statistically significant difference in ChatGPT's use of external information (P < .05).
Conclusions: ChatGPT is a versatile tool with the potential to affect resident education by providing general knowledge, clarifying information, supporting case-based learning, and promoting evidence-based medicine. With continued advances in LLMs and artificial intelligence (AI), ChatGPT may become an impactful tool for resident education within Plastic Surgery.
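The Results rest on comparing external-information use between correct and incorrect responses. The sketch below illustrates the kind of analysis implied, a chi-square test of independence on a 2x2 contingency table; the abstract does not state which statistical test the authors used, only the marginal totals can be derived from the reported figures (242 questions, 54.96% correct, 92.1% citing external information), and the cell-level split shown here is hypothetical.

```python
# Hypothetical illustration, not the authors' code: test whether ChatGPT's use
# of external information differs between correct and incorrect responses.
# Marginal totals match the abstract (242 questions, 54.96% correct,
# 92.1% citing external information); the individual cell counts are invented.
from scipy.stats import chi2_contingency

#                  correct  incorrect
contingency = [
    [120,         103],   # response used external information
    [13,            6],   # response did not use external information
]

chi2, p_value, dof, expected = chi2_contingency(contingency)
print(f"chi-square = {chi2:.2f}, dof = {dof}, p = {p_value:.4f}")
```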
Pages: NP1078-NP1082
Number of pages: 5