Human versus artificial intelligence-generated arthroplasty literature: A single-blinded analysis of perceived communication, quality, and authorship source

被引：5

作者：

Lawrence, Kyle W. ^{[1
,2
]}

Habibi, Akram A. ^{[1
]}

Ward, Spencer A. ^{[1
]}

Lajam, Claudette M. ^{[1
]}

Schwarzkopf, Ran ^{[1
]}

Rozell, Joshua C. ^{[1
]}

机构：

[1] NYU Langone Hlth, Dept Orthoped Surg, New York, NY USA

[2] NYU Langone Hlth, Dept Orthoped Surg, 301 East 17th St,15th Floor Suite 1518, New York, NY 10003 USA

来源：

INTERNATIONAL JOURNAL OF MEDICAL ROBOTICS AND COMPUTER ASSISTED SURGERY | 2024年 / 20卷 / 01期

关键词：

artificial intelligence; ChatGPT; large language models; medical literature; total hip arthroplasty; total knee arthroplasty;

D O I：

10.1002/rcs.2621

中图分类号：

R61 [外科手术学];

学科分类号：

摘要：

BackgroundLarge language models (LLM) have unknown implications for medical research. This study assessed whether LLM-generated abstracts are distinguishable from human-written abstracts and to compare their perceived quality.MethodsThe LLM ChatGPT was used to generate 20 arthroplasty abstracts (AI-generated) based on full-text manuscripts, which were compared to originally published abstracts (human-written). Six blinded orthopaedic surgeons rated abstracts on overall quality, communication, and confidence in the authorship source. Authorship-confidence scores were compared to a test value representing complete inability to discern authorship.ResultsModestly increased confidence in human authorship was observed for human-written abstracts compared with AI-generated abstracts (p = 0.028), though AI-generated abstract authorship-confidence scores were statistically consistent with inability to discern authorship (p = 0.999). Overall abstract quality was higher for human-written abstracts (p = 0.019).ConclusionsAI-generated abstracts' absolute authorship-confidence ratings demonstrated difficulty in discerning authorship but did not achieve the perceived quality of human-written abstracts. Caution is warranted in implementing LLMs into scientific writing.

引用

页数：9

共 24 条

[1] Exploring ChatGPT for information of cardiopulmonary resuscitation
Ahn, Chiwon
[J]. RESUSCITATION, 2023, 185
[2] [Anonymous], 2023, ChatGPT General FAQ
[3] Bi AS., What's Important: The Next Academic-ChatGPT AI? JBJS
[4] Brameier DT, 2023, J BONE JOINT SURG AM, V105, P1388, DOI 10.2106/JBJS.23.00473
[5] ChatGPT and other artificial intelligence applications speed up scientific writing
Chen, Tzeng-Ji
[J]. JOURNAL OF THE CHINESE MEDICAL ASSOCIATION, 2023, 86 (04) : 351 - 353
[6] ABSTRACTS WRITTEN BY CHATGPT FOOL SCIENTISTS
Else, Holly
[J]. NATURE, 2023, 613 (7944) : 423 - 423
[7] Gao Catherine A, 2022, BioRxiv, V2022, P2022, DOI [10.1101/2022.12.23.521610, DOI 10.1101/2022.12.23.521610, DOI 10.1038/S41746-023-00819-6]
[8] The Impact of Artificial Intelligence (AI) Programs on Writing Scientific Research
Hammad, Mohamed
[J]. ANNALS OF BIOMEDICAL ENGINEERING, 2023, 51 (03) : 459 - 460
[9] COULD AI HELP YOU TO WRITE YOUR NEXT PAPER?
Hutson, Matthew
[J]. NATURE, 2022, 611 (7934) : 192 - 193
[10] Jeblick K., 2022, arXiv

← 1 2 3 →