Assessing ChatGPT Ability to Answer Frequently Asked Questions About Essential Tremor

Cited by: 0
Authors
Sorrentino, Cristiano [1]
Canoro, Vincenzo [1, 2]
Russo, Maria [1]
Giordano, Caterina [1]
Barone, Paolo [1]
Erro, Roberto [1]
Affiliations
[1] Univ Salerno, Surg & Dent Scuola Med Salernitana, Dept Med, Neurosci Sect, Via Allende 43, I-84081 Baronissi, SA, Italy
[2] Umberto I Hosp, Dept Neurol, Nocera Inferiore, SA, Italy
Source
TREMOR AND OTHER HYPERKINETIC MOVEMENTS | 2024 / Vol. 14
Keywords
Essential tremor; Movement disorders; Large language model; Artificial intelligence; ChatGPT; Readability; Disorder
DOI
10.5334/tohm.917
Chinese Library Classification (CLC) number
R74 [Neurology and Psychiatry]
Abstract
Background: Large language models (LLMs) driven by artificial intelligence allow people to engage in direct conversations about their health. The accuracy and readability of the answers provided by ChatGPT, the most famous LLM, about Essential Tremor (ET), one of the most common movement disorders, have not yet been evaluated. Methods: Answers given by ChatGPT to 10 questions about ET were evaluated by 5 professionals and 15 laypeople, who scored each response from 1 (poor) to 5 (excellent) for clarity, relevance, accuracy (professionals only), comprehensiveness, and overall value. We further calculated the readability of the answers. Results: ChatGPT answers received relatively positive evaluations, with median scores ranging between 4 and 5, from both groups and regardless of the type of question. However, there was only moderate agreement between raters, especially in the group of professionals. Moreover, readability levels were poor for all examined answers. Discussion: ChatGPT provided relatively accurate and relevant answers, with some variability in the ratings given by the group of professionals, suggesting that the degree of literacy about ET influenced the ratings and, indirectly, that the quality of information provided in clinical practice is also variable. Moreover, the readability of the answers provided by ChatGPT was found to be poor. LLMs will likely play a significant role in the future; therefore, health-related content generated by these tools should be monitored.
Pages: 1-10
Page count: 10
Related papers
43 in total
  • [1] Essential Tremor as a "Waste Basket" Diagnosis: Diagnosing Essential Tremor Remains a Challenge
    Amlang, Christian J.
    Diaz, Daniel Trujillo
    Louis, Elan D.
    [J]. FRONTIERS IN NEUROLOGY, 2020, 11
  • [2] [Anonymous], 2013, U.S.
  • [3] Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum
    Ayers, John W.
    Poliak, Adam
    Dredze, Mark
    Leas, Eric C.
    Zhu, Zechariah
    Kelley, Jessica B.
    Faix, Dennis J.
    Goodman, Aaron M.
    Longhurst, Christopher A.
    Hogarth, Michael
    Smith, Davey M.
    [J]. JAMA INTERNAL MEDICINE, 2023, 183 (06) : 589 - 596
  • [4] Consensus Statement on the Classification of Tremors. From the Task Force on Tremor of the International Parkinson and Movement Disorder Society
    Bhatia, Kailash P.
    Bain, Peter
    Bajaj, Nin
    Elble, Rodger J.
    Hallett, Mark
    Louis, Elan D.
    Raethjen, Jan
    Stamelou, Maria
    Testa, Claudia M.
    Deuschl, Guenther
    [J]. MOVEMENT DISORDERS, 2018, 33 (01) : 75 - 87
  • [5] Why do people google movement disorders? An infodemiological study of information seeking behaviors
    Brigo, Francesco
    Erro, Roberto
    [J]. NEUROLOGICAL SCIENCES, 2016, 37 (05) : 781 - 787
  • [6] The readability of the English Wikipedia article on Parkinson's disease
    Brigo, Francesco
    Erro, Roberto
    [J]. NEUROLOGICAL SCIENCES, 2015, 36 (06) : 1045 - 1046
  • [7] Caylor J.S., 1973, Methodologies for determining reading requirements of military occupational specialities
  • [8] The future landscape of large language models in medicine
    Clusmann, Jan
    Kolbinger, Fiona R.
    Muti, Hannah Sophie
    Carrero, Zunamys I.
    Eckardt, Jan-Niklas
    Laleh, Narmin Ghaffari
    Loeffler, Chiara Maria Lavinia
    Schwarzkopf, Sophie-Caroline
    Unger, Michaela
    Veldhuizen, Gregory P.
    Wagner, Sophia J.
    Kather, Jakob Nikolas
    [J]. COMMUNICATIONS MEDICINE, 2023, 3 (01):
  • [9] Computer Readability Formula Designed for Machine Scoring
    Coleman, M.
    Liau, T. L.
    [J]. JOURNAL OF APPLIED PSYCHOLOGY, 1975, 60 (02) : 283 - 284
  • [10] Can Patients Trust Online Health Information? A Meta-narrative Systematic Review Addressing the Quality of Health Information on the Internet
    Daraz, Lubna
    Morrow, Allison S.
    Ponce, Oscar J.
    Beuschel, Bradley
    Farah, Magdoleen H.
    Katabi, Abdulrahman
    Alsawas, Mouaz
    Majzoub, Abdul M.
    Benkhadra, Raed
    Seisa, Mohamed O.
    Ding, Jingyi
    Prokop, Larry
    Murad, M. Hassan
    [J]. JOURNAL OF GENERAL INTERNAL MEDICINE, 2019, 34 (09) : 1884 - 1891