Reliability and Usefulness of ChatGPT for Inflammatory Bowel Diseases: An Analysis for Patients and Healthcare Professionals

Cited by: 23

Authors:

Cankurtaran, Rasim Eren ^{[1]}

Polat, Yunus Halil ^{[2]}

Aydemir, Neslihan Gunes ^{[3]}

Umay, Ebru ^{[4]}

Yurekli, Oyku Tayfur ^{[5]}

Affiliations:

[1] Ankara Etlik City Hosp, Dept Perinatol, Ankara, Turkiye

[2] Ankara Numune Training & Res Hosp, Dept Gastroenterol, Ankara, Turkiye

[3] Akdeniz Univ, Fac Med, Dept Gastroenterol, Antalya, Turkiye

[4] Univ Hlth Sci, Ankara Etlik City Hosp, Phys Med & Rehabil, Ankara, Turkiye

[5] Ankara Yildirim Beyazit Univ, Fac Med, Dept Gastroenterol, Ankara, Turkiye

Source:

CUREUS JOURNAL OF MEDICAL SCIENCE | 2023 / Vol. 15 / Issue 10

Keywords:

ulcerative colitis (UC); Crohn's disease (CD); healthcare research; artificial intelligence (AI); inflammatory bowel diseases (IBD); large language model; ChatGPT

DOI:

10.7759/cureus.46736

Chinese Library Classification (CLC) number:

R5 [Internal Medicine]

Discipline classification code:

1002; 100201

Abstract:

Aim: We aimed to evaluate the performance of Chat Generative Pre-trained Transformer (ChatGPT) in the context of inflammatory bowel disease (IBD), which is expected to become an increasingly significant health issue. A further objective was to assess whether ChatGPT is a reliable and useful resource for both patients and healthcare professionals.

Methods: For this study, 20 specific questions were identified for each of the two main components of IBD, Crohn's disease (CD) and ulcerative colitis (UC). The questions were divided into two sets: one set contained questions directed at healthcare professionals, while the second set contained questions directed toward patients. The responses were evaluated with seven-point Likert-type reliability and usefulness scales.

Results: The distribution of the reliability and usefulness scores was calculated for four groups (two diseases and two question sources) by averaging the mean scores from both raters. The highest scores in both reliability and usefulness were obtained from professional sources (5.00 ± 1.21 and 5.15 ± 1.08, respectively). The ranking in terms of reliability and usefulness, respectively, was as follows: CD questions (4.70 ± 1.26 and 4.75 ± 1.06) and UC questions (4.40 ± 1.21 and 4.55 ± 1.31). The reliability scores of the answers for the professionals were significantly higher than those for the patients (both raters, p = 0.032).

Conclusion: Despite its capacity for reliability and usefulness in the context of IBD, ChatGPT still has some limitations and deficiencies. If developers correct these deficiencies and enhance ChatGPT with more detailed and up-to-date information, it could become a significant source of information for both patients and medical professionals.


Pages: 21
