Evaluating the Utility of a Large Language Model in Answering Common Patients' Gastrointestinal Health-Related Questions: Are We There Yet?

被引:58
作者
Lahat, Adi [1 ]
Shachar, Eyal [1 ]
Avidan, Benjamin [1 ]
Glicksberg, Benjamin [2 ]
Klang, Eyal [3 ]
机构
[1] Tel Aviv Univ, Chaim Sheba Med Ctr, Dept Gastroenterol, IL-69978 Tel Aviv, Israel
[2] Icahn Sch Med Mt Sinai, Mt Sinai Clin Intelligence Ctr, New York, NY 10029 USA
[3] Tel Aviv Univ, ARC Innovat Ctr, Chaim Sheba Med Ctr, Sami Sagol AI Hub, IL-69978 Tel Aviv, Israel
关键词
OpenAI's ChatGPT; chatbot; natural language processing (NLP); medical information; gastroenterology; patients' questions;
D O I
10.3390/diagnostics13111950
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background and aims: Patients frequently have concerns about their disease and find it challenging to obtain accurate Information. OpenAI's ChatGPT chatbot (ChatGPT) is a new large language model developed to provide answers to a wide range of questions in various fields. Our aim is to evaluate the performance of ChatGPT in answering patients' questions regarding gastrointestinal health. Methods: To evaluate the performance of ChatGPT in answering patients' questions, we used a representative sample of 110 real-life questions. The answers provided by ChatGPT were rated in consensus by three experienced gastroenterologists. The accuracy, clarity, and efficacy of the answers provided by ChatGPT were assessed. Results: ChatGPT was able to provide accurate and clear answers to patients' questions in some cases, but not in others. For questions about treatments, the average accuracy, clarity, and efficacy scores (1 to 5) were 3.9 +/- 0.8, 3.9 +/- 0.9, and 3.3 +/- 0.9, respectively. For symptoms questions, the average accuracy, clarity, and efficacy scores were 3.4 +/- 0.8, 3.7 +/- 0.7, and 3.2 +/- 0.7, respectively. For diagnostic test questions, the average accuracy, clarity, and efficacy scores were 3.7 +/- 1.7, 3.7 +/- 1.8, and 3.5 +/- 1.7, respectively. Conclusions: While ChatGPT has potential as a source of information, further development is needed. The quality of information is contingent upon the quality of the online information provided. These findings may be useful for healthcare providers and patients alike in understanding the capabilities and limitations of ChatGPT.
引用
收藏
页数:10
相关论文
共 22 条
  • [1] Bloomberg, US
  • [2] Eysenbach Gunther, 2023, JMIR Med Educ, V9, pe46885, DOI 10.2196/46885
  • [3] Artificial intelligence-based text generators in hepatology: ChatGPT is just the beginning
    Ge, Jin
    Lai, Jennifer C.
    [J]. HEPATOLOGY COMMUNICATIONS, 2023, 7 (04)
  • [4] Hirosawa Takanobu, 2023, Int J Environ Res Public Health, V20, DOI 10.3390/ijerph20043378
  • [5] Holtedahl K, 2017, HELIYON, V3, DOI 10.1016/j.heliyon.2017.e00328
  • [6] Johnson Douglas, 2023, Res Sq, DOI 10.21203/rs.3.rs-2566942/v1
  • [7] Using ChatGPT to evaluate cancer myths and misconceptions: artificial intelligence and cancer information
    Johnson, Skyler B.
    King, Andy J.
    Warner, Echo L.
    Aneja, Sanjay
    Kann, Benjamin H.
    Bylund, Carma L.
    [J]. JNCI CANCER SPECTRUM, 2023, 7 (02)
  • [8] Evaluating the use of large language model in identifying top research questions in gastroenterology
    Lahat, Adi
    Shachar, Eyal
    Avidan, Benjamin
    Shatz, Zina
    Glicksberg, Benjamin S.
    Klang, Eyal
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [9] Can advanced technologies help address the global increase in demand for specialized medical care and improve telehealth services?
    Lahat, Adi
    Klang, Eyal
    [J]. JOURNAL OF TELEMEDICINE AND TELECARE, 2024, 30 (09) : 1516 - 1517
  • [10] Medical Specialty Recommendations by an Artificial Intelligence Chatbot on a Smartphone: Development and Deployment
    Lee, Hyeonhoon
    Kang, Jaehyun
    Yeo, Jonghyeon
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (05)