Assessing the Accuracy of Responses by the Language Model ChatGPT to Questions Regarding Bariatric Surgery

Cited by: 139
Authors
Samaan, Jamil S. [1]
Yeo, Yee Hui [1]
Rajeev, Nithya [2]
Hawley, Lauren [2]
Abel, Stuart [2]
Ng, Wee Han [3]
Srinivasan, Nitin [2]
Park, Justin [2]
Burch, Miguel [4]
Watson, Rabindra [1]
Liran, Omer [5, 6]
Samakar, Kamran [2]
Affiliations
[1] Cedars Sinai Med Ctr, Karsh Div Gastroenterol & Hepatol, 8700 Beverly Blvd, Los Angeles, CA 90048 USA
[2] Keck Sch Med USC, Hlth Care Consultat Ctr, Dept Surg, Div Upper GI & Gen Surg, 1510 San Pablo St 514, Los Angeles, CA 90033 USA
[3] Univ Bristol, Bristol Med Sch, 5 Tyndall Ave, Bristol BS8 1UD, England
[4] Cedars Sinai Med Ctr, Dept Surg, 8700 Beverly Blvd, Los Angeles, CA 90048 USA
[5] Cedars Sinai Med Ctr, Dept Psychiat & Behav Sci, 8700 Beverly Blvd, Los Angeles, CA 90048 USA
[6] Cedars Sinai Med Ctr, Dept Med, Div Hlth Serv Res, 8700 Beverly Blvd, Los Angeles, CA 90048 USA
Keywords
Artificial intelligence; ChatGPT; Language learning models; Bariatric surgery; Weight loss; Health literacy; Information
DOI
10.1007/s11695-023-06603-5
Chinese Library Classification number
R61 [Operative surgery]
Abstract
Purpose: ChatGPT is a large language model trained on a large dataset covering a broad range of topics, including the medical literature. We aim to examine its accuracy and reproducibility in answering patient questions regarding bariatric surgery.
Materials and methods: Questions were gathered from nationally regarded professional societies and health institutions as well as Facebook support groups. Board-certified bariatric surgeons graded the accuracy and reproducibility of responses. The grading scale included the following: (1) comprehensive, (2) correct but inadequate, (3) some correct and some incorrect, and (4) completely incorrect. Reproducibility was determined by asking the model each question twice and examining the difference in grading category between the two responses.
Results: In total, 151 questions related to bariatric surgery were included. The model provided "comprehensive" responses to 131/151 (86.8%) of questions. When examined by category, the model provided "comprehensive" responses to 93.8% of questions related to "efficacy, eligibility and procedure options"; 93.3% related to "preoperative preparation"; 85.3% related to "recovery, risks, and complications"; 88.2% related to "lifestyle changes"; and 66.7% related to "other". The model provided reproducible answers to 137/151 (90.7%) of questions.
Conclusion: The large language model ChatGPT often provided accurate and reproducible responses to common questions related to bariatric surgery. ChatGPT may serve as a helpful adjunct information resource for patients regarding bariatric surgery in addition to standard of care provided by licensed healthcare professionals. We encourage future studies to examine how to leverage this disruptive technology to improve patient outcomes and quality of life.
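The headline figures in the abstract are simple proportions, and the reproducibility check compares the grading category of two responses to the same question. The following is a minimal Python sketch of that arithmetic, not the authors' analysis code. The function names (`pct`, `summarize`) and the toy grade pairs are hypothetical, and the choice to count "comprehensive" from the first response of each pair is an assumption; the abstract does not say which response was used for the accuracy figure.

```python
# Grading scale as listed in the abstract (1 = best, 4 = worst).
GRADES = {
    1: "comprehensive",
    2: "correct but inadequate",
    3: "some correct and some incorrect",
    4: "completely incorrect",
}


def pct(count, total):
    """Return count/total as a percentage rounded to one decimal place."""
    return round(100 * count / total, 1)


def summarize(grade_pairs):
    """Summarize (first_grade, second_grade) pairs, one pair per question.

    Assumption: a question is 'reproducible' when both responses fall in
    the same grading category, and 'comprehensive' is counted from the
    first response only.
    """
    total = len(grade_pairs)
    comprehensive = sum(1 for first, _ in grade_pairs if first == 1)
    reproducible = sum(1 for first, second in grade_pairs if first == second)
    return {
        "total": total,
        "comprehensive_pct": pct(comprehensive, total),
        "reproducible_pct": pct(reproducible, total),
    }


# Check the proportions reported in the abstract.
reported_comprehensive = pct(131, 151)
reported_reproducible = pct(137, 151)

# Hypothetical toy data, not taken from the study.
toy_pairs = [(1, 1), (1, 2), (2, 2), (3, 3)]
toy_summary = summarize(toy_pairs)
```

With the reported counts, `pct(131, 151)` and `pct(137, 151)` agree with the 86.8% and 90.7% stated in the abstract.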
Pages: 1790-1796
Number of pages: 7