Artificial Intelligence-Generated Draft Replies to Patient Inbox Messages

被引:81
作者
Garcia, Patricia [1 ,3 ]
Ma, Stephen P. [2 ,3 ]
Shah, Shreya [3 ,4 ]
Smith, Margaret [4 ]
Jeong, Yejin [4 ]
Devon-Sand, Anna [4 ]
Tai-Seale, Ming [5 ]
Takazawa, Kevin [6 ]
Clutter, Danyelle [6 ]
Vogt, Kyle [6 ]
Lugtu, Carlene [7 ]
Rojo, Matthew [6 ]
Lin, Steven [3 ,4 ]
Shanafelt, Tait [3 ,8 ]
Pfeffer, Michael A. [3 ,6 ]
Sharp, Christopher [3 ]
机构
[1] Stanford Univ, Dept Med, Sch Med, 430 Broadway St,3rd Floor, Redwood City, CA 94063 USA
[2] Stanford Univ, Dept Med, Sch Med, 453 Quarry Rd, Palo Alto, CA 94304 USA
[3] Stanford Univ, Sch Med, Dept Med, Stanford, CA USA
[4] Stanford Univ, Sch Med, Stanford Healthcare AI Appl Res Team, Div Primary Care & Populat Hlth, Stanford, CA USA
[5] Univ Calif San Diego, Sch Med, Dept Family Med, La Jolla, CA USA
[6] Stanford Med, Technol & Digital Solut, Stanford, CA USA
[7] Stanford Healthcare, Nursing Informat & Innovat, Stanford, CA USA
[8] Stanford Univ, WellMD Ctr, Sch Med, Stanford, CA USA
关键词
PHYSICIANS; IMPACT;
D O I
10.1001/jamanetworkopen.2024.3201
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Importance The emergence and promise of generative artificial intelligence (AI) represent a turning point for health care. Rigorous evaluation of generative AI deployment in clinical practice is needed to inform strategic decision-making. Objective To evaluate the implementation of a large language model used to draft responses to patient messages in the electronic inbox. Design, Setting, and ParticipantsA 5-week, prospective, single-group quality improvement study was conducted from July 10 through August 13, 2023, at a single academic medical center (Stanford Health Care). All attending physicians, advanced practice practitioners, clinic nurses, and clinical pharmacists from the Divisions of Primary Care and Gastroenterology and Hepatology were enrolled in the pilot. InterventionDraft replies to patient portal messages generated by a Health Insurance Portability and Accountability Act-compliant electronic health record-integrated large language model. Main Outcomes and MeasuresThe primary outcome was AI-generated draft reply utilization as a percentage of total patient message replies. Secondary outcomes included changes in time measures and clinician experience as assessed by survey. ResultsA total of 197 clinicians were enrolled in the pilot; 35 clinicians who were prepilot beta users, out of office, or not tied to a specific ambulatory clinic were excluded, leaving 162 clinicians included in the analysis. The survey analysis cohort consisted of 73 participants (45.1%) who completed both the presurvey and postsurvey. In gastroenterology and hepatology, there were 58 physicians and APPs and 10 nurses. In primary care, there were 83 physicians and APPs, 4 nurses, and 8 clinical pharmacists. The mean AI-generated draft response utilization rate across clinicians was 20%. There was no change in reply action time, write time, or read time between the prepilot and pilot periods. There were statistically significant reductions in the 4-item physician task load score derivative (mean [SD], 61.31 [17.23] presurvey vs 47.26 [17.11] postsurvey; paired difference, -13.87; 95% CI, -17.38 to -9.50; P < .001) and work exhaustion scores (mean [SD], 1.95 [0.79] presurvey vs 1.62 [0.68] postsurvey; paired difference, -0.33; 95% CI, -0.50 to -0.17; P < .001). Conclusions and RelevanceIn this quality improvement study of an early implementation of generative AI, there was notable adoption, usability, and improvement in assessments of burden and burnout. There was no improvement in time. Further code-to-bedside testing is needed to guide future development and organizational strategy.
引用
收藏
页数:13
相关论文
共 26 条
[1]   Physicians' electronic inbox work patterns and factors associated with high inbox work duration [J].
Akbar, Fatema ;
Mark, Gloria ;
Warton, E. Margaret ;
Reed, Mary E. ;
Prausnitz, Stephanie ;
East, Jeffrey A. ;
Moeller, Mark F. ;
Lieu, Tracy A. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (05) :923-930
[2]   Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum [J].
Ayers, John W. ;
Poliak, Adam ;
Dredze, Mark ;
Leas, Eric C. ;
Zhu, Zechariah ;
Kelley, Jessica B. ;
Faix, Dennis J. ;
Goodman, Aaron M. ;
Longhurst, Christopher A. ;
Hogarth, Michael ;
Smith, Davey M. .
JAMA INTERNAL MEDICINE, 2023, 183 (06) :589-596
[3]   Establishing Crosswalks Between Common Measures of Burnout in US Physicians [J].
Brady, Keri J. S. ;
Ni, Pengsheng ;
Carlasare, Lindsey ;
Shanafelt, Tait D. ;
Sinsky, Christine A. ;
Linzer, Mark ;
Stillman, Martin ;
Trockel, Mickey T. .
JOURNAL OF GENERAL INTERNAL MEDICINE, 2022, 37 (04) :777-784
[4]   Implementation of Prediction Models in the Emergency Department from an Implementation Science Perspective-Determinants, Outcomes, and Real-World Impact: A Scoping Review [J].
Chan, Sze Ling ;
Lee, Jin Wee ;
Ong, Marcus Eng Hock ;
Siddiqui, Fahad Javaid ;
Graves, Nicholas ;
Ho, Andrew Fu Wah ;
Liu, Nan .
ANNALS OF EMERGENCY MEDICINE, 2023, 82 (01) :22-36
[5]  
Fleming SL, 2023, Arxiv, DOI arXiv:2308.14089
[6]   In-Basket Reduction: A Multiyear Pragmatic Approach to Lessen the Work Burden of Primary Care Physicians [J].
Fogg, Jane F. ;
Sinsky, Christine A. .
NEJM CATALYST INNOVATIONS IN CARE DELIVERY, 2023, 4 (05)
[7]   Estimating institutional physician turnover attributable to self-reported burnout and associated financial burden: a case study [J].
Hamidi, Maryam S. ;
Bohman, Bryan ;
Sandborg, Christy ;
Smith-Coggins, Rebecca ;
de Vries, Patty ;
Albert, Marisa S. ;
Murphy, Mary Lou ;
Welle, Dana ;
Trockel, Mickey T. .
BMC HEALTH SERVICES RESEARCH, 2018, 18
[8]   Physician Task Load and the Risk of Burnout Among US Physicians in a National Survey [J].
Harry, Elizabeth ;
Sinsky, Christine ;
Dyrbye, Lotte N. ;
Makowski, Maryam S. ;
Trockel, Mickey ;
Tutty, Michael ;
Carlasare, Lindsey E. ;
West, Colin P. ;
Shanafelt, Tait D. .
JOINT COMMISSION JOURNAL ON QUALITY AND PATIENT SAFETY, 2021, 47 (02) :76-85
[9]   Assessing the impact of the COVID-19 pandemic on clinician ambulatory electronic health record use [J].
Holmgren, A. Jay ;
Downing, N. Lance ;
Tang, Mitchell ;
Sharp, Christopher ;
Longhurst, Christopher ;
Huckman, Robert S. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (03) :453-460
[10]   Integrating Rapid Diabetes Screening Into a Latinx Focused Community-Based Low-Barrier COVID-19 Testing Program [J].
Kerkhoff, Andrew D. ;
Rojas, Susana ;
Black, Douglas ;
Ribeiro, Salustiano ;
Rojas, Susy ;
Valencia, Rebecca ;
Lemus, Jonathan ;
Payan, Joselin ;
Schrom, John ;
Jones, Diane ;
Manganelli, Simone ;
Bandi, Shalom ;
Chamie, Gabriel ;
Tulier-Laiwa, Valerie ;
Petersen, Maya ;
Havlir, Diane ;
Marquez, Carina .
JAMA NETWORK OPEN, 2022, 5 (05) :E2214163