Leveraging artificial intelligence to summarize abstracts in lay language for increasing research accessibility and transparency

被引:6
|
作者
Shyr, Cathy [1 ]
Grout, Randall W. [2 ,3 ]
Kennedy, Nan [4 ]
Akdas, Yasemin [5 ]
Tischbein, Maeve [4 ]
Milford, Joshua [4 ]
Tan, Jason [4 ]
Quarles, Kaysi [4 ]
Edwards, Terri L. [4 ]
Novak, Laurie L. [1 ]
White, Jules [6 ]
Wilkins, Consuelo H. [7 ]
Harris, Paul A. [1 ,8 ,9 ]
机构
[1] Vanderbilt Univ, Dept Biomed Informat, Med Ctr, 2525 West End Ave, Nashville, TN 37203 USA
[2] Indiana Univ Sch Med, Dept Pediat, Indianapolis, IN 46202 USA
[3] Regenstrief Inst Inc, Indianapolis, IN 46202 USA
[4] Vanderbilt Univ, Vanderbilt Inst Clin & Translat Res, Med Ctr, Nashville, TN 37203 USA
[5] Vanderbilt Univ, Div Emergency Med Res, Med Ctr, Nashville, TN 37232 USA
[6] Vanderbilt Univ, Dept Elect Engn & Comp Sci, Nashville, TN 37240 USA
[7] Vanderbilt Univ, Dept Med, Med Ctr, Nashville, TN 37232 USA
[8] Vanderbilt Univ, Dept Biostat, Med Ctr, Nashville, TN 37203 USA
[9] Vanderbilt Univ, Dept Biomed Engn, Nashville, TN 37240 USA
关键词
artificial intelligence; large language model; text summarization; ResearchMatch; return of study results; PARTICIPANTS; COMMUNICATION; ATTITUDES; TEXT;
D O I
10.1093/jamia/ocae186
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective Returning aggregate study results is an important ethical responsibility to promote trust and inform decision making, but the practice of providing results to a lay audience is not widely adopted. Barriers include significant cost and time required to develop lay summaries and scarce infrastructure necessary for returning them to the public. Our study aims to generate, evaluate, and implement ChatGPT 4 lay summaries of scientific abstracts on a national clinical study recruitment platform, ResearchMatch, to facilitate timely and cost-effective return of study results at scale.Materials and Methods We engineered prompts to summarize abstracts at a literacy level accessible to the public, prioritizing succinctness, clarity, and practical relevance. Researchers and volunteers assessed ChatGPT-generated lay summaries across five dimensions: accuracy, relevance, accessibility, transparency, and harmfulness. We used precision analysis and adaptive random sampling to determine the optimal number of summaries for evaluation, ensuring high statistical precision.Results ChatGPT achieved 95.9% (95% CI, 92.1-97.9) accuracy and 96.2% (92.4-98.1) relevance across 192 summary sentences from 33 abstracts based on researcher review. 85.3% (69.9-93.6) of 34 volunteers perceived ChatGPT-generated summaries as more accessible and 73.5% (56.9-85.4) more transparent than the original abstract. None of the summaries were deemed harmful. We expanded ResearchMatch's technical infrastructure to automatically generate and display lay summaries for over 750 published studies that resulted from the platform's recruitment mechanism.Discussion and Conclusion Implementing AI-generated lay summaries on ResearchMatch demonstrates the potential of a scalable framework generalizable to broader platforms for enhancing research accessibility and transparency.
引用
收藏
页码:2294 / 2303
页数:10
相关论文
共 45 条
  • [41] ChatGPT and a new academic reality: Artificial Intelligence-written research papers and the ethics of the large language models in scholarly publishing
    Lund, Brady D.
    Wang, Ting
    Mannuru, Nishith Reddy
    Nie, Bing
    Shimray, Somipam
    Wang, Ziang
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2023, 74 (05) : 570 - 581
  • [42] A SYSTEMATIC REVIEW OF RESEARCH ON THE USE OF ARTIFICIAL INTELLIGENCE IN ENGLISH LANGUAGE TEACHING AND LEARNING (2015-2021): WHAT ARE THE CURRENT EFFECTS?
    Sharadgah, Talha A.
    Sa'di, Rami A.
    JOURNAL OF INFORMATION TECHNOLOGY EDUCATION-RESEARCH, 2022, 21 : 337 - 377
  • [43] Using natural language processing to support peer-feedback in the age of artificial intelligence: A cross-disciplinary framework and a research agenda
    Bauer, Elisabeth
    Greisel, Martin
    Kuznetsov, Ilia
    Berndt, Markus
    Kollar, Ingo
    Dresel, Markus
    Fischer, Martin R. R.
    Fischer, Frank
    BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2023, 54 (05) : 1222 - 1245
  • [44] Evaluation of the impact of large language learning models on articles submitted to Orthopaedics & Traumatology: Surgery & Research (OTSR): A significant increase in the use of artificial intelligence in 2023
    Maroteau, Gaelle
    An, Jae-Sung
    Murgier, Jerome
    Hulet, Christophe
    Ollivier, Matthieu
    Ferreira, Alexandre
    ORTHOPAEDICS & TRAUMATOLOGY-SURGERY & RESEARCH, 2023, 109 (08)
  • [45] Enhancing Care for Older Adults and Dementia Patients With Large Language Models: Proceedings of the National Institute on Aging-Artificial Intelligence & Technology Collaboratory for Aging Research Symposium
    Abadir, Peter M.
    Battle, Alexis
    Walston, Jeremy D.
    Chellappa, Rama
    JOURNALS OF GERONTOLOGY SERIES A-BIOLOGICAL SCIENCES AND MEDICAL SCIENCES, 2024, 79 (09):