Potential of Large Language Models in Health Care: Delphi Study

被引:37
作者
Denecke, Kerstin [1 ]
May, Richard [2 ]
Romero, Octavio Rivera [3 ,4 ]
机构
[1] Bern Univ Appl Sci, Quallgasse 21, CH-2502 Biel, Switzerland
[2] Harz Univ Appl Sci, Wernigerode, Germany
[3] Univ Seville, Inst Ingn Informat I3US, Seville, Spain
[4] Univ Seville, Dept Elect Technol, Seville, Spain
关键词
large language models; LLMs; health care; Delphi study; natural language processing; NLP; artificial intelligence; language model; Delphi; future; innovation; interview; interviews; informatics; experience; experiences; attitude; attitudes; opinion; perception; perceptions; perspective; perspectives; implementation; ARTIFICIAL-INTELLIGENCE; EDUCATION; FUTURE; SYSTEM;
D O I
10.2196/52399
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: A large language model (LLM) is a machine learning model inferred from text data that captures subtle patterns of language use in context. Modern LLMs are based on neural network architectures that incorporate transformer methods. They allow the model to relate words together through attention to multiple words in a text sequence. LLMs have been shown to be highly effective for a range of tasks in natural language processing (NLP), including classification and information extraction tasks and generative applications. Objective: The aim of this adapted Delphi study was to collect researchers' opinions on how LLMs might influence health care and on the strengths, weaknesses, opportunities, and threats of LLM use in health care. Methods: We invited researchers in the fields of health informatics, nursing informatics, and medical NLP to share their opinions on LLM use in health care. We started the first round with open questions based on our strengths, weaknesses, opportunities, and threats framework. In the second and third round, the participants scored these items. Results: The first, second, and third rounds had 28, 23, and 21 participants, respectively. Almost all participants (26/28, 93% in round 1 and 20/21, 95% in round 3) were affiliated with academic institutions. Agreement was reached on 103 items related to use cases, benefits, risks, reliability, adoption aspects, and the future of LLMs in health care. Participants offered several use cases, including supporting clinical tasks, documentation tasks, and medical research and education, and agreed that LLM-based systems will act as health assistants for patient education. The agreed-upon benefits included increased efficiency in data handling and extraction, improved automation of processes, improved quality of health care services and overall health outcomes, provision of personalized care, accelerated diagnosis and treatment processes, and improved interaction between patients and health care professionals. In total, 5 risks to health care in general were identified: cybersecurity breaches, the potential for patient misinformation, ethical concerns, the likelihood of biased decision-making, and the risk associated with inaccurate communication. Overconfidence in LLM-based systems was recognized as a risk to the medical profession. The 6 agreed-upon privacy risks included the use of unregulated cloud services that compromise data security, exposure of sensitive patient data, breaches of confidentiality, fraudulent use of information, vulnerabilities in data storage and communication, and inappropriate access or use of patient data. Conclusions: Future research related to LLMs should not only focus on testing their possibilities for NLP-related tasks but also consider the workflows the models could contribute to and the requirements regarding quality, integration, and regulations needed for successful implementation in practice.
引用
收藏
页数:21
相关论文
共 58 条
[1]   Virtuous and vicious cycles on the road towards international supply chain management [J].
Akkermans, H ;
Bogerd, P ;
Vos, B .
INTERNATIONAL JOURNAL OF OPERATIONS & PRODUCTION MANAGEMENT, 1999, 19 (5-6) :565-581
[2]   The impact of ERP on supply chain management:: Exploratory findings from a European Delphi study [J].
Akkermans, HA ;
Bogerd, P ;
Yücesan, E ;
van Wassenhove, LN .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2003, 146 (02) :284-301
[3]   The promise of artificial intelligence: a review of the opportunities and challenges of artificial intelligence in healthcare [J].
Aung, Yuri Y. M. ;
Wong, David C. S. ;
Ting, Daniel S. W. .
BRITISH MEDICAL BULLETIN, 2021, 139 (01) :4-15
[4]   When Artificial Intelligence Models Surpass Physician Performance: Medical Malpractice Liability in an Era of Advanced Artificial Intelligence [J].
Banja, John D. ;
Hollstein, Rolf Dieter ;
Bruno, Michael A. .
JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2022, 19 (07) :816-820
[5]   Using and Reporting the Delphi Method for Selecting Healthcare Quality Indicators: A Systematic Review [J].
Boulkedid, Rym ;
Abdoul, Hendy ;
Loustau, Marine ;
Sibony, Olivier ;
Alberti, Corinne .
PLOS ONE, 2011, 6 (06)
[6]  
Braun V., 2006, Qualitative Research in Psychology, V3, P77, DOI [DOI 10.1080/14780887.2020.1769238, DOI 10.1191/1478088706QP063OA]
[7]  
Capurro D, 2022, The digital health validitron
[8]   Impact of ChatGPT on medical chatbots as a disruptive technology [J].
Chow, James C. L. ;
Sanders, Leslie ;
Li, Kay .
FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 6
[9]   Quality of information and appropriateness of ChatGPT outputs for urology patients [J].
Cocci, Andrea ;
Pezzoli, Marta ;
Lo Re, Mattia ;
Russo, Giorgio Ivan ;
Asmundo, Maria Giovanna ;
Fode, Mikkel ;
Cacciamani, Giovanni ;
Cimino, Sebastiano ;
Minervini, Andrea ;
Durukan, Emil .
PROSTATE CANCER AND PROSTATIC DISEASES, 2024, 27 (01) :103-108
[10]   Ethical implications of AI in robotic surgical training: A Delphi consensus statement [J].
Collins, Justin W. ;
Marcus, Hani J. ;
Ghazi, Ahmed ;
Sridhar, Ashwin ;
Hashimoto, Daniel ;
Hager, Gregory ;
Arezzo, Alberto ;
Jannin, Pierre ;
Maier-Hein, Lena ;
Marz, Keno ;
Valdastri, Pietro ;
Mori, Kensaku ;
Elson, Daniel ;
Giannarou, Stamatia ;
Slack, Mark ;
Hares, Luke ;
Beaulieu, Yanick ;
Levy, Jeff ;
Laplante, Guy ;
Ramadorai, Arvind ;
Jarc, Anthony ;
Andrews, Ben ;
Garcia, Pablo ;
Neemuchwala, Huzefa ;
Andrusaite, Alina ;
Kimpe, Tom ;
Hawkes, David ;
Kelly, John D. ;
Stoyanov, Danail .
EUROPEAN UROLOGY FOCUS, 2022, 8 (02) :613-622