Enhancing Large Language Models for Clinical Decision Support by Incorporating Clinical Practice Guidelines

被引:0
作者
Oniani, David [1 ]
Wu, Xizhi [1 ]
Visweswaran, Shyam [1 ]
Kapoor, Sumit [1 ]
Kooragayalu, Shravan [2 ]
Polanska, Katelyn [1 ]
Wang, Yanshan [1 ]
机构
[1] Univ Pittsburgh, Pittsburgh, PA 15260 USA
[2] UPMC Western Maryland, Cumberland, MD 21502 USA
来源
2024 IEEE 12TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS, ICHI 2024 | 2024年
关键词
artificial intelligence; large language models; clinical decision support; prompting; generative ai;
D O I
10.1109/ICHI61247.2024.00111
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large Language Models (LLMs), enhanced with Clinical Practice Guidelines (CPGs), can significantly improve Clinical Decision Support (CDS). However, approaches for incorporating CPGs into LLMs are not well studied. In this study, we develop three distinct methods for incorporating CPGs into LLMs: Binary Decision Tree (BDT), Program-Aided Graph Construction (PAGC), and Chain-of-Thought-Few-Shot Prompting (CoT-FSP), and focus on CDS for COVID-19 outpatient treatment as the case study. Zero-Shot Prompting (ZSP) is our baseline method. To evaluate the effectiveness of the proposed methods, we create a set of synthetic patient descriptions and conduct both automatic and human evaluation of the responses generated by four LLMs: GPT-4, GPT-3.5 Turbo, LLaMA, and PaLM 2. All four LLMs exhibit improved performance when enhanced with CPGs compared to the baseline ZSP. BDT outperformed both CoT-FSP and PAGC in automatic evaluation. All of the proposed methods demonstrate high performance in human evaluation. LLMs enhanced with CPGs outperform plain LLMs with ZSP in providing accurate recommendations for COVID-19 outpatient treatment, highlighting the potential for broader applications beyond the case study.
引用
收藏
页码:694 / 702
页数:9
相关论文
共 38 条
[1]  
Akkaya I., 2024, Gpt-4 technical report
[2]  
Anil R., 2023, Palm 2 technical report
[3]   Leveraging Large Language Models for Decision Support in Personalized Oncology [J].
Benary, Manuela ;
Wang, Xing David ;
Schmidt, Max ;
Soll, Dominik ;
Hilfenhaus, Georg ;
Nassir, Mani ;
Sigler, Christian ;
Knoedler, Maren ;
Keller, Ulrich ;
Beule, Dieter ;
Keilholz, Ulrich ;
Leser, Ulf ;
Rieke, Damian T. .
JAMA NETWORK OPEN, 2023, 6 (11) :E2343689
[4]   Autonomous chemical research with large language models [J].
Boiko, Daniil A. ;
Macknight, Robert ;
Kline, Ben ;
Gomes, Gabe .
NATURE, 2023, 624 (7992) :570-+
[5]  
Brown TB, 2020, ADV NEUR IN, V33
[6]   ChatGPT and large language models in orthopedics: from education and surgery to research [J].
Chatterjee, Srijan ;
Bhattacharya, Manojit ;
Pal, Soumen ;
Lee, Sang-Soo ;
Chakraborty, Chiranjib .
JOURNAL OF EXPERIMENTAL ORTHOPAEDICS, 2023, 10 (01)
[7]   Challenges and opportunities for advancing patient-centered clinical decision support: findings from a horizon scan [J].
Dullabh, Prashila ;
Sandberg, Shana F. ;
Heaney-Huls, Krysta ;
Hovey, Lauren S. ;
Lobach, David F. ;
Boxwala, Aziz ;
Desai, Priyanka J. ;
Berliner, Elise ;
Dymek, Chris ;
Harrison, Michael, I ;
Swiger, James ;
Sittig, Dean F. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (07) :1233-1243
[8]   The technical landscape for patient-centered CDS: progress, gaps, and challenges [J].
Dullabh, Prashila ;
Heaney-Huls, Krysta ;
Lobach, David F. ;
Hovey, Lauren S. ;
Sandberg, Shana F. ;
Desai, Priyanka J. ;
Lomotan, Edwin ;
Swiger, James ;
Harrison, Michael, I ;
Dymek, Chris ;
Sittig, Dean F. ;
Boxwala, Aziz .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (06) :1101-1105
[9]  
Gao Luyu, P MACHINE LEARNING R
[10]  
Gao TY, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, P3816