Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data

Cited by: 0

Authors:

Chen, Yuhao [1]

Wang, Zhimu [1]

Zulkernine, Farhana [1]

Affiliations:

[1] Queens Univ, Sch Comp, Kingston, ON, Canada

Source:

2024 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH, ICDH 2024 | 2024

Funding:

Natural Sciences and Engineering Research Council of Canada;

Keywords:

Biomedical summarization; Large Language Model; Generative Model;

DOI:

10.1109/ICDH62654.2024.00030

Chinese Library Classification number:

TP39 [Computer applications];

Discipline classification codes:

081203 ; 0835 ;

Abstract:

Unstructured text in medical notes and dialogues contains rich information. Recent advancements in Large Language Models (LLMs) have demonstrated superior performance in question answering and summarization tasks on unstructured text data, outperforming traditional text analysis approaches. However, there is a lack of scientific studies in the literature that methodically evaluate and report on the performance of different LLMs, specifically for domain-specific data such as medical chart notes. We propose an evaluation approach to analyze the performance of open-source LLMs such as Llama2 and Mistral for medical summarization tasks, using GPT-4 as an assessor. Our innovative approach to quantitative evaluation of LLMs can enable quality control, support the selection of effective LLMs for specific tasks, and advance knowledge discovery in digital health.

Pages: 126-128

Page count: 3
