Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data

Authors
Chen, Yuhao [1]
Wang, Zhimu [1]
Zulkernine, Farhana [1]
Affiliations
[1] Queen's University, School of Computing, Kingston, ON, Canada
Source
2024 IEEE International Conference on Digital Health, ICDH 2024 | 2024
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
Biomedical summarization; Large Language Model; Generative Model
DOI
10.1109/ICDH62654.2024.00030
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Unstructured text in medical notes and dialogues contains rich information. Recent advancements in Large Language Models (LLMs) have demonstrated superior performance in question answering and summarization tasks on unstructured text data, outperforming traditional text analysis approaches. However, there is a lack of scientific studies in the literature that methodically evaluate and report on the performance of different LLMs, specifically for domain-specific data such as medical chart notes. We propose an evaluation approach to analyze the performance of open-source LLMs such as Llama2 and Mistral for medical summarization tasks, using GPT-4 as an assessor. Our innovative approach to quantitative evaluation of LLMs can enable quality control, support the selection of effective LLMs for specific tasks, and advance knowledge discovery in digital health.
Pages: 126-128
Page count: 3