A survey on biomedical automatic text summarization with large language models

被引：0

作者：

Huang, Zhenyu ^{[1
]}

Chen, Xianlai ^{[1
,2
]}

Wang, Yunbo ^{[1
,2
]}

Huang, Jincai ^{[1
,2
]}

Zhao, Xing ^{[1
]}

机构：

[1] Cent South Univ, Big Data Inst, Changsha 410083, Peoples R China

[2] Cent South Univ, Natl Engn Res Ctr Med Big Data Applicat Technol, Changsha 410083, Peoples R China

来源：

INFORMATION PROCESSING & MANAGEMENT | 2025年 / 62卷 / 05期

关键词：

Biomedical; Automatic text summarization; Large language models; Neural networks; Natural language processing; DOMAIN KNOWLEDGE; REPRESENTATION; INFORMATION; EXTRACTION; IMPACT; GPT-4; RISK;

D O I：

10.1016/j.ipm.2025.104216

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Automatic text summarization in the biomedical field can support efficient literature screening, medical knowledge management, and innovative medical research. In recent years, Large Language Models (LLMs), as a disruptive technology in natural language processing, have shown great potential for Biomedical Automatic Text Summarization (BATS). This technology helps to better understand the terminology of biomedical texts, track medical hotspots, and generate personalized diagnoses and treatment plans. This paper provides an in-depth discussion on the development of BATS, and the opportunities as well as challenges brought by applying LLMs to biomedical automatic text summarization. Firstly, the development of BATS is reviewed, where traditional text summarization, neural network-based summarization, and LLMs-based summarization are analyzed systematically. Meanwhile, the applications of various LLMs (e.g., BERT and GPT series) in three types of BATS are presented in detail, including extractive summarization, abstractive summarization, and hybrid summarization. Next, the relevant datasets are introduced, such as PubMed, COVID-19 and MIMIC-III. Then, traditional, emerging, and auxiliary metrics for evaluating the performance of BATS are shown, and the performance evaluation of different models is elaborated. Finally, the opportunities brought by applying LLMs to BATS are described, and the potential challenges along with the corresponding solutions are discussed.

引用

页数：44

共 310 条

[1] Large Language Models in Medical Education: Opportunities, Challenges, and Future Directions [J].

Abd-alrazaq, Alaa ;

AlSaad, Rawan ;

Alhuwail, Dari ;

Ahmed, Arfan ;

Healy, Padraig Mark ;

Latifi, Syed ;

Aziz, Sarah ;

Damseh, Rafat ;

Alrazak, Sadam Alabed ;

Sheikh, Javaid .

JMIR MEDICAL EDUCATION, 2023, 9

[2] Large Language Models and Artificial Intelligence: A Primer for Plastic Surgeons on the Demonstrated and Potential Applications, Promises, and Limitations of ChatGPT [J].

Abi-Rafeh, Jad ;

Xu, Hong Hao ;

Kazan, Roy ;

Tevlin, Ruth ;

Furnas, Heather .

AESTHETIC SURGERY JOURNAL, 2024, 44 (03) :329-343

[3] Clinical Context-Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation [J].

Afzal, Muhammad ;

Alam, Fakhare ;

Malik, Khalid Mahmood ;

Malik, Ghaus M. .

JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (10)

[4]

Agarwal Shashank, 2009, AMIA Annu Symp Proc, V2009, P6

[5]

Aharoni R, 2023, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, P3562

[6] De-identification of electronic health record using neural network [J].

Ahmed, Tanbir ;

Al Aziz, Md Momin ;

Mohammed, Noman .

SCIENTIFIC REPORTS, 2020, 10 (01)

[7] Outpatient health care utilization for sleep disorders in the Cerner Health Facts database [J].

Al-Shawwa, Baha ;

Glynn, Earl ;

Hoffman, Mark A. ;

Ehsan, Zarmina ;

Ingram, David G. .

JOURNAL OF CLINICAL SLEEP MEDICINE, 2021, 17 (02) :203-209

[8] Advances in diffusion models for image data augmentation: a review of methods, models, evaluation metrics and future research directions [J].

Alimisis, Panagiotis ;

Mademlis, Ioannis ;

Radoglou-Grammatikis, Panagiotis ;

Sarigiannidis, Panagiotis ;

Papadopoulos, Georgios Th. .

ARTIFICIAL INTELLIGENCE REVIEW, 2025, 58 (04)

[9]

Almansoori M, 2025, Arxiv, DOI [arXiv:2503.22678, 10.48550/arXiv.2503.22678, DOI 10.48550/ARXIV.2503.22678]

[10] Educational Videos Subtitles' Summarization Using Latent Dirichlet Allocation and Length Enhancement [J].

Alrumiah, Sarah S. ;

Al-Shargabi, Amal A. .

CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (03) :6205-6221

← 1 2 3 4 5 6 7 8 9 10 →