Improving Prediction of Complications Post-Proton Therapy in Lung Cancer Using Large Language Models and Meta-Analysis

被引:1
作者
Chao, Pei-Ju [1 ,2 ,3 ]
Chang, Chu-Ho [1 ]
Wu, Jyun-Jie [1 ]
Liu, Yen-Hsien [1 ]
Shiau, Junping [1 ]
Shih, Hsin-Hung [1 ]
Lin, Guang-Zhi [1 ]
Lee, Shen-Hao [1 ,2 ,3 ,4 ,5 ]
Lee, Tsair-Fwu [1 ,6 ,7 ,8 ]
机构
[1] Natl Kaohsiung Univ Sci & Technol, Med Phys & Informat Lab Elect Engn, 415 Jiangong Rd, Kaohsiung 807, Taiwan
[2] Kaohsiung Chang Gung Mem Hosp, Dept Radiat Oncol, Kaohsiung, Taiwan
[3] Chang Gung Univ, Coll Med, Kaohsiung, Taiwan
[4] Linkou Chang Gung Mem Hosp, Dept Radiat Oncol, Linkou, Taiwan
[5] Chang Gung Univ, Coll Med, Linkou, Taiwan
[6] Kaohsiung Med Univ, Grad Inst Clin Med, Kaohsiung, Taiwan
[7] Kaohsiung Med Univ, Dept Med Imaging & Radiol Sci, Kaohsiung, Taiwan
[8] Kaohsiung Med Univ, Coll Dent Med, Sch Dent, Kaohsiung, Taiwan
关键词
lung cancer; proton therapy; large language model; ChatGPT; meta-analysis; prediction model risk of bias assessment tool; RADIATION PNEUMONITIS; HIGH-RISK; BIAS; ESOPHAGITIS; PROBAST; TOOL;
D O I
10.1177/10732748241286749
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Purpose: This study enhances the efficiency of predicting complications in lung cancer patients receiving proton therapy by utilizing large language models (LLMs) and meta-analytical techniques for literature quality assessment. Materials and Methods: We integrated systematic reviews with LLM evaluations, sourcing studies from Web of Science, PubMed, and Scopus, managed via EndNote X20. Inclusion and exclusion criteria ensured literature relevance. Techniques included meta-analysis, heterogeneity assessment using Cochran's Q test and I2 statistics, and subgroup analyses for different complications. Quality and bias risk were assessed using the PROBAST tool and further analyzed with models such as ChatGPT-4, Llama2-13b, and Llama3-8b. Evaluation metrics included AUC, accuracy, precision, recall, F1 score, and time efficiency (WPM). Results: The meta-analysis revealed an overall effect size of 0.78 for model predictions, with high heterogeneity observed (I2 = 72.88%, P < 0.001). Subgroup analysis for radiation-induced esophagitis and pneumonitis revealed predictive effect sizes of 0.79 and 0.77, respectively, with a heterogeneity index (I2) of 0%, indicating that there were no significant differences among the models in predicting these specific complications. A literature assessment using LLMs demonstrated that ChatGPT-4 achieved the highest accuracy at 90%, significantly outperforming the Llama3 and Llama2 models, which had accuracies ranging from 44% to 62%. Additionally, LLM evaluations were conducted 3229 times faster than manual assessments were, markedly enhancing both efficiency and accuracy. The risk assessment results identified nine studies as high risk, three as low risk, and one as unknown, confirming the robustness of the ChatGPT-4 across various evaluation metrics. Conclusion: This study demonstrated that the integration of large language models with meta-analysis techniques can significantly increase the efficiency of literature evaluations and reduce the time required for assessments, confirming that there are no significant differences among models in predicting post proton therapy complications in lung cancer patients.
引用
收藏
页数:17
相关论文
共 56 条
[41]   Natural Language Processing in Radiology: A Systematic Review [J].
Pons, Ewoud ;
Braun, Loes M. M. ;
Hunink, M. G. Myriam ;
Kors, Jan A. .
RADIOLOGY, 2016, 279 (02) :329-343
[42]   A Review on Large Language Models: Architectures, Applications, Taxonomies, Open Issues and Challenges [J].
Raiaan, Mohaimenul Azam Khan ;
Mukta, Md. Saddam Hossain ;
Fatema, Kaniz ;
Fahad, Nur Mohammad ;
Sakib, Sadman ;
Mim, Most Marufatul Jannat ;
Ahmad, Jubaer ;
Ali, Mohammed Eunus ;
Azam, Sami .
IEEE ACCESS, 2024, 12 :26839-26874
[43]   Utilization of the PICO framework to improve searching PubMed for clinical questions [J].
Schardt, Connie ;
Adams, Martha B. ;
Owens, Thomas ;
Keitz, Sheri ;
Fontelo, Paul .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2007, 7 (1)
[44]  
Schulz MA, 2020, NAT COMMUN, V11, DOI [10.1038/s41467-020-18037-z, 10.1038/s41467-020-18446-0]
[45]  
Touvron H, 2023, Arxiv, DOI [arXiv:2302.13971, DOI 10.48550/ARXIV.2302.13971]
[46]  
Tran Thi-Oanh, 2024, Comput Biol Med, V174, P108408, DOI [10.1016/j.compbiomed.2024.108408, 10.1016/j.compbiomed.2024.108408]
[47]   Predicting the Effect of Proton Beam Therapy Technology on Pulmonary Toxicities for Patients With Locally Advanced Lung Cancer Enrolled in the Proton Collaborative Group Prospective Clinical Trial [J].
Valdes, Gilmer ;
Scholey, Jessica ;
Nano, Tomi F. ;
Gennatas, Efstathios D. ;
Mohindra, Pranshu ;
Mohammed, Nasir ;
Zeng, Jing ;
Kotecha, Rupesh ;
Rosen, Lane R. ;
Chang, John ;
Tsai, Henry K. ;
Urbanic, James J. ;
Vargas, Carlos E. ;
Yu, Nathan Y. ;
Ungar, Lyle H. ;
Eaton, Eric ;
Simone, Charles B. .
INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2024, 119 (01) :66-77
[48]   Large-scale validation of the prediction model risk of bias assessment Tool (PROBAST) using a short form: high risk of bias models show poorer discrimination [J].
Venema, Esmee ;
Wessler, Benjamin S. ;
Paulus, Jessica K. ;
Salah, Rehab ;
Raman, Gowri ;
Leung, Lester Y. ;
Koethe, Benjamin C. ;
Nelson, Jason ;
Park, Jinny G. ;
van Klaveren, David ;
Steyerberg, Ewout W. ;
Kent, David M. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2021, 138 :32-39
[49]   Quality of Life and Patient-Reported Outcomes Following Proton Radiation Therapy: A Systematic Review [J].
Verma, Vivek ;
Simone, Charles B., II ;
Mishra, Mark V. .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2018, 110 (04) :341-353
[50]   Lyman?Kutcher?Burman normal tissue complication probability modeling for radiation -induced esophagitis in non -small cell lung cancer patients receiving proton radiotherapy [J].
Wang, Zeming ;
Chen, Mei ;
Sun, Jian ;
Jiang, Shengpeng ;
Wang, Li ;
Wang, Xiaochun ;
Sahoo, Narayan ;
Gunn, G. Brandon ;
Frank, Steven J. ;
Nguyen, Quynh-Nhu ;
Liao, Zhongxing ;
Chang, Joe Y. ;
Zhu, X. Ronald ;
Zhang, Xiaodong .
RADIOTHERAPY AND ONCOLOGY, 2020, 146 :200-204