Using Large Language Models to Detect and Understand Drug Discontinuation Events in Web-Based Forums: Development and Validation Study

Cited by: 1

Authors:

Trevena, William [1]

Zhong, Xiang [1]

Alvarado, Michelle [1]

Semenov, Alexander [1]

Oktay, Alp [2]

Devlin, Devin [3]

Gohil, Aarya Yogesh [1]

Chittimouju, Sai Harsha [1]

Affiliations:

[1] Univ Florida, Dept Ind & Syst Engn, POB 115002, Gainesville, FL 32611 USA

[2] Univ San Diego, Dept Ind & Syst Engn, San Diego, CA USA

[3] Microsoft, Seattle, WA USA

Source:

JOURNAL OF MEDICAL INTERNET RESEARCH | 2025 / Vol. 27

Keywords:

natural language processing; large language models; ChatGPT; drug discontinuation events; zero-shot classification; artificial intelligence; AI

DOI:

10.2196/54601

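Chinese Library Classification (CLC) number:

R19 [Health care organization and services (health services administration)];

Subject classification code: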

Abstract:

Background: The implementation of large language models (LLMs), such as BART (Bidirectional and Auto-Regressive Transformers) and GPT-4, has revolutionized the extraction of insights from unstructured text. These advancements have expanded into health care, allowing analysis of social media for public health insights. However, the detection of drug discontinuation events (DDEs) remains underexplored. Identifying DDEs is crucial for understanding medication adherence and patient outcomes.
Objective: The aim of this study is to provide a flexible framework for investigating various clinical research questions in data-sparse environments. We provide an example of the utility of this framework by identifying DDEs and their root causes in an open-source web-based forum, MedHelp, and by releasing the first open-source DDE datasets to aid further research in this domain.
Methods: [...] Representations from Transformer with Disentangled Attention), and BART, among others, to detect and determine the root causes of DDEs in user comments posted on MedHelp. Our study design included the use of zero-shot classification, which allows these models to make predictions without task-specific training. We split user comments into sentences and applied different classification strategies to assess the performance of these models in identifying DDEs and their root causes.
Results: Among the selected models, GPT-4o performed the best at determining the root causes of DDEs, predicting only 12.9% of root causes incorrectly (Hamming loss). Among the open-source models tested, BART demonstrated the best performance in detecting DDEs, achieving an F1-score of 0.86, a false positive rate of 2.8%, and a false negative rate of 6.5%, all without any fine-tuning. The dataset included 10.7% (107/1000) DDEs, emphasizing the models' robustness in an imbalanced data context.
Conclusions: This study demonstrated the effectiveness of open- and closed-source LLMs, such as GPT-4o and BART, for detecting DDEs and their root causes from publicly accessible data through zero-shot classification. The robust and scalable framework we propose can aid researchers in addressing data-sparse clinical research questions. The launch of open-access DDE datasets has the potential to stimulate further research and novel discoveries in this field.
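
The abstract reports a Hamming loss for root-cause prediction and an F1-score, false positive rate, and false negative rate for DDE detection, but does not state how they were computed. The following is a minimal sketch using the standard textbook definitions of these metrics. The function names and the toy labels are hypothetical and are not taken from the study or its datasets.

```python
from typing import Dict, Sequence


def hamming_loss(y_true: Sequence[Sequence[int]],
                 y_pred: Sequence[Sequence[int]]) -> float:
    """Fraction of (sentence, label) slots where prediction and truth differ."""
    total = 0
    wrong = 0
    for true_row, pred_row in zip(y_true, y_pred):
        for t, p in zip(true_row, pred_row):
            total += 1
            wrong += int(t != p)
    return wrong / total


def binary_metrics(y_true: Sequence[int],
                   y_pred: Sequence[int]) -> Dict[str, float]:
    """F1, false positive rate, and false negative rate for 0/1 labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    fnr = fn / (fn + tp) if (fn + tp) else 0.0
    return {"f1": f1, "fpr": fpr, "fnr": fnr}


# Hypothetical toy data: two sentences, three candidate root causes each.
toy_true_causes = [[1, 0, 0], [0, 1, 1]]
toy_pred_causes = [[1, 0, 1], [0, 1, 1]]
toy_loss = hamming_loss(toy_true_causes, toy_pred_causes)

# Hypothetical toy data: 1 = sentence describes a DDE, 0 = it does not.
toy_true_dde = [1, 1, 0, 0, 0]
toy_pred_dde = [1, 0, 0, 0, 1]
toy_metrics = binary_metrics(toy_true_dde, toy_pred_dde)
```

Under these definitions, a Hamming loss of 12.9% would mean that 12.9% of all sentence-label decisions were wrong, counting both missed and spurious root causes. On imbalanced data such as the 10.7% DDE prevalence reported here, the false positive and false negative rates are computed over different denominators (non-DDE and DDE sentences, respectively), which is why they are reported alongside the F1-score.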


Pages: 17
