Development of a text mining algorithm for identifying adverse drug reactions in electronic health records

被引：1

作者：

van de Burgt, Britt W. M. ^{[1
,2
,3
]}

Wasylewicz, Arthur T. M. ^{[2
]}

Dullemond, Bjorn ^{[4
]}

Jessurun, Naomi T. ^{[5
]}

Grouls, Rene J. E.

Bouwman, R. Arthur ^{[3
,6
]}

Korsten, Erik H. M. ^{[2
,3
]}

Egberts, Toine C. G. ^{[7
,8
]}

机构：

[1] Catharina Hosp, Div Clin Pharm, NL-5623 EJ Eindhoven, Netherlands

[2] Catharina Hosp, Div Healthcare Intelligence, NL-5623 EJ Eindhoven, Netherlands

[3] Tech Univ Eindhoven, Dept Elect Engn, Signal Proc Grp, NL-5612 AP Eindhoven, Netherlands

[4] Tech Univ Eindhoven, Dept Math & Comp Sci, NL-5612 AP Eindhoven, Netherlands

[5] Netherlands Pharmacovigilance Ctr LAREB, NL-5237 MH Shertogenbosch, Netherlands

[6] Catharina Hosp, Dept Anesthesiol, NL-5623 EJ Eindhoven, Netherlands

[7] Univ Med Ctr Utrecht, Dept Clin Pharm, NL-3584 CX Utrecht, Netherlands

[8] Univ Utrecht, Fac Sci, Utrecht Inst Pharmaceut Sci, Dept Pharmacoepidemiol & Clin Pharmacol, NL-3584 CX Utrecht, Netherlands

来源：

JAMIA OPEN | 2024年 / 7卷 / 03期

关键词：

adverse drug reaction; text mining; free-text; natural language processing; clinical decision support systems; electronic health record; EXTRACTION; INFORMATION; MEDICATION; EVENTS; HOSPITALIZATION; DOCUMENTATION;

D O I：

10.1093/jamiaopen/ooae070

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

Objective Adverse drug reactions (ADRs) are a significant healthcare concern. They are often documented as free text in electronic health records (EHRs), making them challenging to use in clinical decision support systems (CDSS). The study aimed to develop a text mining algorithm to identify ADRs in free text of Dutch EHRs.Materials and Methods In Phase I, our previously developed CDSS algorithm was recoded and improved upon with the same relatively large dataset of 35 000 notes (Step A), using R to identify possible ADRs with Medical Dictionary for Regulatory Activities (MedDRA) terms and the related Systematized Nomenclature of Medicine Clinical Terms (SNOMED-CT) (Step B). In Phase II, 6 existing text-mining R-scripts were used to detect and present unique ADRs, and positive predictive value (PPV) and sensitivity were observed.Results In Phase IA, the recoded algorithm performed better than the previously developed CDSS algorithm, resulting in a PPV of 13% and a sensitivity of 93%. For The sensitivity for serious ADRs was 95%. The algorithm identified 58 additional possible ADRs. In Phase IB, the algorithm achieved a PPV of 10%, a sensitivity of 86%, and an F-measure of 0.18. In Phase II, four R-scripts enhanced the sensitivity and PPV of the algorithm, resulting in a PPV of 70%, a sensitivity of 73%, an F-measure of 0.71, and a 63% sensitivity for serious ADRs.Discussion and Conclusion The recoded Dutch algorithm effectively identifies ADRs from free-text Dutch EHRs using R-scripts and MedDRA/SNOMED-CT. The study details its limitations, highlighting the algorithm's potential and significant improvements. The study addressed the challenge of identifying adverse drug reactions (ADRs) in the free-text notes of Dutch electronic health records (EHRs). ADRs are crucial to monitor because they can harm patients and increase healthcare costs. However, they are often documented in an unstructured manner, making it difficult for clinical decision support systems (CDSS) to detect them effectively. To address this, a text mining (TM) algorithm was developed using R programming to identify possible ADRs from these free-text notes. In the first phase, we improved and recoded the existing CDSS algorithm into the TM algorithm and tested it on a large dataset of 35 000 EHR notes. This new algorithm showed better performance in identifying ADRs compared to the old one, with a sensitivity of 86% and a positive predictive value (PPV) of 10%. In the second phase, additional text-mining scripts were used, which significantly improved the algorithm's accuracy. The final results showed a PPV of 70% and a sensitivity of 73%, indicating a substantial improvement in the algorithm's ability to detect ADRs. The study concludes that the new algorithm is effective in identifying ADRs from Dutch EHRs, highlighting its potential to enhance patient safety.

引用

页数：9

共 50 条

[21] Identifying naloxone administrations in electronic health record data using a text-mining tool
Derington, Catherine G.
Mueller, Shane R.
Glanz, Jason M.
Binswanger, Ingrid A.
SUBSTANCE ABUSE, 2021, 42 (04) : 806 - 812
[22] Adverse Drug Reaction extraction on Electronic Health Records written in Spanish
Santiso Gonzalez, Sara
PROCESAMIENTO DEL LENGUAJE NATURAL, 2020, (64): : 119 - 122
[23] Use of the Electronic Health Record for Monitoring Adverse Drug Reactions
Anum F. Muzaffar
Sandra Abdul-Massih
James M. Stevenson
Santiago Alvarez-Arango
Current Allergy and Asthma Reports, 2023, 23 : 417 - 426
[24] Identifying Adverse Drug Reactions by Analyzing Twitter Messages
Rajapaksha, Parinda
Weerasinghe, Ruvan
2015 Fifteenth International Conference on Advances in ICT for Emerging Regions (ICTer), 2015, : 37 - 42
[25] Clinicians’ Reports in Electronic Health Records Versus Patients’ Concerns in Social Media: A Pilot Study of Adverse Drug Reactions of Aspirin and Atorvastatin
Maxim Topaz
Kenneth Lai
Neil Dhopeshwarkar
Diane L. Seger
Roee Sa’adon
Foster Goss
Ronen Rozenblum
Li Zhou
Drug Safety, 2016, 39 : 241 - 250
[26] Mining Severe Drug Hypersensitivity Reaction Cases in Pediatric Electronic Health Records: Methodology Development and Applications
Yu, Yuncui
Zhao, Qiuye
Cao, Wang
Wang, Xiaochuan
Li, Yanming
Xie, Yuefeng
Wang, Xiaoling
JMIR MEDICAL INFORMATICS, 2022, 10 (09)
[27] Data Processing and Text Mining Technologies on Electronic Medical Records: A Review
Sun, Wencheng
Cai, Zhiping
Li, Yangyang
Liu, Fang
Fang, Shengqun
Wang, Guoyan
JOURNAL OF HEALTHCARE ENGINEERING, 2018, 2018
[28] Machine Learning Based Text Mining in Electronic Health Records: Cardiovascular Patient Cases
Sikorskiy, Sergey
Metsker, Oleg
Yakovlev, Alexey
Kovalchuk, Sergey
COMPUTATIONAL SCIENCE - ICCS 2018, PT III, 2018, 10862 : 818 - 824
[29] Automatic Estimation of the Most Likely Drug Combination in Electronic Health Records Using the Smooth Algorithm: Development and Validation Study
Ouchi, Dan
Giner-Soriano, Maria
Gomez-Lumbreras, Ainhoa
Urgell, Cristina Vedia
Torres, Ferran
Morros, Rosa
JMIR MEDICAL INFORMATICS, 2022, 10 (11)
[30] ADEPt, a semantically-enriched pipeline for extracting adverse drug events from free-text electronic health records
Iqbal, Ehtesham
Mallah, Robbie
Rhodes, Daniel
Wu, Honghan
Romero, Alvin
Chang, Nynn
Dzahini, Olubanke
Pandey, Chandra
Broadbent, Matthew
Stewart, Robert
Dobson, Richard J. B.
Ibrahim, Zina M.
PLOS ONE, 2017, 12 (11):

← 1 2 3 4 5 →