Development of a text mining algorithm for identifying adverse drug reactions in electronic health records

被引:1
|
作者
van de Burgt, Britt W. M. [1 ,2 ,3 ]
Wasylewicz, Arthur T. M. [2 ]
Dullemond, Bjorn [4 ]
Jessurun, Naomi T. [5 ]
Grouls, Rene J. E.
Bouwman, R. Arthur [3 ,6 ]
Korsten, Erik H. M. [2 ,3 ]
Egberts, Toine C. G. [7 ,8 ]
机构
[1] Catharina Hosp, Div Clin Pharm, NL-5623 EJ Eindhoven, Netherlands
[2] Catharina Hosp, Div Healthcare Intelligence, NL-5623 EJ Eindhoven, Netherlands
[3] Tech Univ Eindhoven, Dept Elect Engn, Signal Proc Grp, NL-5612 AP Eindhoven, Netherlands
[4] Tech Univ Eindhoven, Dept Math & Comp Sci, NL-5612 AP Eindhoven, Netherlands
[5] Netherlands Pharmacovigilance Ctr LAREB, NL-5237 MH Shertogenbosch, Netherlands
[6] Catharina Hosp, Dept Anesthesiol, NL-5623 EJ Eindhoven, Netherlands
[7] Univ Med Ctr Utrecht, Dept Clin Pharm, NL-3584 CX Utrecht, Netherlands
[8] Univ Utrecht, Fac Sci, Utrecht Inst Pharmaceut Sci, Dept Pharmacoepidemiol & Clin Pharmacol, NL-3584 CX Utrecht, Netherlands
关键词
adverse drug reaction; text mining; free-text; natural language processing; clinical decision support systems; electronic health record; EXTRACTION; INFORMATION; MEDICATION; EVENTS; HOSPITALIZATION; DOCUMENTATION;
D O I
10.1093/jamiaopen/ooae070
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective Adverse drug reactions (ADRs) are a significant healthcare concern. They are often documented as free text in electronic health records (EHRs), making them challenging to use in clinical decision support systems (CDSS). The study aimed to develop a text mining algorithm to identify ADRs in free text of Dutch EHRs.Materials and Methods In Phase I, our previously developed CDSS algorithm was recoded and improved upon with the same relatively large dataset of 35 000 notes (Step A), using R to identify possible ADRs with Medical Dictionary for Regulatory Activities (MedDRA) terms and the related Systematized Nomenclature of Medicine Clinical Terms (SNOMED-CT) (Step B). In Phase II, 6 existing text-mining R-scripts were used to detect and present unique ADRs, and positive predictive value (PPV) and sensitivity were observed.Results In Phase IA, the recoded algorithm performed better than the previously developed CDSS algorithm, resulting in a PPV of 13% and a sensitivity of 93%. For The sensitivity for serious ADRs was 95%. The algorithm identified 58 additional possible ADRs. In Phase IB, the algorithm achieved a PPV of 10%, a sensitivity of 86%, and an F-measure of 0.18. In Phase II, four R-scripts enhanced the sensitivity and PPV of the algorithm, resulting in a PPV of 70%, a sensitivity of 73%, an F-measure of 0.71, and a 63% sensitivity for serious ADRs.Discussion and Conclusion The recoded Dutch algorithm effectively identifies ADRs from free-text Dutch EHRs using R-scripts and MedDRA/SNOMED-CT. The study details its limitations, highlighting the algorithm's potential and significant improvements. The study addressed the challenge of identifying adverse drug reactions (ADRs) in the free-text notes of Dutch electronic health records (EHRs). ADRs are crucial to monitor because they can harm patients and increase healthcare costs. However, they are often documented in an unstructured manner, making it difficult for clinical decision support systems (CDSS) to detect them effectively. To address this, a text mining (TM) algorithm was developed using R programming to identify possible ADRs from these free-text notes. In the first phase, we improved and recoded the existing CDSS algorithm into the TM algorithm and tested it on a large dataset of 35 000 EHR notes. This new algorithm showed better performance in identifying ADRs compared to the old one, with a sensitivity of 86% and a positive predictive value (PPV) of 10%. In the second phase, additional text-mining scripts were used, which significantly improved the algorithm's accuracy. The final results showed a PPV of 70% and a sensitivity of 73%, indicating a substantial improvement in the algorithm's ability to detect ADRs. The study concludes that the new algorithm is effective in identifying ADRs from Dutch EHRs, highlighting its potential to enhance patient safety.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Learning to extract adverse drug reaction events from electronic health records in Spanish
    Casillas, Arantza
    Perez, Alicia
    Oronoz, Maite
    Gojenola, Koldo
    Santiso, Sara
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 61 : 235 - 245
  • [42] Comparative analysis of pharmacovigilance methods in the detection of adverse drug reactions using electronic medical records
    Liu, Mei
    Hinz, Eugenia Renne McPeek
    Matheny, Michael Edwin
    Denny, Joshua C.
    Schildcrout, Jonathan Scott
    Miller, Randolph A.
    Xu, Hua
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (03) : 420 - 426
  • [43] Natural Language Processing and Machine Learning for Identifying Incident Stroke From Electronic Health Records: Algorithm Development and Validation
    Zhao, Yiqing
    Fu, Sunyang
    Bielinski, Suzette J.
    Decker, Paul A.
    Chamberlain, Alanna M.
    Roger, Veronique L.
    Liu, Hongfang
    Larson, Nicholas B.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (03)
  • [44] Mining clinical text for signals of adverse drug-drug interactions
    Iyer, Srinivasan V.
    Harpaz, Rave
    LePendu, Paea
    Bauer-Mehren, Anna
    Shah, Nigam H.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (02) : 353 - 362
  • [45] A Multi-Relational Association Mining Algorithm for Screening Suspected Adverse Drug Reactions
    Ji, Yanqing
    Shen, Fangyang
    Tran, John
    2014 11TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS (ITNG), 2014, : 407 - 412
  • [46] From free-text electronic health records to structured cohorts: Onconum, an innovative methodology for real-world data mining in breast cancer
    Simoulin, Antoine
    Thiebaut, Nicolas
    Neuberger, Karl
    Ibnouhsein, Issam
    Brunel, Nicolas
    Vine, Raphael
    Bousquet, Nicolas
    Latapy, Jules
    Reix, Nathalie
    Moliere, Sebastien
    Lodi, Massimo
    Mathelin, Carole
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 240
  • [47] Adverse Event extraction from Structured Product Labels using the Event-based Text-mining of Health Electronic Records (ETHER) system
    Pandey, Abhishek
    Kreimeyer, Kory
    Foster, Matthew
    Oanh Dang
    Ly, Thomas
    Wang, Wei
    Forshee, Richard
    Botsis, Taxiarchis
    HEALTH INFORMATICS JOURNAL, 2019, 25 (04) : 1232 - 1243
  • [48] Utilizing Text Mining on Online Medical Forums to Predict Label Change due to Adverse Drug Reactions
    Feldman, Ronen
    Netzer, Oded
    Peretz, Aviv
    Rosenfeld, Binyamin
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 1779 - 1788
  • [49] Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model
    Sampathkumar, Hariprasad
    Chen, Xue-wen
    Luo, Bo
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2014, 14
  • [50] Exploring the Association of Cancer and Depression in Electronic Health Records: Combining Encoded Diagnosis and Mining Free-Text Clinical Notes
    Leis, Angela
    Casadevall, David
    Albanell, Joan
    Posso, Margarita
    Macia, Francesc
    Castells, Xavier
    Ramirez-Anguita, Juan Manuel
    Roldan, Jordi Martinez
    Furlong, Laura, I
    Sanz, Ferran
    Ronzano, Francesco
    Mayer, Miguel A.
    JMIR CANCER, 2022, 8 (03):