Adverse drug events and medication relation extraction in electronic health records with ensemble deep learning methods

被引:61
作者
Christopoulou, Fenia [1 ,2 ]
Thy Thy Tran [1 ,2 ]
Sahu, Sunil Kumar [1 ]
Miwa, Makoto [2 ,3 ]
Ananiadou, Sophia [1 ,2 ]
机构
[1] Univ Manchester, Natl Ctr Text Min, Sch Comp Sci, Manchester, Lancs, England
[2] Natl Inst Adv Ind Sci & Technol, Artificial Intelligence Res Ctr, Tokyo, Japan
[3] Toyota Technol Inst, Nagoya, Aichi, Japan
基金
英国生物技术与生命科学研究理事会;
关键词
neural networks; adverse drug events; relation extraction; ensemble methods; electronic health records; RECOGNITION;
D O I
10.1093/jamia/ocz101
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: Identification of drugs, associated medication entities, and interactions among them are crucial to prevent unwanted effects of drug therapy, known as adverse drug events. This article describes our participation to the n2c2 shared-task in extracting relations between medication-related entities in electronic health records. Materials and Methods: We proposed an ensemble approach for relation extraction and classification between drugs and medication-related entities. We incorporated state-of-the-art named-entity recognition (NER) models based on bidirectional long short-term memory (BiLSTM) networks and conditional random fields (CRF) for end-to-end extraction. We additionally developed separate models for intra- and inter-sentence relation extraction and combined them using an ensemble method. The intra-sentence models rely on bidirectional long short-term memory networks and attention mechanisms and are able to capture dependencies between multiple related pairs in the same sentence. For the inter-sentence relations, we adopted a neural architecture that utilizes the Transformer network to improve performance in longer sequences. Results: Our team ranked third with a micro-averaged F1 score of 94.72% and 87.65% for relation and end-toend relation extraction, respectively (Tracks 2 and 3). Our ensemble effectively takes advantages from our proposed models. Analysis of the reported results indicated that our proposed approach is more generalizable than the top-performing system, which employs additional training data- and corpus-driven processing techniques. Conclusions: We proposed a relation extraction system to identify relations between drugs and medicationrelated entities. The proposed approach is independent of external syntactic tools. Analysis showed that by using latent Drug-Drug interactions we were able to significantly improve the performance of non-Drug-Drug pairs in EHRs.
引用
收藏
页码:39 / 46
页数:8
相关论文
共 42 条
  • [1] [Anonymous], 2018, INT WORKSH MED ADV D
  • [2] [Anonymous], 2018, ATTENTIVE SEQUENCE M
  • [3] INCIDENCE OF ADVERSE DRUG EVENTS AND POTENTIAL ADVERSE DRUG EVENTS - IMPLICATIONS FOR PREVENTION
    BATES, DW
    CULLEN, DJ
    LAIRD, N
    PETERSEN, LA
    SMALL, SD
    SERVI, D
    LAFFEL, G
    SWEITZER, BJ
    SHEA, BF
    HALLISEY, R
    VANDERVLIET, M
    NEMESKAL, R
    LEAPE, LL
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1995, 274 (01): : 29 - 34
  • [4] Bekoulis G, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2830
  • [5] Joint entity recognition and relation extraction as a multi-head selection problem
    Bekoulis, Giannis
    Deleu, Johannes
    Demeester, Thomas
    Develder, Chris
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 114 : 34 - 45
  • [6] Björne J, 2018, SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2018), P98
  • [7] Chiu B., 2016, Association for Computational Linguistics (ACL), P166, DOI [DOI 10.18653/V1/W16-2922, 10.18653/v1/w16-2922]
  • [8] Christopoulou F, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, P81
  • [9] Dandala B, 2017, P TAC
  • [10] Dasgupta T, 2017, P CICLING, P463