Named entity recognition in electronic health records using transfer learning bootstrapped Neural Networks

被引:58
|
作者
Gligic, Luka [1 ]
Kormilitzin, Andrey [1 ]
Goldberg, Paul [1 ]
Nevado-Holgado, Alejo [1 ]
机构
[1] Univ Oxford, Oxford, England
基金
英国医学研究理事会;
关键词
Neural Networks; NLP; Named entity recognition; Electronic health records; Transfer learning; LSTM; PATIENT SMOKING STATUS; MEDICATION INFORMATION;
D O I
10.1016/j.neunet.2019.08.032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks (NNs) have become the state of the art in many machine learning applications, such as image, sound (LeCun et al., 2015) and natural language processing (Young et al., 2017; Linggard et al., 2012). However, the success of NNs remains dependent on the availability of large labelled datasets, such as in the case of electronic health records (EHRs). With scarce data, NNs are unlikely to be able to extract this hidden information with practical accuracy. In this study, we develop an approach that solves these problems for named entity recognition, obtaining 94.6 F1 score in I2B2 2009 Medical Extraction Challenge (Uzuner et al., 2010), 4.3 above the architecture that won the competition. To achieve this, we bootstrap our NN models through transfer learning by pretraining word embeddings on a secondary task performed on a large pool of unannotated EHRs and using the output embeddings as a foundation of a range of NN architectures. Beyond the official I2B2 challenge, we further achieve 82.4 F1 on extracting relationships between medical terms using attention-based seq2seq models bootstrapped in the same manner. Crown Copyright (C) 2019 Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:132 / 139
页数:8
相关论文
共 50 条
  • [31] Named Entity Recognition in Semi Structured Documents Using Neural Tensor Networks
    Shehzad, Khurram
    Ul-Hasan, Adnan
    Malik, Muhammad Imran
    Shafait, Faisal
    DOCUMENT ANALYSIS SYSTEMS, 2020, 12116 : 398 - 409
  • [32] Transfer Learning for Domain-Specific Named Entity Recognition in German
    Torge, Sunna
    Hahn, Waldemar
    Jaekel, Rene
    2020 6TH IEEE CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'20), 2020, : 321 - 327
  • [33] A Named Entity Recognition Approach for Electronic Medical Records Using BERT Semantic Enhancement and BiLSTM
    Lai, Xuewei
    Jie, Qingqing
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2023, 19 (01)
  • [34] Leveraging weak supervision to perform named entity recognition in electronic health records progress notes to identify the ophthalmology exam
    Wang, Sophia Y.
    Huang, Justin
    Hwang, Hannah
    Hu, Wendeng
    Tao, Shiqi
    Hernandez-Boussard, Tina
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2022, 167
  • [35] Medical Named Entity Recognition Using Weakly Supervised Learning
    Long-Long Ma
    Jie Yang
    Bo An
    Shuaikang Liu
    Gaijuan Huang
    Cognitive Computation, 2022, 14 : 1068 - 1079
  • [36] Medical Named Entity Recognition Using Weakly Supervised Learning
    Ma, Long-Long
    Yang, Jie
    An, Bo
    Liu, Shuaikang
    Huang, Gaijuan
    COGNITIVE COMPUTATION, 2022, 14 (03) : 1068 - 1079
  • [37] Mongolian Named Entity Recognition with Bidirectional Recurrent Neural Networks
    Wang, Weihua
    Bao, Feilong
    Gao, Guanglai
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 495 - 500
  • [38] Neural Architecture for Persian Named Entity Recognition
    Hafezi, Leila
    Rezaeian, Mehdi
    2018 4TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2018, : 61 - 64
  • [39] Named entity recognition using point prediction and active learning
    Kobayashi, Koga
    Wakabayashi, Kei
    IIWAS2019: THE 21ST INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2019, : 287 - 293
  • [40] Ensemble Transfer Learning on Augmented Domain Resources for Oncological Named Entity Recognition in Chinese Clinical Records
    Zhou, Meifeng
    Tan, Jindian
    Yang, Song
    Wang, Haixia
    Wang, Lin
    Xiao, Zhifeng
    IEEE ACCESS, 2023, 11 : 80416 - 80428