Improving RNN with Attention and Embedding for Adverse Drug Reactions

被引:14
作者
Pandey, Chandra [1 ]
Ibrahim, Zina [1 ]
Wu, Honghan [1 ]
Iqbal, Ehtesham [1 ]
Dobson, Richard [1 ]
机构
[1] Kings Coll London, IoPPN, Denmark Hill, London SE5 8AF, England
来源
PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON DIGITAL HEALTH (DH'17) | 2017年
基金
英国经济与社会研究理事会; 英国惠康基金; 英国工程与自然科学研究理事会; 英国医学研究理事会;
关键词
Recurrent Neural Networks; Named Entity Recognition; Adverse Drug Reactions; TEXT;
D O I
10.1145/3079452.3079501
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Electronic Health Records (EHR) narratives are a rich source of information, embedding high-resolution information of value to secondary research use. However, because the EHRs are mostly in natural language free-text and highly ambiguity-ridden, many natural language processing algorithms have been devised around them to extract meaningful structured information about clinical entities. The performance of the algorithms however, largely varies depending on the training dataset as well as the effectiveness of the use of background knowledge to steer the learning process. In this paper we study the impact of initializing the training of a neural network natural language processing algorithm with pre-defined clinical word embeddings to improve feature extraction and relationship classification between entities. We add our embedding framework to a bi-directional long short-term memory (Bi-LSTM) neural network, and further study the effect of using attention weights in neural networks for sequence labelling tasks to extract knowledge of Adverse Drug Reactions (ADRs). We incorporate unsupervised word embeddings using Word2Vec and GloVe from widely available medical resources such as Multiparameter Intelligent Monitoring in Intensive Care (MIMIC) II corpora, Unified Medical Language System (UMLS) as well as embed pharmaco lexicon from available EHRs. Our algorithm, implemented using two datasets, shows that our architecture outperforms baseline Bi-LSTM or Bi-LSTM networks using linear chain and Skip-Chain conditional random fields (CRF).
引用
收藏
页码:67 / 71
页数:5
相关论文
共 19 条
[1]  
Aronson AR, 2001, J AM MED INFORM ASSN, P17
[2]  
Chorowski J, 2014, Arxiv, DOI [arXiv:1412.1602, DOI 10.48550/ARXIV.1412.1602]
[3]  
Duchi J, 2011, J MACH LEARN RES, V12, P2121
[4]  
Friedman C, 2000, J AM MED INFORM ASSN, P270
[5]   Extraction of potential adverse drug events from medical case reports [J].
Gurulingappa, Harsha ;
Mateen-Rajput, Abdul ;
Toldo, Luca .
JOURNAL OF BIOMEDICAL SEMANTICS, 2012, 3
[6]   Identifying adverse drug event information in clinical notes with distributional semantic representations of context [J].
Henriksson, Aron ;
Kvist, Maria ;
Dalianis, Hercules ;
Duneld, Martin .
JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 57 :333-349
[7]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[8]  
Huang ZH, 2015, Arxiv, DOI [arXiv:1508.01991, 10.48550/arXiv.1508.01991]
[9]   Identification of Adverse Drug Events from Free Text Electronic Patient Records and Information in a Large Mental Health Case Register [J].
Iqbal, Ehtesham ;
Mallah, Robbie ;
Jackson, Richard George ;
Ball, Michael ;
Ibrahim, Zina M. ;
Broadbent, Matthew ;
Dzahini, Olubanke ;
Stewart, Robert ;
Johnston, Caroline ;
Dobson, Richard J. B. .
PLOS ONE, 2015, 10 (08)
[10]  
Jagannatha Abhyuday N, 2016, Proc Conf Empir Methods Nat Lang Process, V2016, P856