Recurrent neural networks for classifying relations in clinical notes

被引:103
作者
Luo, Yuan [1 ]
机构
[1] Northwestern Univ, Div Hlth & Biomed Informat, Dept Prevent Med, Chicago, IL 60611 USA
关键词
Natural language processing; Medical relation classification; Recurrent neural network; Long Short-Term Memory; Machine learning; EXTRACTION; TEXT; RECORDS;
D O I
10.1016/j.jbi.2017.07.006
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We proposed the first models based on recurrent neural networks (more specifically Long Short-Term Memory - LSTM) for classifying relations from clinical notes. We tested our models on the i2b2/VA relation classification challenge dataset. We showed that our segment LSTM model, with only word embedding feature and no manual feature engineering, achieved a micro-averaged f-measure of 0.661 for classifying medical problem-treatment relations, 0.800 for medical problem-test relations, and 0.683 for medical problem-medical problem relations. These results are comparable to those of the state-ofthe-art systems on the i2b2/VA relation classification challenge. We compared the segment LSTM model with the sentence LSTM model, and demonstrated the benefits of exploring the difference between concept text and context text, and between different contextual parts in the sentence. We also evaluated the impact of word embedding on the performance of LSTM models and showed that medical domain word embedding help improve the relation classification. These results support the use of LSTM models for classifying relations between medical concepts, as they show comparable performance to previously published systems while requiring no manual feature engineering. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:85 / 95
页数:11
相关论文
共 61 条
[1]  
Anick P., 2010, P 2010 I2B2 VA WORKS
[2]  
[Anonymous], P 2010 I2B2 VA WORKS
[3]  
[Anonymous], 2010, P 2010 I2B2 VA WORKS
[4]  
[Anonymous], 2016, NAACL
[5]  
[Anonymous], 2010, P PYTH SCI C
[6]  
[Anonymous], 2014, P 2014 C EMP METH NA
[7]  
[Anonymous], P 2010 I2B2 VA WORKS
[8]  
[Anonymous], 2010, P 2010 I2B2 VA WORKS
[9]  
[Anonymous], BRIEFINGS BIOINFORM
[10]  
[Anonymous], ARXIV160704315