An attention-based effective neural model for drug-drug interactions extraction

被引:66
作者
Zheng, Wei [1 ,2 ]
Lin, Hongfei [1 ]
Luo, Ling [1 ]
Zhao, Zhehuan [1 ]
Li, Zhengguang [1 ,2 ]
Zhang, Yijia [1 ]
Yang, Zhihao [1 ]
Wang, Jian [1 ]
机构
[1] Dalian Univ Technol, Coll Comp Sci & Technol, Dalian, Peoples R China
[2] Dalian JiaoTong Univ, Coll Software, Dalian, Peoples R China
关键词
Attention; Recurrent neural network; Long short-term memory; Drug-drug interactions; Text mining;
D O I
10.1186/s12859-017-1855-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Drug-drug interactions (DDIs) often bring unexpected side effects. The clinical recognition of DDIs is a crucial issue for both patient safety and healthcare cost control. However, although text-mining-based systems explore various methods to classify DDIs, the classification performance with regard to DDIs in long and complex sentences is still unsatisfactory. Methods: In this study, we propose an effective model that classifies DDIs from the literature by combining an attention mechanism and a recurrent neural network with long short-term memory (LSTM) units. In our approach, first, a candidate-drug-oriented input attention acting on word-embedding vectors automatically learns which words are more influential for a given drug pair. Next, the inputs merging the position- and POS-embedding vectors are passed to a bidirectional LSTM layer whose outputs at the last time step represent the high-level semantic information of the whole sentence. Finally, a softmax layer performs DDI classification. Results: Experimental results from the DDIExtraction 2013 corpus show that our system performs the best with respect to detection and classification (84.0% and 77.3%, respectively) compared with other state-of-the-art methods. In particular, for the Medline-2013 dataset with long and complex sentences, our F-score far exceeds those of top-ranking systems by 12.6%. Conclusions: Our approach effectively improves the performance of DDI classification tasks. Experimental analysis demonstrates that our model performs better with respect to recognizing not only close-range but also long-range patterns among words, especially for long, complex and compound sentences.
引用
收藏
页数:11
相关论文
共 43 条
[1]   Medication errors: prevention using information technology systems [J].
Agrawal, Abha .
BRITISH JOURNAL OF CLINICAL PHARMACOLOGY, 2009, 67 (06) :681-686
[2]   Text mining and its potential applications in systems biology [J].
Ananiadou, Sophia ;
Kell, Douglas B. ;
Tsujii, Jun-ichi .
TRENDS IN BIOTECHNOLOGY, 2006, 24 (12) :571-579
[3]  
[Anonymous], 2016, P 25 INT JOINT C ART
[4]  
[Anonymous], 2017, ARXIV170108303
[5]  
[Anonymous], 2014, ARXIV14103916
[6]  
[Anonymous], 2016, ABS160203609 CORR
[7]  
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, DOI 10.48550/ARXIV.1409.0473]
[8]   Text mining for pharmacovigilance: Using machine learning for drug name recognition and drug-drug interaction extraction and classification [J].
Ben Abacha, Asma ;
Chowdhury, Md. Faisal Mahbub ;
Karanasiou, Aikaterini ;
Mrabet, Yassine ;
Lavelli, Alberto ;
Zweigenbaum, Pierre .
JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 :122-132
[9]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[10]  
Bengio Y, 2001, ADV NEUR IN, V13, P932