Feature assisted stacked attentive shortest dependency path based Bi-LSTM model for protein-protein interaction

Cited by: 47
Authors
Yadav, Shweta [1]
Ekbal, Asif [1]
Saha, Sriparna [1]
Kumar, Ankit [1]
Bhattacharyya, Pushpak [1]
Affiliations
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna, Bihar, India
Keywords
Relation extraction; Protein-protein interaction; Bi-directional long short term memory (Bi-LSTM); Stacked attention; Deep learning; Shortest dependency path; Support vector machine; INTERACTION EXTRACTION; INFORMATION; NETWORK; NAMES
DOI
10.1016/j.knosys.2018.11.020
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Knowledge about protein-protein interactions is essential for understanding biological processes such as metabolic pathways, DNA replication, and transcription. However, a majority of the existing Protein-Protein Interaction (PPI) systems are dependent primarily on the scientific literature, which is not yet accessible as a structured database. Thus, efficient information extraction systems are required for identifying PPI information from the large collection of biomedical texts. In this paper, we present a novel method based on an attentive deep recurrent neural network, which combines multiple levels of representations exploiting word sequences and dependency path related information to identify PPI information from text. We use a stacked attentive bi-directional long short term memory (Bi-LSTM) network as our recurrent neural network to solve the PPI identification problem. This model leverages joint modeling of proteins and relations in a single unified framework, named the 'Attentive Shortest Dependency Path LSTM' (Att-sdpLSTM) model. The proposed technique was evaluated on five popular benchmark PPI datasets, namely AiMed, BioInfer, HPRD50, IEPA, and LLL. The evaluation shows F1-scores of 93.29%, 81.68%, 78.73%, 76.25%, and 83.92% on the AiMed, BioInfer, HPRD50, IEPA, and LLL datasets, respectively. Comparisons with existing systems show that our proposed approach attains state-of-the-art performance. (C) 2018 Elsevier B.V. All rights reserved.
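The "shortest dependency path" mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `shortest_dependency_path`, the hand-written toy parse, and the use of breadth-first search over undirected arcs are all assumptions made for illustration. In practice the (head, dependent) pairs would come from a dependency parser, and the resulting token path would be fed to a sequence model such as a Bi-LSTM.

```python
from collections import deque


def shortest_dependency_path(edges, source, target):
    """Return the tokens on the shortest path between two tokens.

    `edges` is a list of (head, dependent) pairs. Arcs are treated as
    undirected so the path can climb to a common head and descend again.
    Returns an empty list when no path exists.
    """
    graph = {}
    for head, dependent in edges:
        graph.setdefault(head, []).append(dependent)
        graph.setdefault(dependent, []).append(head)

    # Breadth-first search; remember each token's predecessor on first visit.
    previous = {source: None}
    queue = deque([source])
    while queue:
        token = queue.popleft()
        if token == target:
            break
        for neighbour in graph.get(token, []):
            if neighbour not in previous:
                previous[neighbour] = token
                queue.append(neighbour)

    if target not in previous:
        return []

    # Walk back from the target to the source, then reverse.
    path = []
    token = target
    while token is not None:
        path.append(token)
        token = previous[token]
    return path[::-1]


# Hypothetical, hand-written parse of
# "PROTEIN1 strongly interacts with PROTEIN2 in yeast".
toy_edges = [
    ("interacts", "PROTEIN1"),
    ("interacts", "strongly"),
    ("interacts", "PROTEIN2"),
    ("PROTEIN2", "with"),
    ("interacts", "yeast"),
    ("yeast", "in"),
]

sdp = shortest_dependency_path(toy_edges, "PROTEIN1", "PROTEIN2")
print(sdp)
```

For this toy parse the path keeps only the two protein mentions and the verb that links them, dropping the adverb and the prepositional phrase, which is the usual motivation for using the path instead of the full sentence.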
Pages: 18-29
Number of pages: 12