Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks

Cited by: 13
Authors
Zhang, Canlin [1]
Bis, Daniel [2]
Liu, Xiuwen [2]
He, Zhe [3]
Affiliations
[1] Florida State Univ, Dept Math, Tallahassee, FL 32306 USA
[2] Florida State Univ, Dept Comp Sci, Tallahassee, FL 32306 USA
[3] Florida State Univ, Sch Informat, Tallahassee, FL 32306 USA
Keywords
Word sense disambiguation; LSTM; Self-attention; Biomedical
DOI
10.1186/s12859-019-3079-8
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: In recent years, deep learning methods have achieved state-of-the-art performance on many natural language processing tasks. In the biomedical domain, however, they have not outperformed supervised word sense disambiguation (WSD) methods based on support vector machines or random forests, possibly because of inherent similarities among medical word senses.
Results: In this paper, we propose two deep-learning-based models for supervised WSD: a model based on a bidirectional long short-term memory (BiLSTM) network, and an attention model based on the self-attention architecture. Our results show that the BiLSTM neural network model with a suitable upper-layer structure performs better than the existing state-of-the-art models on the MSH WSD dataset, while our attention model was 3 or 4 times faster than our BiLSTM model with good accuracy. In addition, we trained "universal" models to disambiguate all ambiguous words together. In these universal models, we concatenate the embedding of the target ambiguous word to the max-pooled vector, where it acts as a "hint". The results show that our universal BiLSTM neural network model yielded about 90 percent accuracy.
Conclusion: Deep contextual models based on sequential information processing methods are able to capture the relative contextual information from pre-trained input word embeddings, and thereby provide state-of-the-art results for supervised biomedical WSD tasks.
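The "hint" mechanism mentioned in the abstract (concatenating the target word's embedding to the max-pooled vector) can be illustrated with a minimal sketch. This is not the authors' implementation: the hidden states below are toy stand-ins for BiLSTM outputs, the classifier is a single randomly initialized linear layer, and all function names, dimensions, and values are hypothetical.

```python
import math
import random


def max_pool(hidden_states):
    """Element-wise max over time steps (a list of equal-length vectors)."""
    return [max(column) for column in zip(*hidden_states)]


def add_hint(pooled, target_embedding):
    """Concatenate the target word's embedding to the pooled context vector."""
    return pooled + target_embedding


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(features, weights, bias):
    """Single linear layer followed by softmax over candidate senses."""
    scores = [
        sum(w * x for w, x in zip(row, features)) + b
        for row, b in zip(weights, bias)
    ]
    return softmax(scores)


# Toy stand-ins (assumed values): three time steps of 3-dimensional hidden
# states, and a 2-dimensional embedding of the ambiguous target word.
hidden_states = [[0.1, -0.4, 0.3], [0.7, 0.2, -0.1], [-0.2, 0.5, 0.0]]
target_embedding = [1.0, -1.0]

features = add_hint(max_pool(hidden_states), target_embedding)

# Hypothetical classifier over 4 candidate senses with random weights.
rng = random.Random(0)
num_senses = 4
weights = [[rng.uniform(-0.1, 0.1) for _ in features] for _ in range(num_senses)]
bias = [0.0] * num_senses

probs = classify(features, weights, bias)
```

In this sketch the pooled vector summarizes the context while the appended embedding tells a shared classifier which ambiguous word is being resolved, which is one plausible reading of how a single "universal" model could serve all target words.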
Pages: 15
Related papers
50 in total
  • [1] Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks
    Canlin Zhang
    Daniel Biś
    Xiuwen Liu
    Zhe He
    BMC Bioinformatics, 20
  • [2] Layered Multistep Bidirectional Long Short-Term Memory Networks for Biomedical Word Sense Disambiguation
    Bis, Daniel
    Zhang, Canlin
    Liu, Xiuwen
    He, Zhe
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 313 - 320
  • [3] Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification
    Zhou, Peng
    Shi, Wei
    Tian, Jun
    Qi, Zhenyu
    Li, Bingchen
    Hao, Hongwei
    Xu, Bo
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 207 - 212
  • [4] Word embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedical word sense disambiguation
    Yepes, Antonio Jimeno
    JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 73 : 137 - 147
  • [5] Biomedical Ontology Matching Through Attention-Based Bidirectional Long Short-Term Memory Network
    Xue, Xingsi
    Jiang, Chao
    Zhang, Jie
    Hu, Cong
    JOURNAL OF DATABASE MANAGEMENT, 2021, 32 (04) : 14 - 27
  • [6] Effective Attention-based Neural Architectures for Sentence Compression with Bidirectional Long Short-Term Memory
    Nhi-Thao Tran
    Viet-Thang Luong
    Ngan Luu-Thuy Nguyen
    Minh-Quoc Nghiem
    PROCEEDINGS OF THE SEVENTH SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2016), 2016, : 123 - 130
  • [7] Biomedical Word Sense Disambiguation Based on Graph Attention Networks
    Zhang, Chun-Xiang
    Wang, Ming-Lei
    Gao, Xue-Yao
    IEEE ACCESS, 2022, 10 : 123328 - 123336
  • [8] Enhancing Word Sense Disambiguation for Amharic homophone words using Bidirectional Long Short-Term Memory network
    Belete, Mequanent Degu
    Shiferaw, Lijalem Getanew
    Alitasb, Girma Kassa
    Tamir, Tariku Sinshaw
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 23
  • [9] Image Captioning with Bidirectional Semantic Attention-Based Guiding of Long Short-Term Memory
    Cao, Pengfei
    Yang, Zhongyi
    Sun, Liang
    Liang, Yanchun
    Yang, Mary Qu
    Guan, Renchu
    NEURAL PROCESSING LETTERS, 2019, 50 (01) : 103 - 119
  • [10] Image Captioning with Bidirectional Semantic Attention-Based Guiding of Long Short-Term Memory
    Pengfei Cao
    Zhongyi Yang
    Liang Sun
    Yanchun Liang
    Mary Qu Yang
    Renchu Guan
    Neural Processing Letters, 2019, 50 : 103 - 119