A multiple distributed representation method based on neural network for biomedical event extraction

被引:26
作者
Wang, Anran [1 ]
Wang, Jian [1 ]
Lin, Hongfei [1 ]
Zhang, Jianhai [1 ]
Yang, Zhihao [1 ]
Xu, Kan [1 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
关键词
Biomedical event extraction; Distributed representation; Deep learning; Convolutional neural network;
D O I
10.1186/s12911-017-0563-9
中图分类号
R-058 [];
学科分类号
摘要
Background: Biomedical event extraction is one of the most frontier domains in biomedical research. The two main subtasks of biomedical event extraction are trigger identification and arguments detection which can both be considered as classification problems. However, traditional state-of-the-art methods are based on support vector machine (SVM) with massive manually designed one-hot represented features, which require enormous work but lack semantic relation among words. Methods: In this paper, we propose a multiple distributed representation method for biomedical event extraction. The method combines context consisting of dependency-based word embedding, and task-based features represented in a distributed way as the input of deep learning models to train deep learning models. Finally, we used softmax classifier to label the example candidates. Results: The experimental results on Multi-Level Event Extraction (MLEE) corpus show higher F-scores of 77.97% in trigger identification and 58.31% in overall compared to the state-of-the-art SVM method. Conclusions: Our distributed representation method for biomedical event extraction avoids the problems of semantic gap and dimension disaster from traditional one-hot representation methods. The promising results demonstrate that our proposed method is effective for biomedical event extraction.
引用
收藏
页数:8
相关论文
共 18 条
  • [1] [Anonymous], 2011, P 2011 C EMPIRICAL M
  • [2] [Anonymous], 2013, P BIONLP SHAR TASK 2
  • [3] [Anonymous], 2007, P EMP METH NAT LANG
  • [4] Berger J., 2010, Proceedings of the Python for Scientific Computing Conference (SciPy), number Scipy, P1
  • [5] Bjorne J., 2013, Proceedings of the BioNLP Shared Task 2013 Workshop, P16
  • [6] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [7] Dependency-Based Word Embeddings
    Levy, Omer
    Goldberg, Yoav
    [J]. PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 302 - 308
  • [8] Ma MB, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, P174
  • [9] Mikolov T., 2013, ADV NEURAL INFORM PR, P3111
  • [10] Mikolov T, 2012, IEEE W SP LANG TECH, P234, DOI 10.1109/SLT.2012.6424228