Distant supervision for relation extraction with hierarchical selective attention

被引:38
作者
Zhou, Peng [1 ,2 ]
Xu, Jiaming [1 ]
Qi, Zhenyu [1 ]
Bao, Hongyun [1 ]
Chen, Zhineng [1 ]
Xu, Bo [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
Relation extraction; Distant supervision; Hierarchical attention; Piecewise convolutional neural networks; NEURAL-NETWORKS;
D O I
10.1016/j.neunet.2018.08.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distant supervised relation extraction is an important task in the field of natural language processing. There are two main shortcomings for most state-of-the-art methods. One is that they take all sentences of an entity pair as input, which would result in a large computational cost. But in fact, few of most relevant sentences are enough to recognize the relation of an entity pair. To tackle these problems, we propose a novel hierarchical selective attention network for relation extraction under distant supervision. Our model first selects most relevant sentences by taking coarse sentence-level attention on all sentences of an entity pair and then employs word-level attention to construct sentence representations and fine sentence-level attention to aggregate these sentence representations. Experimental results on a widely used dataset demonstrate that our method performs significantly better than most of existing methods. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:240 / 247
页数:8
相关论文
共 30 条
  • [1] Agichtein Eugene, 2008, Proceedings of the 17th International Conference on World Wide Web, P467
  • [2] [Anonymous], 16 CHIN NAT C COMP L
  • [3] [Anonymous], 2013, ADV NEURAL INF PROCE
  • [4] [Anonymous], 2012, COMPUTER ENCE
  • [5] [Anonymous], 2005, P 43 ANN M ASS COMP
  • [6] Bahdanau D., 2015, Neural machine translation
  • [7] A neural probabilistic language model
    Bengio, Y
    Ducharme, R
    Vincent, P
    Jauvin, C
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) : 1137 - 1155
  • [8] Reading Wikipedia to Answer Open-Domain Questions
    Chen, Danqi
    Fisch, Adam
    Weston, Jason
    Bordes, Antoine
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1870 - 1879
  • [9] Chorowski J, 2015, ADV NEUR IN, V28
  • [10] Ciresan D, 2012, PROC CVPR IEEE, P3642, DOI 10.1109/CVPR.2012.6248110