Document-level relation extraction via graph transformer networks and temporal convolutional networks

被引:18
作者
Shi, Yong [2 ,4 ]
Xiao, Yang [1 ,2 ]
Quan, Pei [1 ,2 ]
Lei, MingLong [3 ]
Niu, Lingfeng [2 ,4 ]
机构
[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 101408, Peoples R China
[2] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China
[3] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[4] Univ Chinese Acad Sci, Sch Econ & Management, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Document-level relation extraction; Heterogeneous graph; Temporal convolutional networks; REPRESENTATION; MODEL;
D O I
10.1016/j.patrec.2021.06.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relation Extraction (RE) aims at extracting meaningful relation facts between entities in texts. It is an important semantic processing task in the field of natural language processing (NLP) and has many ap-plications. Traditional RE focuses on extracting entity relationships from a single input sentence. Recently, the research scope has been extended from sentence level to document level. However, compared with sentence-level RE, document-level RE, which needs to identify the inter-sentence relations from entities scattered in different sentences, is more complex and still lacks of solutions. To solve this problem, we propose a novel document-level RE method based on Heterogeneous Graph Neural Networks in this pa -per. Concretely, to obtain token embeddings containing long-distance dependency signals well, we encode the document with Temporal Convolutional Networks, whose dilated convolution and residual structure allow the effective and efficient preservation of historical information. To better describe the interaction between different elements, we construct the input documents as heterogeneous graphs with different node and edge types and utilize Graph Transformer Networks to generate semantic paths. Numerical experiments on two document-level biomedical datasets demonstrate the effectiveness of the proposed method. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:150 / 156
页数:7
相关论文
共 48 条
[41]   Heterogeneous Graph Attention Network [J].
Wang, Xiao ;
Ji, Houye ;
Shi, Chuan ;
Wang, Bai ;
Cui, Peng ;
Yu, P. ;
Ye, Yanfang .
WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, :2022-2032
[42]  
Wu Y, 2019, LECT N BIOINFORMAT, V11467, P272, DOI 10.1007/978-3-030-17083-7_17
[43]  
Yun S., 2019, ADV NEURAL INFORM PR, V32, P11983
[44]  
Zeng D., 2015, P 2015 C EMP METH NA, P1753
[45]  
Zeng Daojian, 2014, P COLING AUG
[46]  
Zeng W., 2017, P EMNLP, P1768
[47]   An effective neural model extracting document level chemical-induced disease relations from biomedical literature [J].
Zheng, Wei ;
Lin, Hongfei ;
Li, Zhiheng ;
Liu, Xiaoxia ;
Li, Zhengguang ;
Xu, Bo ;
Zhang, Yijia ;
Yang, Zhihao ;
Wang, Jian .
JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 83 :1-9
[48]   Exploiting syntactic and semantics information for chemical-disease relation extraction [J].
Zhou, Huiwei ;
Deng, Huijie ;
Chen, Long ;
Yang, Yunlong ;
Jia, Chen ;
Huang, Degen .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,